Do not transition servers into the healthy state until at least one healthy probe is detected.#2415
Merged
MihaZupan merged 1 commit intodotnet:mainfrom Feb 27, 2024
Merged
Conversation
MihaZupan
reviewed
Feb 27, 2024
Member
MihaZupan
left a comment
There was a problem hiding this comment.
Thank you for contributing here.
I think the behavior we want here is to keep the destination health as Unknown in this case until we see enough failures - see #2079 (comment).
We should have the necesarry bookkeeping in place already. We can pass the current destination health to EvaluateHealthState and then change the failure condition like so
-newHealth = currentFailureCount < threshold ? DestinationHealth.Healthy : DestinationHealth.Unhealthy;
+newHealth = currentFailureCount < threshold ? previousHealth : DestinationHealth.Unhealthy;…ation's previous active health state as the next health state. This has the effect of a destination with a health state of Unknown in the unknown state until the threshold is reached, at which time the destination transitions into Unhealthy.
2bb1288 to
2d95f7c
Compare
Contributor
Author
|
By simply using the previous state, the change can be smaller. The changes to AtomicCounter (and those tests) are no longer required. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Fixes #2079
This fix is sort of a hack, but does not add any additional book keeping or change the health model while still yielding the desired behavior.By simply initializing the failure count to the failure threshold, the first health probe will either keep the destination unhealthy, or it will swap to healthy just like before.Use the destination's previous state to inform the next state if we are below the failure theshold. This will prevent new destinations with slower server startup times from erroneously tranisitioning into healthy temporarily until the unhealthy threshold is eventually reached.
Unknownwill continue to be unknown until either the failure threshold is reached (=> Unhealthy) or a successful probe occurs (=> Healthy). This scenario is the patch.