Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Avalanche liveness health checks #1287

Merged
merged 3 commits into from
Apr 3, 2023

Conversation

StephenButtolph
Copy link
Contributor

Why this should be merged

Blocks/vertices/transactions hanging in processing is often the first indication of network instability. We already become unhealthy if blocks are slow to process.

How this works

Adds alerting if vertices or transactions remain in processing for longer than expected.

How this was tested

CI

@StephenButtolph StephenButtolph added monitoring This primarily focuses on logs, metrics, and/or tracing incident response labels Apr 3, 2023
@StephenButtolph StephenButtolph added this to the v1.10.0 (Cortina) milestone Apr 3, 2023
@StephenButtolph StephenButtolph self-assigned this Apr 3, 2023
Copy link

@danlaine danlaine left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

naming nits. Otherwise LGTM

@StephenButtolph StephenButtolph merged commit 5f45fe5 into dev Apr 3, 2023
@StephenButtolph StephenButtolph deleted the add-avalanche-liveness-health-checks branch April 3, 2023 21:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
incident response monitoring This primarily focuses on logs, metrics, and/or tracing
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants