-
Notifications
You must be signed in to change notification settings - Fork 349
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Production - [Alerting] Build Analysis: Exceptions and Errors Alert #12954
Comments
The alert description mentioned this is usually for catastrophic/unexpected errors, so I took a look in case there was an outage: We're throwing non-stop exceptions trying to parse the known issue in dotnet/runtime#83655
I escaped the A few questions on this:
|
FIgured this one out at least. Initially this issue had an error message which got the hits, and this only started failing after the issue was switched to an errorPattern that uses the regex. |
@AlitzelMendez do you think we should turn this into a warning instead of an error when and invalid pattern occurs? |
I believe I initially mentioned I thought this should be an error, since we wouldn't have any way of knowing the service is failing to process these if they were warnings. I'm fine with downgrading if a single bad issue is enough to trigger a critical alert. |
We have stopped throwing the errors as of the fixed regex. Waiting until the alert clears. |
I feel that errors should incur alerts if there's something wrong on our side of things, but malformed regex provided by the user shouldn't meet that bar. |
https://dev.azure.com/dnceng/internal/_git/dotnet-helix-service/pullrequest/30231?_a=files to downgrade this to a warning and stop logging the exception. |
💔 Metric state changed to alerting
|
💚 Metric state changed to ok |
💔 Metric state changed to alerting
|
💚 Metric state changed to ok |
💔 Metric state changed to alerting
Go to rule
@dotnet/dnceng, please investigate
Release Note Category
Release Note Description
No need for release notes
Automation information below, do not change
Grafana-Automated-Alert-Id-6fe0b7b34a004f0bad0064a42f9b9135
The text was updated successfully, but these errors were encountered: