Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Staging - [Alerting] Servicing Builds Running in R&D Pools Alert #10580

Closed
dotnet-eng-status-staging bot opened this issue Aug 25, 2022 · 5 comments
Closed
Assignees
Labels
Grafana Alert Issues opened by Grafana Inactive Alert Issues from Grafana alerts that are now "OK" Ops - First Responder Staging Tied to the Staging environment (as opposed to Production)

Comments

@dotnet-eng-status-staging
Copy link

💔 Metric state changed to alerting

Please run the following query to identify the delinquent pipelines:

TimelineBuilds
| where QueueTime > ago(1d)
| where SourceBranch has "release"
| join kind=inner (TimelineRecords
                    | where Order != 0
                    | where strlen(Path) == 11
                    | where WorkerName has "NetCore1ESPool"
                    | where not(WorkerName has "-Svc")
                    | summarize arg_min(WorkerName, *) by RecordId, BuildId ) on BuildId
  • count_ 2837

Metric Graph

Go to rule

@dotnet/dnceng, please investigate

Automation information below, do not change

Grafana-Automated-Alert-Id-a5641c4a6d8e4e499f1710aa8386d81b

@dotnet-eng-status-staging dotnet-eng-status-staging bot added Active Alert Issues from Grafana alerts that are now active Ops - First Responder Grafana Alert Issues opened by Grafana Staging Tied to the Staging environment (as opposed to Production) labels Aug 25, 2022
@MattGal
Copy link
Member

MattGal commented Aug 25, 2022

I'm working on this via #10578.

@MattGal MattGal self-assigned this Aug 25, 2022
@dotnet-eng-status-staging dotnet-eng-status-staging bot added Inactive Alert Issues from Grafana alerts that are now "OK" and removed Active Alert Issues from Grafana alerts that are now active labels Sep 4, 2022
@dotnet-eng-status-staging
Copy link
Author

💚 Metric state changed to ok

Please run the following query to identify the delinquent pipelines:

TimelineBuilds
| where QueueTime > ago(1d)
| where SourceBranch has "release"
| join kind=inner (TimelineRecords
                    | where Order != 0
                    | where strlen(Path) == 11
                    | where WorkerName has "NetCore1ESPool"
                    | where not(WorkerName has "-Svc")
                    | summarize arg_min(WorkerName, *) by RecordId, BuildId ) on BuildId

Metric Graph

Go to rule

@dotnet-eng-status-staging dotnet-eng-status-staging bot added Active Alert Issues from Grafana alerts that are now active and removed Inactive Alert Issues from Grafana alerts that are now "OK" labels Sep 4, 2022
@dotnet-eng-status-staging
Copy link
Author

💔 Metric state changed to alerting

Please run the following query to identify the delinquent pipelines:

TimelineBuilds
| where QueueTime > ago(1d)
| where SourceBranch has "release"
| join kind=inner (TimelineRecords
                    | where Order != 0
                    | where strlen(Path) == 11
                    | where WorkerName has "NetCore1ESPool"
                    | where not(WorkerName has "-Svc")
                    | summarize arg_min(WorkerName, *) by RecordId, BuildId ) on BuildId
  • count_ 7

Metric Graph

Go to rule

@dotnet-eng-status-staging dotnet-eng-status-staging bot added Inactive Alert Issues from Grafana alerts that are now "OK" and removed Active Alert Issues from Grafana alerts that are now active labels Sep 18, 2022
@dotnet-eng-status-staging
Copy link
Author

💚 Metric state changed to ok

Please run the following query to identify the delinquent pipelines:

TimelineBuilds
| where QueueTime > ago(1d)
| where SourceBranch has "release"
| join kind=inner (TimelineRecords
                    | where Order != 0
                    | where strlen(Path) == 11
                    | where WorkerName has "NetCore1ESPool"
                    | where not(WorkerName has "-Svc")
                    | summarize arg_min(WorkerName, *) by RecordId, BuildId ) on BuildId

Metric Graph

Go to rule

@MattGal
Copy link
Member

MattGal commented Sep 19, 2022

Wow it's clear, as I still have #10578, closing this. (will likely reactivate)

@MattGal MattGal closed this as completed Sep 19, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Grafana Alert Issues opened by Grafana Inactive Alert Issues from Grafana alerts that are now "OK" Ops - First Responder Staging Tied to the Staging environment (as opposed to Production)
Projects
None yet
Development

No branches or pull requests

1 participant