-
Notifications
You must be signed in to change notification settings - Fork 3.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Edge runs on Azure Pipeline 'queued' #21692
Comments
https://dev.azure.com/web-platform-tests/_settings/agentpools shows 147 queued jobs: It does look like there are 25 jobs running, but it'll presumably take a look time to catch up. |
That presumes they are making progress. Can you check the 25 jobs to see if any of them have non-queued Edge runs? (I don't have access to that link) |
Looks like the failures started to happen on Friday morning with this job: And after that there were no successful runs until we rebuilt the VMs on Sunday with our automated jobs. I see runs progressing now, so it will take some time to catch up, but runs are progressing. Since VMs got rebuilt, I can't debug how they got into this state, but will send mail to Azure Pipelines folks to see if they have any additional data from the agent logs. I'm seeing errors like this on the agents that were running that job: ##[error]We stopped hearing from agent w10c00000O. Verify the agent machine is running and has a healthy network connection. Anything that terminates an agent process, starves it for CPU, or blocks its network access can cause this error. For more information, see: https://go.microsoft.com/fwlink/?linkid=846610 |
Reopening as I'm seeing jobs queuing up again (https://dev.azure.com/web-platform-tests/wpt/_build?definitionId=1&repositoryFilter=1&branchFilter=1091). Last successful run was 3 days ago (Sat. Feb 15). Apologies if this isn't the same issue. |
The root cause of this issue is different, but the end result is the same. We use Azure service principal to manage VMs and the password for that account has expired. Working on renewing it now and will kick off a job to re-generate Windows VMs. I'll add a few extra VMs to help increase the number of runs that we can process and hopefully deal with that backlog faster. |
VMs are back up and running, let me know if you see any other issues. |
Looking good last few days, reclosing. Thanks! |
Whilst investigating #21691 I noticed that epochs/daily and epochs/three_hourly have also been stuck since Friday. They all have pending jobs that are queued waiting for Edge {Stable, Dev}.
epochs/daily, and example run
epochs/three_hourly and example run
Assigning to @mustjab as this seems to definitely be Azure Pipelines problems
The text was updated successfully, but these errors were encountered: