-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Prebuilds are stuck in 'queued' #9395
Comments
The cases have in common that the phase is Unfortunately, I couldn't find any signs in the log of workspace-cluster of these workspace instances. Also weren't lucky with finding traces in honeycomb (I checked 20 instances) 😞 . |
This sounds related to a fix @andrew-farries is working on here. At some point we seem to have started using @andrew-farries Can you make sure we introduce separate timeouts for "building", and start out with the same value as "preparing"? |
Yes, that explains why they are no signs in workspace-cluster the workspaces are probably building docker images and timing out in the meantime (after 10min). |
Suspected example of affected user (internal) |
Having had a look at the data we indeed leak a couple of prebuilds in
|
Bug description
The DB contains +6k entries of prebuilds in 'queued' state. Would be good to understand what happens, clean things up and prevent these from being created further.
Updating prebuilt.state on failed prebuild start
During this incident we discovered that there seems to be an ongoing issue with updating state for prebuilds whose instances never reach
RUNNING
phase (note the emptystartedTime
):Note: Especially older entries could also be fallout from other incidents + cleanups, but there seems to be a couple of fresh ones every day.
Steps to reproduce
select * from d_b_prebuilt_workspace where state = 'queued' order by creationTime;
The text was updated successfully, but these errors were encountered: