-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
runc-facade squashes rare runc errors #12365
Comments
Thanks @utam0k!
How long, or how many times should runc-facade do retry before it gives up? Are there any scenarios where we would NOT want to retry, because retrying would be disruptive to the user experience? |
At least one time. Even so, it should be effective enough.
Unfortunately, when the user application gets failed in starting a container, runc-facade tried to retry. Ideally, we have to distinguish between the error from runc or a user application, but we can't now. So in case of user application failure, we will have to retry in vain.
|
@utam0k I removed this from breakdown, but, as you shared it will only take 30m, please feel free to assign yourself and set status toIn-Progress. If it ends up taking more time, please move back to our inbox (no status). |
@utam0k |
Is your feature request related to a problem? Please describe
This issue came from #9247 (comment)
Rarely runc will cause an error due to something (probably seccomp notify). It is very likely that these errors are difficult to resolve at the root. Therefore, runc-facade is used to retry and squash these errors.
This problem still happens recently, the gcp log
Solution
Please give it a retry when runc returns any error around here
gitpod/components/docker-up/runc-facade/main.go
Lines 89 to 92 in 52848f6
How to reproduce
Open this repository
https://github.com/spearki/gitpod-runc-issue-repro
Describe the behaviour you'd like
Wrote above
NOTE
The text was updated successfully, but these errors were encountered: