-
Notifications
You must be signed in to change notification settings - Fork 86
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[v20.10.1] dockerd panics when configuring default runtime #1169
Comments
In case you're wondering why we want to change the default runtime: https://github.com/NVIDIA/k8s-device-plugin
|
Thanks for reporting; looks like it's panicking here; https://github.com/moby/moby/blob/v20.10.1/plugin/executor/containerd/containerd.go#L71 Looks related to moby/moby@f63f73a (part of moby/moby#41182) /cc @cpuguy83 PTAL |
moby/moby#41854 Should take care of this. |
I ran into this exact same issue, but I had not customized the default runtime. I was running Docker 20.10.1 on Ubuntu 20.04 LTS. Everything had been running fine on the machine for about 24 hours, and suddenly |
Sweet, thanks a lot! Is there a way I can test this? |
Our instances upgraded to 20.10.2 and the issue indeed seems to have been resolved, thanks! |
Apparently 20.10.3 broke this again. Please reopen the issue :) |
Docker 20.10.3 was a security release. Security releases usually only contain security fixes, and won't be combined with other fixes; the fix is being back ported for docker 20.10.4 (moby/moby#41974) |
Thanks for the info! |
Expected behavior
The expected behavior is that Docker starts successfully if you set the default runtime.
Actual behavior
Docker fails to start.
Steps to reproduce the behavior
Install Docker CE on Ubuntu 18.04 as per the official docs.
Observe that the
docker
service starts successfully withsystemctl status docker.service
.Set up
/etc/docker/daemon.json
as per the nvidia-container-runtime docs:Restart the service with
systemctl restart docker.service
and observe its failure:Check the details with
journalctl -u docker
:Now modify
daemon.json
to exclude thedefault-runtime
setting:Restart Docker with
systemctl restart docker.service
and observe that it starts successfully. If you put back the default runtime setting again, it fails again.At this point you'll possibly also realize that Docker tries to restart itself too often, so even if you remove the the default runtime, you might not be able to restart Docker right away, because systemd blocks it for a while:
(Which hightlights another problem, that
RestartSec=2
should definitely be increased to 15 seconds or so in/lib/systemd/system/docker.service
, but this ticket is not about that, I just want to point it out so that you don't run into this, as I did. Always check the failure cause withjournalctl -fu docker.service
before concluding that your config is wrong.)Now that you have successfully started Docker with the runtime being defined but without being set as the default, confirm that the runtime is actually operable when set explicitly during use:
Check for the available
docker-ce
versions:Try downgrading to 20.10.0:
And observe that the Docker service still fails to start with the default runtime set. Downgrade to 19.03.14:
And observe that the Docker service starts successfully even with the default runtime set. Now your container output will behave as it should:
Instead of the previous
bash: nvidia-smi: command not found
error.We've been using this configuration for over a year now. To me it seems like a regression in Docker CE 20. Please advise.
Output of
docker version
:Output of
docker info
:Additional environment details (AWS, VirtualBox, physical, etc.)
Ubuntu 18.04.5 with all updates installed, on several physical computers.
The text was updated successfully, but these errors were encountered: