-
Notifications
You must be signed in to change notification settings - Fork 86
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Restarting containerd leaves dockerd stopped #1155
Comments
30+ reports in my reddit thread and friends are talking about it too... |
Isn’t this how systemd unit dependencies work? A depends on B. You stop B, A is also stopped. If you start B, A does not start again automatically. |
I don't know but it shouldn't nuke docker in production wtf. This is a Lts image |
Were these all unattended updates, or also manual updates? |
My suspicion is the unattended updates noticed the issue since they run often enough to get only an update to containerd, and when it fails it's overnight leading to a long outage or page. |
I tried updating containerd manually on a machine, and docker was restarted successfully; I recall we received reports in the past where unattended upgrades didn't always result in the same, so was curious if that was the difference here |
This also happened to me via unattended upgrades on Ubuntu. These two lines make me think the upgrade somehow killed Docker.
|
I'm also failing to recreate this in the lab with manual upgrades and downgrades of containerd. @Sloth-on-meth @chdsbd can you provide your own output of |
@sudo-bmitch sorry, i updated all my docker installs to the docker repo and uninstalled the Ubuntu repo versions after this happened, someone on reddit said that might prevent this in the future |
Faced the same problem today and back in mid-October. Caused a massive outage back in October but less so today (only because of separate HA mitigations I put in place). Had this issue on AWS EC2 instances running Ubuntu 20.04. |
@Sloth-on-meth that confirms my suspicion that this could be a Ubuntu issue rather than a Docker one. I'll leave this issue open for a bit in case others have logs showing it happened with the Docker packages. Otherwise, anyone with the issue should follow up with Canonical. Searching their bug tracker I'm seeing the following: https://bugs.launchpad.net/ubuntu/+source/containerd/+bug/1870514 |
@vincent-heatseekr see the debugging details requested above, and verify whether you are installing the docker packages from the ubuntu or the docker repositories. |
@sudo-bmitch I woke up this morning, on my Ubuntu machine, docker daemon was down. It was installed from Ubuntu repositories. On my Amazon Linux EC2, no issues. Think you may be on to something. |
@sudo-bmitch the same problem. Output attached. |
We experienced the same issue in a corporate environment with production services. None of our docker machines that got updated unattended survived the restart. |
@glader thanks. That shows docker installed from the Ubuntu repositories, rather than the upstream Docker ones. Seems consistent with other reports so I'm going to close the issue here since there's nothing for Docker to fix, and they've already provided a solution for users experiencing this issue. |
For anyone interested, the issue is now on the launchpad tracker at CRITICAL priority. https://bugs.launchpad.net/ubuntu/+source/containerd/+bug/1870514 |
/cc @chris-crone @glours @RomainBelorgey FYI: this was the issue we were discussing Yesterday |
I see https://bugs.launchpad.net/ubuntu/+source/containerd/+bug/1870514 also mentions docker/docker-ce-packaging#508 (I was eyeing that fix as well as possibly related); however
I opened a backport for the 19.03 branch to get the |
Let me ask here in case someone knows; is there a way to manually trigger unattended updates? (also; are they only performed for distro-packages or also third-party package repositories)? If course would be great if we could verify the behaviour of unattended upgrades, but, well, them being "unattended" makes it slightly difficult 😅 |
@thaJeztah I ran a
So I believe the update must be a security update from the Ubuntu repos to trigger the unattended update. This was with containerd back versioned, but installed from the Docker repos, so the update was available, just not from Ubuntu:
|
Thank you, @sudo-bmitch ! That's useful information, so looks indeed that these only are triggered for packages from those repositories. That makes docker/docker-ce-packaging#511 "good to have", but in practice, users wouldn't run into the same problem if they installed the official docker packages from download.docker.com (not the distro-maintained packages of containerd/docker) |
Expected behavior
After a containerd upgrade, the docker daemon should be running.
Actual behavior
If you restart containerd, dockerd remains down.
Steps to reproduce the behavior
Output of
docker version
:Output of
docker info
:Additional environment details (AWS, VirtualBox, physical, etc.)
Above is from a small test VM. I'm posting this after seeing:
The cause from the above I suspect to be an update to containerd without a corresponding update to the docker daemon that would restart both services.
The text was updated successfully, but these errors were encountered: