-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
testing: failed to start rpm-ostree daemon after auto-update to 31.20191127.1 #322
Comments
Ouch. Hmm OK, I think I see what's going on here. In coreos/fedora-coreos-config#155, we switched from using static boot mount units to creating it from a generator. Which is fine, except that starting from coreos/fedora-coreos-config#219, we made those boot mounts now be conditional on So this probably bricked all the nodes that fall in that bucket. 😢 This is why #228 is crucial. That said, I'm not sure if there's much we can do here. We'll probably just have to send a notice that updates broke and users that provisioned from versions older than X (I think 30.20191014.0) will have to reprovision? |
We fully rolled out `31.20191127.1` on `testing`, but it looks like auto-updating from some old releases might not be safe. Temporarily hiding this rollout while we investigate coreos/fedora-coreos-tracker#322.
We fully rolled out `31.20191127.1` on `testing`, but it looks like auto-updating from some old releases might not be safe. Temporarily hiding this rollout while we investigate coreos/fedora-coreos-tracker#322.
great.. I tested from |
Just look at `/etc/fstab` for `/boot` mounts to determine whether to generate them ourselves or let the `systemd-fstab-generator` do it. The `grep` stuff here is a bit gory but it should work for our purposes. The next step up would be actually parsing it using the appropriate APIs in a compiled language. See also: coreos/fedora-coreos-tracker#322
Just look at `/etc/fstab` for `/boot` mounts to determine whether to generate them ourselves or let the `systemd-fstab-generator` do it. See also: coreos/fedora-coreos-tracker#322
Just look at `/etc/fstab` for `/boot` mounts to determine whether to generate them ourselves or let the `systemd-fstab-generator` do it. See also: coreos/fedora-coreos-tracker#322
Just look at `/etc/fstab` for `/boot` mounts to determine whether to generate them ourselves or let the `systemd-fstab-generator` do it. See also: coreos/fedora-coreos-tracker#322
If anyone hits this, the workaround is just:
Then upgrade and reboot. |
"bricked" is a strong word here - yes, this didn't meet the quality bar I set for myself and our team, but there's a one liner workaround. I'd say this broke ability to receive new updates by default - "bricked" I think is more when the system cannot be recovered even manually. |
Agreed. To be clear, I think it's inevitable that we're going to hit these kinds of really subtle issues. There's a lot of "undeclared" dependencies between cosa, FCOS, and the pipeline. The exercise in having to "rewind the clock" to do another f30 release also demonstrated this. In this particular instance, I don't think the problem was we didn't think hard enough. It was (and still is) that we have no coverage for upgrade testing. |
Just look at `/etc/fstab` for `/boot` mounts to determine whether to generate them ourselves or let the `systemd-fstab-generator` do it. See also: coreos/fedora-coreos-tracker#322
coreos/fedora-coreos-config#247 is merged now. Was chatting with @lucab; instead of doing another async release, let's just pull forward the next release a bit and roll it in -- so e.g. early next week? |
thanks for posting this and the work around. It fixed my box. cheers |
This is obvious because |
We released Nodes that are stuck require manually intervention to proceed with upgrades.
|
My canary node updated from
30.20191014.1
to31.20191127.1
and properly rebooted into it; however it does not seem to be very healthy.rpm-ostree daemon is not available anymore, as it fails to start with:
Indeed,
/boot
is empty:For some unknown reason, it looks like this machine ended up without a
boot.mount
unit:Additionally, any node that ends up in this situation will need manual intervention. Zincati cannot auto-update out of it, as rpm-ostreed is not available.
This is a node on platform
aws
which was born on version30.20191002.0
, and up to this version it was updating without issues.The text was updated successfully, but these errors were encountered: