Don't loop forever on errors writing installinator slots #9471

labbott · 2025-12-04T15:27:18Z

Currently if there's a permanent error while writing to the M.2 drives we may loop/retry forever. This isn't great behavior so attempt to break if it looks like we aren't making progress writing.

labbott · 2025-12-04T15:31:02Z

I found this while working on something else, I had a bad check for a create_dir error and the unit test looped forever. I tried to turn this into a smaller unit test but that was difficult to do with the current setup and the one I tried hit a different infinite loop in installinator.

jgallagher · 2025-12-04T19:57:09Z

installinator/src/write.rs

-            if success_this_iter == self.drives.len() || success_prev_iter > 0 {
+            // 3. We had the same number of successes as the previous iteration,
+            //    which implies that we seem to be permanetly stuck and unlikely
+            //    to succeed


Hmm, this is a pretty significant behavior change to the real installinator, right? I think the intent was to loop forever if we're not having success, because there's no way to restart this on failure other than aborting the entire mupdate and starting over.

That's pretty terrible for tests, though. I wonder if we should have a cap on the number of attempts (which this code implicitly has, I think, with the cap set to "2"), and allow prod to pass a cap of "loop forever" while tests can pass something much smaller?

yeah we're getting into "timeouts timeouts always wrong" territory here. I don't actually mind looping forever for tests because that is a sign I need to fix something because tests should finish. Looping forever in production actually seems worse though if there's some kind of permanent error we'll just be stuck which seems bad. Maybe wicket will give more information than tests though? If we're actually okay with the current behavior I can also just close this.

Currently if there's a permanent error while writing to the M.2 drives we may loop/retry forever. This isn't great behavior so attempt to break if it looks like we aren't making progress writing.

labbott force-pushed the installinator_stop_loop branch from 69d14eb to 410c0ae Compare December 4, 2025 15:50

labbott requested review from jgallagher and sunshowers December 4, 2025 18:24

jgallagher reviewed Dec 4, 2025

View reviewed changes

Don't loop forever on errors writing installinator slots

410c0ae

Currently if there's a permanent error while writing to the M.2 drives we may loop/retry forever. This isn't great behavior so attempt to break if it looks like we aren't making progress writing.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Don't loop forever on errors writing installinator slots #9471

Don't loop forever on errors writing installinator slots #9471

Uh oh!

labbott commented Dec 4, 2025

Uh oh!

labbott commented Dec 4, 2025

Uh oh!

jgallagher Dec 4, 2025

Uh oh!

labbott Dec 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Don't loop forever on errors writing installinator slots #9471

Are you sure you want to change the base?

Don't loop forever on errors writing installinator slots #9471

Uh oh!

Conversation

labbott commented Dec 4, 2025

Uh oh!

labbott commented Dec 4, 2025

Uh oh!

jgallagher Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

labbott Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants