
Graceful controller shutdown #573

Merged (4 commits) on Jul 2, 2021

Conversation

@nightkr (Member) commented Jun 30, 2021

Fixes #552

@nightkr nightkr added the runtime controller runtime related label Jun 30, 2021
@nightkr nightkr requested review from clux and kazk June 30, 2021 00:06
Comment on lines +8 to +9
- BREAKING: `controller::applier` now starts a graceful shutdown when the `queue` terminates
- BREAKING: `scheduler` now shuts down immediately when `requests` terminates, rather than waiting for the pending reconciliations to drain
Member Author:

I suspect these might have some impact on people's tests

@clux (Member) left a comment:

Have approved because it's ultimately clean and ready to use in its current form.

Left some comments here and there, nothing block-worthy. But maybe there is a better default path we can take by letting users auto-install a SIGTERM handler.

```rust
Controller::new(cmgs, ListParams::default())
    .owns(cms, ListParams::default())
    .reconcile_all_on(reload_rx.map(|_| ()))
    .graceful_shutdown_on(graceful_shutdown_rx.map(|_| ()))
```
Member:

While I think it's great that we are able to expose this, it's also a bit boilerplatey for the standard use case. If we are targeting a Kubernetes-deployed controller, then the shutdown signal is always going to be SIGTERM.

I think the method here makes complete sense for configurability, but maybe we could also have a Controller::install_sigterm_handler() method that sets up the same thing under the hood using tokio::signal inside the Controller's building step?

Member Author:

How about shutdown_on_sigint or shutdown_on_ctrl_c?

Member:

I feel the ctrl_c and sigint naming is strange if we are putting this inside Kubernetes, which sends SIGTERM. But it looks like we need to use SIGINT locally?

Member Author:

Yeah, in-cluster (or when running under systemd or similar) we want SIGTERM; when running from cargo we want SIGINT (or whatever the Windows equivalent is). We should be safe if we just treat the two as equivalent.
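Treating the two signals as equivalent can be wired up with tokio::signal. This is a sketch only: it assumes the tokio crate with its "signal" and "macros" features enabled, and the `shutdown_signal` helper name is ours, not a kube-runtime API.

```rust
/// Resolves when either SIGINT (Ctrl-C) or SIGTERM arrives, so the same
/// future works both for `cargo run` locally and for in-cluster pods.
async fn shutdown_signal() {
    #[cfg(unix)]
    {
        use tokio::signal::unix::{signal, SignalKind};
        let mut sigterm =
            signal(SignalKind::terminate()).expect("failed to install SIGTERM handler");
        tokio::select! {
            _ = tokio::signal::ctrl_c() => {} // SIGINT, e.g. Ctrl-C from `cargo run`
            _ = sigterm.recv() => {}          // SIGTERM, as Kubernetes sends on pod shutdown
        }
    }
    #[cfg(not(unix))]
    {
        // Windows has no SIGTERM; Ctrl-C is the only signal we can wait on.
        let _ = tokio::signal::ctrl_c().await;
    }
}
```

The resulting future could then be fed to something like the `graceful_shutdown_on` hook shown earlier.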

Member:

Yeah, if we can get tokio::signal to listen on both, then that would be ideal.

Member:

How about Controller::manage_termination_signals(), making it trigger on either SIGINT or SIGTERM?

Interestingly, actix-web has an opt-out for this scenario. That might also be a direction to take eventually, but it's probably too early.

Member Author (@nightkr), Jul 1, 2021:

IMO installing signal handlers without asking for permission is a bit presumptuous if we aren't sure that we own the whole process. If we had a #[kube_runtime::main] or similar I'd be for that doing it, but not in the current state.

Member Author:

Added a helper now for managing the signal handling for you.

examples/configmapgen_controller.rs (outdated, resolved)
```rust
        }
    })
    .boxed(),
    forceful_shutdown.boxed(),
```
Member:

Although you do seem to need a select on the controller future vs. the shutdown future. That feels a bit subtle. Do you not want the controller to complete its outstanding items?

Member Author:

That's the difference. The first ctrl+c initiates the graceful shutdown (by resolving graceful_shutdown_rx), the second means that we just want the process to die die die ASAP.
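The two-stage behaviour can be modeled as a tiny state machine. This is a toy illustration only, not kube-runtime API; `Mode` and `handle_signal` are names we made up:

```rust
/// Toy model of the example's Ctrl-C handling: the first signal requests
/// a graceful shutdown, any further signal means "die ASAP".
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Mode {
    Running,
    Graceful,
    Forceful,
}

fn handle_signal(mode: Mode) -> Mode {
    match mode {
        // First Ctrl-C: stop taking new work, let in-flight reconciliations finish.
        Mode::Running => Mode::Graceful,
        // Second (or later) Ctrl-C: the real example calls std::process::exit here.
        _ => Mode::Forceful,
    }
}

fn main() {
    let mut mode = Mode::Running;
    mode = handle_signal(mode);
    assert_eq!(mode, Mode::Graceful);
    mode = handle_signal(mode);
    assert_eq!(mode, Mode::Forceful);
    println!("final mode: {:?}", mode);
}
```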

Member:

Do we need to codify the latter into the app? Kubernetes will give us SIGTERM, wait 30s, then send a SIGKILL. We don't really get to do anything after the SIGKILL, so it might not be worth trying to handle it.

Member Author:

When running in-cluster, you're correct (though that actually raises the problem that iirc the example only listens for SIGINT, not SIGTERM).

When running locally, well, it's annoying to have to switch tab and pkill.

Member:

Right, we don't have a way of propagating a forceful termination within the applier. Got it. So for the example we need the full setup.

Member Author:

> Right, we don't have a way of propagating a forceful termination within the applier. Got it. So for the example we need the full setup.

Yeah, the whole point of the forceful termination would be to bypass everything and burn it to the ground.

> Wait, actually, do we need the forceful_shutdown future? The async block that creates it is doing process::exit on the second Ctrl-C? Wouldn't that just stop everything?

We need something that waits for the second signal, but doesn't trigger for the first. That means that we need to keep them in the same "path", so to speak.

The std::process::exit is also not strictly necessary; there's a difference between one kind of "forceful" and another. Essentially, you could say that there are six "levels" of grace that we could potentially implement:

  1. The Kronblom shutdown: when we initiate a shutdown, stop taking new scheduling requests, but let all currently scheduled reconciliations run and finish
    • This is what scheduler currently implements in master (before this PR), but due to applier's circular nature this isn't actually usable in applier anyway (since it doesn't have the cutoff that this PR implements)
    • Depending on whether we still allow retries to be scheduled, this may never terminate
  2. The slightly overcooked shutdown: like the above, but only let currently pending reconciliations run (that is, they have already expired, but haven't started yet for whatever reason) while dropping reconciliations that are scheduled into the future
  3. The graceful shutdown: wait for all running reconciliations to finish, but do not start any new ones
    • This is what this PR calls a "graceful" shutdown
  4. The forceful shutdown: abort all currently running reconciliations, but wait for them to cancel orderly (essentially: wait for them to hit the next .await)
  5. The Brütal shutdown: std::process::exit
    • This is what the example calls a "forceful" shutdown
  6. The Spın̈al shutdown: you didn't need that computer anyway, did you?

From this list, this PR adds support for the graceful shutdown, while the forceful, Brütal, and Spın̈al shutdowns were already supported (kind of unavoidably :P) but undocumented. The example currently uses a Brütal shutdown (which was mostly a vestige from tokio::io::stdin using an uncancellable background worker task), but could be downgraded to a forceful shutdown.

The Kronblom and overcooked shutdowns would (IMO) mostly be useful for testing runtime internals, and this PR replaces those cases with sleeps (which are collapsed by tokio's testing mode anyway).
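For illustration, the taxonomy above could be captured as an ordered enum. This is purely illustrative; none of these names exist in kube-runtime:

```rust
/// The six "levels of grace" from the comment above, ordered from most
/// patient to most destructive.
#[derive(Clone, Copy, Debug, PartialEq, Eq, PartialOrd, Ord)]
enum ShutdownLevel {
    Kronblom,           // finish everything scheduled, even future retries
    SlightlyOvercooked, // run only already-pending reconciliations
    Graceful,           // finish running reconciliations, start no new ones (this PR)
    Forceful,           // cancel running reconciliations at the next .await
    Bruetal,            // std::process::exit
    Spinal,             // you didn't need that computer anyway
}

/// Which levels the runtime can express after this PR: Graceful is newly
/// wired into applier; the harsher levels were always (unavoidably) possible.
fn supported(level: ShutdownLevel) -> bool {
    level >= ShutdownLevel::Graceful
}

fn main() {
    assert!(ShutdownLevel::Kronblom < ShutdownLevel::Spinal);
    assert!(supported(ShutdownLevel::Graceful));
    assert!(!supported(ShutdownLevel::SlightlyOvercooked));
    println!("graceful supported: {}", supported(ShutdownLevel::Graceful));
}
```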

Member:

Hah. That's a solid classification 🤘

Yeah, that sounds sensible. I think the graceful and the brutal ones are likely the most useful ones for us (maybe forceful as well, and as you say, overcooked ones for testing).

Everything in the PR so far looks sensible to me. But I'm still a bit unsure about the main example here:

If we are currently in a brutal scenario, what good does the last select! in main do? If you removed the process::exit and instead deferred to the forceful_shutdown, which I assume is intended to trigger at the end of the async double-Ctrl-C wait scope, then that's just immediately triggered in that last select! instead, right? I don't think it would functionally cause any different behaviour to avoid process::exit. Or am I misunderstanding?

Member Author:

> I don't think it would functionally cause any different behaviour to avoid process::exit

Controller tries to abort all reconciliations when dropped, and #[tokio::main] waits for all spawned tasks to finish before exiting the process after the main function returns.

This combines to give you a forceful shutdown, rather than brutal (according to the previous chart :P).

> If we are currently in a brutal scenario, what good does the last select! in main do?

Regardless of whether we call std::process::exit in it, something needs to poll it for it to actually do anything. And we can't just spawn it, since that would keep the graceful shutdown waiting for it.
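The point that a future's side effects only happen when something polls it can be demonstrated with std alone, by polling a hand-rolled future with a no-op waker (illustrative sketch; all names here are ours):

```rust
use std::cell::Cell;
use std::future::Future;
use std::pin::Pin;
use std::rc::Rc;
use std::task::{Context, Poll, RawWaker, RawWakerVTable, Waker};

// A future whose side effect (setting a flag) happens only when polled.
struct SideEffect {
    fired: Rc<Cell<bool>>,
}

impl Future for SideEffect {
    type Output = ();
    fn poll(self: Pin<&mut Self>, _cx: &mut Context<'_>) -> Poll<()> {
        self.fired.set(true);
        Poll::Ready(())
    }
}

// Minimal no-op waker so we can poll by hand without an executor.
fn noop_waker() -> Waker {
    fn clone(_: *const ()) -> RawWaker {
        RawWaker::new(std::ptr::null(), &VTABLE)
    }
    fn noop(_: *const ()) {}
    static VTABLE: RawWakerVTable = RawWakerVTable::new(clone, noop, noop, noop);
    unsafe { Waker::from_raw(RawWaker::new(std::ptr::null(), &VTABLE)) }
}

/// Returns (flag before polling, flag after polling).
fn run_lazily() -> (bool, bool) {
    let fired = Rc::new(Cell::new(false));
    let fut = SideEffect { fired: Rc::clone(&fired) };
    let before = fired.get(); // constructing the future did nothing
    let waker = noop_waker();
    let mut cx = Context::from_waker(&waker);
    let mut fut = Box::pin(fut);
    let _ = fut.as_mut().poll(&mut cx); // the side effect happens only here
    (before, fired.get())
}

fn main() {
    let (before, after) = run_lazily();
    assert_eq!((before, after), (false, true));
    println!("before poll: {before}, after poll: {after}");
}
```

The same laziness is why the example's forceful_shutdown future must live inside a polled select! rather than being spawned off to the side.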

Member:

Ahhh. It's because futures do nothing unless polled. Bahhh. Sorry, I was being thick.

Member Author:

Aren't we all?

Resolved review threads on:
kube-runtime/src/controller/mod.rs (one thread outdated)
kube-runtime/src/scheduler.rs (two threads)
kube-runtime/src/utils.rs
kube-runtime/src/watcher.rs (outdated)
@kazk (Member) left a comment:

Looks great!

@nightkr nightkr requested a review from clux July 1, 2021 17:08
@clux (Member) commented Jul 1, 2021

Looks great. Thanks so much. Great default path available and a custom path that's well documented. Awesome PR!

Tiny nit: info messages from kube-runtime might not be super popular. I would personally downgrade those to debug. But feel free to merge at your leisure.

@nightkr (Member Author) commented Jul 2, 2021

The new info messages are intended for operators, and developers don't really have a good way to hook in there atm. That said, anyone who does want to silence them can just set a kube-runtime-specific logging level.

@clux (Member) commented Jul 2, 2021

Ok. Let's leave it as is 👍

@nightkr nightkr merged commit c84110f into kube-rs:master Jul 2, 2021
@nightkr nightkr deleted the feature/graceful-controller-shutdown branch July 2, 2021 03:47
@clux (Member) commented Jul 5, 2021

released in 0.58 :-)
