smp: prefaulter: don't leave zombie worker threads #2679

avikivity · 2025-03-06T11:24:27Z

As explained in #2623 in detail, the prefaulter worker threads that have completed but not joined are left in a zombie state, which confuses gdb thread_local processing. As seastar relies on thread locals heavily, it becomes impossible to debug core dumps.

Fix this by joining the threads after they complete. Use seastar::alien to ask the main reactor threads to join the completed threads when they are done, so it won't stall.

Fixes #2623.

As explained in scylladb#2623 in detail, the prefaulter worker threads that have completed but not joined are left in a zombie state, which confuses gdb thread_local processing. As seastar relies on thread locals heavily, it becomes impossible to debug core dumps. Fix this by joining the threads after they complete. Use seastar::alien to ask the main reactor threads to join the completed threads when they are done, so it won't stall. Fixes scylladb#2623.

michoecho · 2025-03-06T14:49:27Z

src/core/smp.cc

            work(ranges, page_size, huge_page_size_opt);
+            if (!--_active_threads) {
+                run_on(alien, 0, [this] () noexcept { join_threads(); });


Is there no risk that this join_threads() call will happen after the memory_prefaulter is already destroyed?

the prefaulter is nested under smp, and so are the reactors. So if the reactor is still running, the prefaulter still exists.

Yes, but's it's not nice to have implicit lifetime assumptions like that.
Not that this case is worth caring about, though.

In such cases it is unavoidable without strong compiler support. The setup is too hairy.

michoecho

LGTM.

michoecho · 2025-03-06T19:12:46Z

src/core/smp.cc

            work(ranges, page_size, huge_page_size_opt);
+            if (!--_active_threads) {
+                run_on(alien, 0, [this] () noexcept { join_threads(); });


Yes, but's it's not nice to have implicit lifetime assumptions like that.
Not that this case is worth caring about, though.

michoecho reviewed Mar 6, 2025

View reviewed changes

michoecho approved these changes Mar 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

smp: prefaulter: don't leave zombie worker threads #2679

smp: prefaulter: don't leave zombie worker threads #2679

avikivity commented Mar 6, 2025

michoecho Mar 6, 2025

avikivity Mar 6, 2025

michoecho Mar 6, 2025

avikivity Mar 6, 2025

michoecho left a comment

michoecho Mar 6, 2025

smp: prefaulter: don't leave zombie worker threads #2679

Are you sure you want to change the base?

smp: prefaulter: don't leave zombie worker threads #2679

Conversation

avikivity commented Mar 6, 2025

michoecho Mar 6, 2025

Choose a reason for hiding this comment

avikivity Mar 6, 2025

Choose a reason for hiding this comment

michoecho Mar 6, 2025

Choose a reason for hiding this comment

avikivity Mar 6, 2025

Choose a reason for hiding this comment

michoecho left a comment

Choose a reason for hiding this comment

michoecho Mar 6, 2025

Choose a reason for hiding this comment