scx_layered: Implement empty LLC draining #1092

htejun · 2024-12-10T22:46:24Z

A layer is served by per-LLC DSQs. Only tasks that can be serviced by all
CPUs in an LLC are put in the DSQ, so as long as a CPU is assigned to the
layer in the LLC, forward progress is guaranteed. However, if a layer-LLC
loses all its CPUs, there is no forward progress guarantee ignoring the
antistall mechanism. For a confined layer, no CPU will be visiting the empty
LLCs and even for a grouped or open layer, execution from an LLC without
CPUs assigned is lower priority than owned execution and can easily starve.

To resolve the problem, implement LLC draining mechanism. When layer-LLC
loses all CPUs with tasks in it, draining is turned on and other CPUs
assigned to the layer will alternate between their own execution and
draining LLCs without any CPU. The interlockings between the involved code
paths - refresh_cpumasks(), layered_enqueue() and layered_dispatch() - are
rather intricate to guarantee that no tasks end up sitting in a CPU-less
LLC. See comments for details.

Use < instead of <= when comparing against xllc_mig_min_ns so that 0 can disable it completely.

This is trivial to count from BPF but we'll also add per-LLC counts, so let's just do whatever we can do from userspace in userspace.

A layer is served by per-LLC DSQs. Only tasks that can be serviced by all CPUs in an LLC are put in the DSQ, so as long as a CPU is assigned to the layer in the LLC, forward progress is guaranteed. However, if a layer-LLC loses all its CPUs, there is no forward progress guarantee ignoring the antistall mechanism. For a confined layer, no CPU will be visiting the empty LLCs and even for a grouped or open layer, execution from an LLC without CPUs assigned is lower priority than owned execution and can easily starve. To resolve the problem, implement LLC draining mechanism. When layer-LLC loses all CPUs with tasks in it, draining is turned on and other CPUs assigned to the layer will alternate between their own execution and draining LLCs without any CPU. The interlockings between the involved code paths - refresh_cpumasks(), layered_enqueue() and layered_dispatch() - are rather intricate to guarantee that no tasks end up sitting in a CPU-less LLC. See comments for details.

htejun added 4 commits December 9, 2024 12:10

scx_layered: Allow xllc_mig_min_us to be disabled completely

5c38d47

Use < instead of <= when comparing against xllc_mig_min_ns so that 0 can disable it completely.

scx_layered: Set layer.nr_cpus from userspace

d0a1002

This is trivial to count from BPF but we'll also add per-LLC counts, so let's just do whatever we can do from userspace in userspace.

scx_layered: Track per-layer-LLC nr_cpus and report them

6e42f5b

htejun requested review from hodgesds, JakeHillion, etsal and likewhatevs December 10, 2024 22:46

etsal approved these changes Dec 10, 2024

View reviewed changes

htejun added this pull request to the merge queue Dec 11, 2024

Merged via the queue into main with commit 1946c01 Dec 11, 2024
46 checks passed

htejun deleted the htejun/layered-updates branch December 11, 2024 01:29

htejun restored the htejun/layered-updates branch December 11, 2024 01:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scx_layered: Implement empty LLC draining #1092

scx_layered: Implement empty LLC draining #1092

htejun commented Dec 10, 2024

scx_layered: Implement empty LLC draining #1092

scx_layered: Implement empty LLC draining #1092

Conversation

htejun commented Dec 10, 2024