rt: don't reserve a core for the admin runtime admin #613

hawkw · 2020-07-31T16:19:47Z

I don't think we should dedicate an entire core to the admin thread...
That's a lot of resources for something we shouldn't be doing much
work on. The real goal of the separate admin runtime is to ensure we
can still interact with the proxy if, for instance, the main runtime
isn't healthy. I think we're best off keeping it simple and treating
it as if the admin thread needs no dedicated resources.

The admin runtime only performs observability work and certifies
identities, so its load should be pretty light, and it doesn't need to
be actively using a CPU core all the time. The purpose of having a
separate admin runtime is to continue allowing access to observability
functions even if the rest of the proxy gets into a bad state, not to
take load off the main forwarding runtime; the admin thread no longer
performs service discovery lookups (and I think it hasn't for a while).

This branch changes the rt::build function in linkerd2-proxy so
that, when the multithreaded Tokio runtime feature is enabled, we give
the main forwarding runtime a thread for every available CPU core,
rather than every core minus one. Furthermore, if the feature flag is
enabled, we will now use the multithreaded Tokio scheduler if the
number of available CPUs is two or more, rather than three or more,
since we are no longer reserving a thread for the admin runtime.
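
To make the threshold change concrete, here is a minimal sketch of the two sizing policies described above. It is not the actual `rt::build` code: the `MainRuntime` enum and both function names are hypothetical, and the sketch only models the decision (which scheduler, how many workers), not the construction of a Tokio runtime.

```rust
/// Hypothetical description of how the main forwarding runtime is set up.
/// (Illustrative only; this type does not exist in linkerd2-proxy.)
#[derive(Debug, PartialEq, Eq)]
enum MainRuntime {
    /// Single-threaded scheduler.
    SingleThreaded,
    /// Multithreaded scheduler with the given number of worker threads.
    MultiThreaded { workers: usize },
}

/// Previous policy as described in this PR: one core is held back for the
/// admin runtime, so the multithreaded scheduler is only worthwhile with
/// three or more cores, and it gets every core minus one.
fn plan_reserving_admin_core(cores: usize) -> MainRuntime {
    if cores >= 3 {
        MainRuntime::MultiThreaded { workers: cores - 1 }
    } else {
        MainRuntime::SingleThreaded
    }
}

/// New policy as described in this PR: nothing is reserved for the admin
/// runtime, so the multithreaded scheduler is used with two or more cores,
/// and it gets one worker per available core.
fn plan_using_all_cores(cores: usize) -> MainRuntime {
    if cores >= 2 {
        MainRuntime::MultiThreaded { workers: cores }
    } else {
        MainRuntime::SingleThreaded
    }
}
```

The visible difference is on small machines: with two cores the old policy stays single-threaded while the new one runs two workers, and with four cores the main runtime goes from three workers to four.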

In order to ensure that the admin runtime is out of the forwarding
path (so that multiple worker threads don't result in a corresponding
increase in admin load), I've moved the DNS resolver background task to
the main runtime. As a potential follow-up, we might want to simplify
the DNS code so that we just spawn resolution tasks directly, rather
than having a DNS daemon task. trust-dns no longer requires a
background task, so its purpose was just to ensure that DNS resolutions
were spawned on the admin thread.

Signed-off-by: Eliza Weisman <eliza@buoyant.io>

olix0r · 2020-07-31T16:24:28Z

I've got most of this in #612

hawkw · 2020-07-31T16:25:42Z

> I've got most of this in #612

whoops, missed that! we should probably still move the DNS task, i can just cherry-pick that part out if you like?

olix0r · 2020-07-31T21:05:51Z

@hawkw yeah do you want to change this PR to move the tasks onto the main thread? I think we should consider moving the destination & identity clients and the tap server onto the main runtime, too, leaving the admin thread to only be responsible for the admin server. Up to you whether these make sense as separate PRs etc.

hawkw · 2020-07-31T21:58:14Z

Sounds good, I'll move the DNS and Destination clients next; still on the fence about where tap and identity belong. In particular, I think it might be worth being able to tap a proxy even if it's gone into a stuck state?

hawkw added 3 commits July 31, 2020 09:03

rt: give the main runtime all available cores

ce56d59

Signed-off-by: Eliza Weisman <eliza@buoyant.io>

admin: make thread name consistent with main rt workers

0fb6aab

Signed-off-by: Eliza Weisman <eliza@buoyant.io>

rt: spawn DNS task on main rt

6984d59

Signed-off-by: Eliza Weisman <eliza@buoyant.io>

hawkw requested review from olix0r and a team July 31, 2020 16:19

hawkw closed this Jul 31, 2020

olix0r deleted the eliza/no-admin-core branch May 25, 2021 15:49
