
sync: bounded mpsc memory use is proportional to max waiters #2637

Closed
hawkw opened this issue Jun 30, 2020 · 3 comments · Fixed by #2861
Labels
A-tokio Area: The main tokio crate C-bug Category: This is a bug. M-sync Module: tokio/sync

Comments

hawkw (Member) commented Jun 30, 2020

Version

tokio v0.2.21

Platform

all

Description

The memory use of the bounded MPSC channel is proportional to the maximum number of tasks that have concurrently waited on the channel. This isn't technically a "memory leak" as reported by tools like valgrind, as the memory is freed when the channel is dropped. However, in systems where channels are long-lived, it may as well be a memory leak --- especially if the workload is bursty and the number of waiting tasks occasionally spikes.

tokio::sync's locking primitives, like Mutex, do not exhibit the same behavior. Instead, the memory use of a Mutex remains constant regardless of how many tasks are waiting for it. This is because the locks use the new intrusive semaphore implementation added in #2325. The intrusive semaphore does not require additional memory allocations per waiting task, while the old semaphore implementation (which uses a non-intrusive linked list) does. In addition, cancelled waiters in the new semaphore are unlinked from the list when the waiting task terminates, while the old implementation only removes cancelled tasks from the wait list when it notifies a waiter.

I wrote a quick demo of this that compares the mpsc channel and Mutex:
https://github.com/hawkw/tokio-buffer-repro

It spawns a bunch of tasks that all attempt to wait on Sender::send/Mutex::lock when the mpsc is full or the Mutex is already locked. The tasks cancel themselves after a period of time. While this runs, the repro prints the process's current memory use.
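
The linked repository contains the actual repro; the following is only a rough reconstruction of the same idea (a sketch, not the repro's code), written against the tokio 0.2 API, with arbitrary task counts and timeouts:

```rust
use std::time::Duration;
use tokio::sync::mpsc;
use tokio::time::{delay_for, timeout};

#[tokio::main]
async fn main() {
    // Bounded channel with a single slot so that senders must wait.
    let (tx, rx) = mpsc::channel::<u64>(1);

    // Fill the one slot so every subsequent send has to park a waiter.
    let mut first = tx.clone();
    let _ = first.send(0).await;

    // Spawn many tasks that wait on the full channel and then give up.
    for i in 0..100_000u64 {
        let mut tx = tx.clone();
        tokio::spawn(async move {
            // The send never succeeds; the timeout cancels the waiting future.
            let _ = timeout(Duration::from_millis(100), tx.send(i)).await;
        });
    }

    // Let the waiters register and then cancel. The real repro prints the
    // process's RSS here; measuring memory is platform-specific and omitted.
    delay_for(Duration::from_secs(2)).await;

    drop(rx);
    drop(tx);
}
```

In 0.2 the per-waiter allocations stick around after the tasks cancel, which is what the RSS comparison below shows.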

At the end of the test, before the Mutex/mpsc are dropped, we see the following:

mpsc:  waiters:      0; RSS:  36780 kb; virt: 6768197632 kb;
mutex: waiters:      0; RSS:   4844 kb; virt: 6676283392 kb;

We should update the MPSC channel to use the new semaphore implementation.

hawkw added the A-tokio, C-bug, and M-sync labels on Jun 30, 2020
hawkw self-assigned this on Jun 30, 2020
hawkw (Member, Author) commented Jun 30, 2020

As a side note: we have also observed occasional latency spikes after switching some code in Linkerd that used a Mutex to use an mpsc channel instead. However, we don't have as tight a repro for that yet, so I'm going to work on this and hope it also improves that problem.

zaharidichev (Contributor) commented:

@hawkw I am interested in working on this, if that's OK. I have a question, though: it seems to me that switching to the new semaphore would necessitate removing the poll_ready and disarm functions on the bounded sender, or am I missing something? And if that is the case, is it fine to do so? Isn't it enough to have an async fn send that can fail when the buffer is full? Or are there more fundamental reasons for splitting the workflow into slot reservation/release steps and exposing these methods?

carllerche (Member) commented:

Ah, I have this work in progress. It is close to completion; I hope to have a PR up today. I should have posted that, sorry.

carllerche self-assigned this and unassigned hawkw on Sep 22, 2020
carllerche added a commit that referenced this issue Sep 22, 2020
Updates the mpsc channel to use the intrusive waker-based semaphore.
This enables using `Sender` with `&self`.

Instead of using `Sender::poll_ready` to ensure capacity and updating
the `Sender` state, `async fn Sender::reserve()` is added. This function
returns a `Permit` value representing the reserved capacity.

Fixes: #2637
Refs: #2718 (intrusive waiters)
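
For illustration, here is a minimal sketch of the reserve-based flow described in the commit message above, written against the shape this API ended up with in current tokio (1.x), where `Permit::send` consumes the permit:

```rust
use tokio::sync::mpsc;

#[tokio::main]
async fn main() {
    let (tx, mut rx) = mpsc::channel::<u32>(1);

    // Wait for capacity without handing over a value yet. The returned
    // `Permit` holds the reserved slot until it is used or dropped.
    let permit = tx.reserve().await.expect("receiver dropped");

    // Sending through the permit cannot fail and never waits, because the
    // capacity was already reserved above.
    permit.send(42);

    assert_eq!(rx.recv().await, Some(42));
}
```

Because `reserve` borrows the `Sender` with `&self`, waiting for capacity no longer requires a mutable or per-task cloned sender, and dropping the future before it completes unlinks its intrusive waiter rather than leaving an allocation behind.
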
hawkw added a commit to tower-rs/tower that referenced this issue Oct 27, 2020
This branch updates Tower to Tokio 0.3.

Unlike #474, this branch uses Tokio 0.3's synchronization primitives,
rather than continuing to depend on Tokio 0.2. I think that we ought to
try to use Tokio 0.3's channels whenever feasible, because the 0.2
channels have pathological memory usage patterns in some cases (see
tokio-rs/tokio#2637). @LucioFranco let me know what you think of the
approach used here and we can compare notes!

For the most part, this was a pretty mechanical change: updating
versions in Cargo.toml, tracking feature flag changes, renaming
`tokio::time::delay` to `sleep`, and so on. Tokio's channel receivers
also lost their `poll_recv` methods, but we can easily replicate that by
enabling the `"stream"` feature and using `poll_next` instead.

The one actually significant change is that `tokio::sync::mpsc::Sender`
lost its `poll_ready` method, which impacts the way `tower::buffer` is
implemented. When the buffer's channel is full, we want to exert
backpressure in `poll_ready`, so that callers such as load balancers
can choose to call another service rather than waiting for buffer
capacity. Previously, we did this by calling `poll_ready` on the
underlying channel sender.

Unfortunately, this can't be done easily using Tokio 0.3's bounded MPSC
channel, because it no longer exposes a polling-based interface, only an
`async fn ready`, which borrows the sender. Therefore, we implement our
own bounded MPSC on top of the unbounded channel, using a semaphore to
limit how many items are in the channel.

I factored out the code for polling a semaphore acquire future from
`limit::concurrency` into its own module, and reused it in `Buffer`.

Additionally, the buffer tests needed to be updated, because they
currently don't actually poll the buffer service before calling it. This
violates the `Service` contract, and the new code actually fails as a
result.

Closes #473 
Closes #474

Co-authored-by: Lucio Franco <luciofranco14@gmail.com>
Signed-off-by: Eliza Weisman <eliza@buoyant.io>
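
A rough sketch of the "bounded channel on top of an unbounded channel plus a semaphore" approach described in the commit above (this is not Tower's actual code; the `BoundedTx`/`bounded` names are invented for illustration, and the tokio 1.x semaphore API with `acquire_owned` on an `Arc<Semaphore>` is assumed):

```rust
use std::sync::Arc;
use tokio::sync::{mpsc, OwnedSemaphorePermit, Semaphore};

/// A bounded sender built from an unbounded channel plus a semaphore. Each
/// queued item carries the permit that reserved its slot, so capacity is
/// released only once the receiver pulls the item out and drops the permit.
struct BoundedTx<T> {
    inner: mpsc::UnboundedSender<(T, OwnedSemaphorePermit)>,
    semaphore: Arc<Semaphore>,
}

impl<T> Clone for BoundedTx<T> {
    fn clone(&self) -> Self {
        BoundedTx {
            inner: self.inner.clone(),
            semaphore: self.semaphore.clone(),
        }
    }
}

impl<T> BoundedTx<T> {
    /// Waits for a free slot, then pushes the value onto the unbounded queue.
    async fn send(&self, value: T) -> Result<(), T> {
        // Wait here when the channel is "full". Dropping this future while it
        // is waiting removes the intrusive waiter, leaving no allocation behind.
        let permit = match self.semaphore.clone().acquire_owned().await {
            Ok(permit) => permit,
            // The semaphore was closed; hand the value back to the caller.
            Err(_) => return Err(value),
        };
        // The unbounded send itself never waits; the semaphore is the bound.
        self.inner.send((value, permit)).map_err(|e| (e.0).0)
    }
}

fn bounded<T>(
    capacity: usize,
) -> (BoundedTx<T>, mpsc::UnboundedReceiver<(T, OwnedSemaphorePermit)>) {
    let (tx, rx) = mpsc::unbounded_channel();
    let semaphore = Arc::new(Semaphore::new(capacity));
    (BoundedTx { inner: tx, semaphore }, rx)
}
```

Because each queued value carries its permit until the receiver dequeues and drops it, capacity is released at the same point a real bounded channel would release it. Tower's version additionally keeps the semaphore acquisition as a pollable future (the code factored out of `limit::concurrency`) so that `Buffer` can exert backpressure in `poll_ready`.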