Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sync: provide API for notifying multiple waiters. #3066

Closed
2 tasks
carllerche opened this issue Oct 28, 2020 · 8 comments · Fixed by #3098
Closed
2 tasks

sync: provide API for notifying multiple waiters. #3066

carllerche opened this issue Oct 28, 2020 · 8 comments · Fixed by #3098
Assignees
Labels
A-tokio Area: The main tokio crate C-feature-request Category: A feature request. M-sync Module: tokio/sync

Comments

@carllerche
Copy link
Member

There currently is no async equivalent to the synchronous CondVar::notify_all. Roughly, Notify provides similar functionality, but only has a notify_one(). Notify currently has a notify_all() method, but it is crate private. This API has not been made public due to API questions to work through.

First, Notify currently uses an approach similar to park / unpark. When notify_one() is called and there are no waiters, it stores a pending notification. The next time a task waits on Notify, the task consumes the pending notification and the wait completes. This behavior may result in spurious wakeups but is designed to be a lightweight mechamism to avoid race condition without requiring locking in many cases.

However, the park / unpark behavior does not translate to a "notify many" operation. Implementing a useful "notify many" operation is a bit tricky. Consider std::sync::CondVar. In order to wait on a CondVar, the caller must first acquire a mutex. The mutex, paired with CondVar is how races are handled. The downside of using a Mutex is that it is fairly coarse. Usually, with regards to mutual exclusion, in "broadcast" style synchronization, the caller wishes to ensure mutual exclusion between producers and consumers and not with other consumers. Additionally, using an "async" mutex for this doesn't make a ton of sense either as the critical section to guard is usually small and synchronous. It usually makes more sense to use a std Mutex to ensure mutual exclusion, but we need an async-aware notification strategy. See sync::Watch as an example of this.

The private notify_waiters() API handles the race by requiring the consumer to acquire the notification future before checking the state. If the state is as expected and the consumer drops the notification future, otherwise, the consumer waits on the notification future. Unfortunatley, it isn't that simple. In order to register for the notification, the consumer must also poll the notification future. This requires pinning and results in a poor API. I believe that this problem can be solved by storing a counter internally tracking the number of calls to notify_*. When the notification future is created, it loads the current count. When the future is awaited, if the current count is greater than the one loaded on creation, the future completes immediately.

Additionally, notify_waiters() and notify_one() are fairly different behaviors. Conflating them in a single notification primitive may not make sense. I am not aware of any use cases that require mixing notify_one() and notify_waiters(). It may make more sense to provide a separate type for doing broadcast style notifications.

Actions

  • determine if the counter strategy can resolve the poor notify_waiters() API.
  • decide if notify_waiters() should be included on Notify, or a separate type should be created.

Refs: #2739, #3061

@carllerche carllerche added A-tokio Area: The main tokio crate C-feature-request Category: A feature request. M-sync Module: tokio/sync labels Oct 28, 2020
@carllerche carllerche added this to the v1.0 milestone Oct 28, 2020
@kaimast
Copy link

kaimast commented Oct 28, 2020

Awesome! Maybe eventually extend the text in the documentation for tokio::sync as well explaining how to port existing code using CondVars.

@hawkw
Copy link
Member

hawkw commented Oct 28, 2020

Additionally, notify_waiters() and notify_one() are fairly different behaviors. Conflating them in a single notification primitive may not make sense. I am not aware of any use cases that require mixing notify_one() and notify_waiters(). It may make more sense to provide a separate type for doing broadcast style notifications.

I could go either on this. I think you're right that I can't immediately think of use-cases that require both. On the other hand, if we have two separate public API types for "notify one" and "notify all" behaviors, but they share the underlying internal implementation, and someone does have a use-case for both in the same place, this seems a little inefficient. If a user wants to wake tasks with either a "notify one" or a "notify all" operation, they have to express interest in two separate types, maintaining two separate wait lists...

One potential use-case I can think of for both types of notification in one place is something that notifies a single waiter at a time during normal operation but notifies all waiters when it's dropped/shut down/canceled.

@zaharidichev
Copy link
Contributor

Is that something I can work on? Starting with the counter approach for notify_waiters() ?

@carllerche
Copy link
Member Author

@zaharidichev sure. I believe that based on @hawkw's comments, we should include notify_waiters() on Notify itself.

Assuming the counter strategy works, we should increase the Notify internal state from AtomicU8 to AtomicUsize. Then use all bits not needed for the existing state (0, 1, 2, so 2 bits) for the counter. It will probably be easiest to reserve the LSBs for the state and use the MSBs for the counter, then we can just let it wrap.

Existing internal usage of notify_waiters() will need to be updated to avoid the initial pin/poll.

Does the counter strategy make sense?

@zaharidichev
Copy link
Contributor

@carllerche Yes all that makes sense.

@zaharidichev
Copy link
Contributor

I believe that this problem can be solved by storing a counter internally tracking the number of calls to notify_*. When the notification future is created, it loads the current count. When the future is awaited, if the current count is greater than the one loaded on creation, the future completes immediately.

@carllerche Not sure what you mean by "notify_*". I do not think tracking the number of calls to notify_one is right. We only need to count the calls to notify_waiters because otherwise we can end up with multiple Notified futures getting completed because of a single call to notify_one.

@carllerche
Copy link
Member Author

@zaharidichev very possible. The NOTIFIED state probably handles the case of notify_one.

zaharidichev added a commit to zaharidichev/tokio that referenced this issue Nov 5, 2020
This PR changes the way `notify_waiters`,
is used. Previously in order for the consumer to
register interest, in a notification triggered by
`notify_waiters`, the `Notified` future had to be
polled. This introduced friction when using the api
as the future had to be pinned before polled.

This change introduces a counter that tracks how many
times `notified_waiters` has been called. Upon creation of
the future the number of times is loaded. When first
polled the future compares this number with the count
state of the `Notify` type. This avoids the need for
registering the waiter upfront.

Fixes: tokio-rs#3066

Signed-off-by: Zahari Dichev <zaharidichev@gmail.com>
@carllerche
Copy link
Member Author

Based on the discussion, all improvements are forwards compatible. I will remove the 1.0 tag.

@carllerche carllerche removed this from the v1.0 milestone Nov 12, 2020
carllerche pushed a commit that referenced this issue Nov 16, 2020
This PR makes `Notify::notify_waiters` public. The method
already exists, but it changes the way `notify_waiters`,
is used. Previously in order for the consumer to
register interest, in a notification triggered by
`notify_waiters`, the `Notified` future had to be
polled. This introduced friction when using the api
as the future had to be pinned before polled.

This change introduces a counter that tracks how many
times `notified_waiters` has been called. Upon creation of
the future the number of times is loaded. When first
polled the future compares this number with the count
state of the `Notify` type. This avoids the need for
registering the waiter upfront.

Fixes: #3066
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-tokio Area: The main tokio crate C-feature-request Category: A feature request. M-sync Module: tokio/sync
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants