[WithLatestFrom] implement operator for arity 1 and 2 #147

twittemb · 2022-04-15T14:48:38Z

Hi.

This PR aims to propose a new operator: withLatest(from:).
It has been discussed in the forum here.

I am new to the evolution proposal process. This PR contains:

a proposal based on the sample + the guides.
the implemented operator with unit tests and documentation.

Don't hesitate to guide me through the process 😀.

Thanks.

phausler

This looks like a fantastic start!

phausler · 2022-04-15T16:39:57Z

Sources/AsyncAlgorithms/Rethrow.swift

@@ -26,7 +26,7 @@ extension _ErrorMechanism {
    fatalError("materialized error without being in a throwing context")
  }

-  internal func _rethrowGet() rethrows -> Output {
+  @usableFromInline internal func _rethrowGet() rethrows -> Output {


we should avoid this; the _ErrorMechanism is a work-around for a compiler feature that is not yet there so ideally it should not be exposed into ABI.

removed :-)

phausler · 2022-04-15T16:46:58Z

Tests/AsyncAlgorithmsTests/TestZipLatestFrom.swift

+
+  func test_zipLatestFrom_uses_latest_element_from_other_when_base_produces_first_elements() async {
+    // Timeline
+    // base:     |-1-2---3------4---------5-----


these diagrams are really close to the validation diagram, however the use of | is kinda confusing.

Perhaps we can use some sort of combing function that is easily understandable to represent the tuple? e.g. we can use emoji so a tuple of a shape out of 🤍⬜️⚪️ along side a color out of 🟥🟧🟨🟩🟦🟪⬛️ results in a symbol of ❤️🟥🔴🧡🟧🟠... etc?

Doing a 3 variant might be hard; I still need to think on how to accomplish that.

I've refactored the diagrams a bit to make them clearer, tuple-wise

phausler · 2022-04-15T16:48:48Z

Sources/AsyncAlgorithms/AsyncZipLatestFromSequence.swift

+          self.other.task.cancel()
+          throw error
+        }
+        lastKnownOtherState = await self.other.next()


so this does not run the two sequences concurrently for iteration. effectively it is blocking iteration from one to the other. Perhaps a TaskGroup is in order here (like how zip does it)?

The Other sequence is being iterated over in its own Task. The call to self.other.next() will return a value right away (either noElement, element(value) or nil). With the way it is implemented right now I don't see the benefit of having a TaskGroup.

However, the current implementation might not be the good one! What do you think of iterating over Other in a separate Task so that we can always have its latest known value at our disposal ? It comes with the drawback that the latest known value might not be the one that we think because the Task's execution is scheduled by the system and it might be paused for some reason.

Other question: with the current implementation if Other fails or finishes, we will have to wait for the next Base value to know it and end the async sequence. We might pick another strategy where we can end the async sequence as soon as the Other sequence ends.

We will have the same type of questions for operators like switchLatest()`.

phausler · 2022-04-15T16:51:38Z

Sources/AsyncAlgorithms/AsyncZipLatestFromSequence.swift

+  /// - Parameter other: the other ``AsyncSequence``
+  /// - Returns: an ``AsyncZipLatestFromSequence`` where elements are a tuple of an element from `self` and the
+  /// latest known element (if any) from the `other` sequence.
+  @inlinable


I have a feeling that these probably should not be inlinable. Usually the bar I used was "could this reasonably be used for byte streams?" If the answer was yes then it should be inlined/frozen etc. If the answer is meh that doesn't make any sense then they are ok to be resilient boundaries. If the answer is on the fence then it is done if and only if it can be done w/o exposing hacks/workarounds for the language etc.

removed :-)

phausler · 2022-04-15T16:53:09Z

Tests/AsyncAlgorithmsTests/TestZipLatestFrom.swift

+//  Created by Thibault Wittemberg on 01/04/2022.
+//
+
+@preconcurrency import AsyncAlgorithms


oof wat?
XCTest I get, but the framework we are building for async/await stuff; im sure this is just a copy/paste error?

removed :-) It was due to a lack of conformance to Sendable

phausler · 2022-04-15T16:54:01Z

Sources/AsyncAlgorithms/AsyncZipLatestFromSequence.swift

+
+/// ``AsyncZipLatestFromSequence`` is an ``AsyncSequence`` where elements are a tuple of an element from `base` and the
+/// latest known element (if any) from the `other` sequence.
+public struct AsyncZipLatestFromSequence<Base: AsyncSequence, Other: AsyncSequence>: AsyncSequence where Other: Sendable {


these types probably need to be audited for being sendable. conditionally or not; my guess is that they will have similar constraints to zip/combineLatest.

I've conformed the types to Sendable, taking zip as a reference

phausler · 2022-04-15T16:55:29Z

Tests/AsyncAlgorithmsTests/TestZipLatestFrom.swift

+final class TestZipLatestFrom: XCTestCase {}
+
+// MARK: test for AsyncZipLatestSequence
+extension TestZipLatestFrom {


it would be good to get some coverage numbers so that we can get an idea of how much is addressed in the implementation by these tests; generally we aimed to have 90%+ coverage. Obviously coverage does not mean we cover ALL permutations of usage but it definitely assuages stability concerns etc.

I'm having some trouble getting the coverage in Xcode because of an error "Failed to merge raw profiles ...". Does that ring a bell?

phausler · 2022-04-15T16:56:56Z

Sources/AsyncAlgorithms/AsyncZipLatestFromSequence.swift

+
+    @inlinable
+    public mutating func next() async rethrows -> Element? {
+      guard !Task.isCancelled else {


so the cancellation only happens at the start of the iteration? it does not respond immediately while the base here is going too iterated (or when the other is iterated?

If I am reading it correctly; I think that is fine... just wanted to make sure that was the intent.

I've reworked the cancellation handling so that every iteration can catch it

parkera · 2022-05-04T18:46:37Z

Marking this as v1.1 because we should strongly consider it after our initial API stable 1.0 release.

FranzBusch

Just left some smaller comments here already that I spotted

Sources/AsyncAlgorithms/AsyncWithLatestFrom2Sequence.swift

FranzBusch · 2022-09-12T09:15:58Z

Sources/AsyncAlgorithms/AsyncWithLatestFrom2Sequence.swift

+    self.other2 = other2
+  }
+
+  public func makeAsyncIterator() -> Iterator {


We should make sure that only a single Iterator can be created. The problem with multiple iterators is that it leads to non-deterministic behaviour where elements are ending up in.

Can you be a bit more specific, I don't understand what you are asking for ?

Sure, the problem here is a bit broader to the AsyncSequence ecosystem and I am planning to write something up soon. Right now in all the AsyncSequence implementations the makeAsyncIterator method can be called multiple times. Each of these iterators is almost always using the same underlying storage. So the actual implementation is that the algorithms are unicast AsyncSequences however since they allow multiple iterators are getting into an awkward state where multiple Tasks can consume them now. This causes all kinds of issues like non-deterministic behaviour who gets an element. Furthermore, cancellation handling is becoming problematic since if two Tasks are consuming the same AsyncSequence and one gets cancelled it almost always cancels the other consumer as well. I currently consider it best practice to just guard that only a single Iterator can be created and fatalError on subsequent calls.

The current behaviour is fine for Sequences since the they don't have this time aspect to them and are most of the time referencing an underlying content. For our AsyncSequences the content is almost never buffered but consumed on the time stream.

As I said, I am planning to write more up on this soonish just wanted to make sure to put this out here already.

Thanks for the detailed explanation. I'm not sure I see a problem since a call to makeAsyncIterator() creates a new iterator that has its own properties and storage. Nothing is shared between several iterators.

Am I missing something ?

For AsyncSequences that have a storage shared across consumers (like AsyncChannel or an hypothetical AsyncShareSequence), the locking mechanism enforces the good behaviour right ?

Ah sorry I missed that the whole core implementation is inside the Iterator. While this makes things a bit better since the state doesn't interfere it makes still for some weird behaviour IMO:

let other = AsyncChannel<> let base = AsyncStream<> let sequence = base.withLatest(from: other) Task { for await element in sequence {} } Task { for await element in sequence {} }

You cannot tell from this code what is going to happen since the way all of the sequences involved in this little snippet right now allow creating multiple iterators. While your implementation is multi-cast all the other ones are not. IMO, what I personally think we should do is implement almost all our algorithms and base sequences as unicast (unless they have different semantics) and provide share/multicast algorithms. This makes reasoning about the code in the end way easier

let other = AsyncChannel<> let base = AsyncStream<> let sequence = base.shared().withLatest(from: other.shared())

I don't think you have to change your PR here right away since this is still in discussion and like I said I wanna open up a broader discussion around this soonish. Just wanted to share more details with you in why I think we need to be careful around these things.

Ok I understand. This is something I struggled with also. But it is mainly due to the fact that although AsyncChannel and AsyncStream are compatible with a multi producer/multi consumer paradigm, their output is not shared and the elements will be distributed across the consumers.

I guess this is what channels are for. If I refer to the Kotlin implementation it does exactly that.

I've open sourced a project AsyncExtensions before swift-async-algorithms was announced :-) and I'm trying to address notably the share operator in there, as well as shared "channels". Now that swift-async-algorithms has been announced, I'm rethinking my repo as a companion to swift-async-algorithms. It means I will add things that don't yet exist in swift-async-algorithms and try to suggest them to this main repo step by step.

The repo is a work in progress (https://github.com/sideeffect-io/AsyncExtensions/tree/refactor) and the README is not up-to-date but there are some interesting operators in there, that were challenging, like .switchToLatest().

I think the AsyncChannel being MPMC makes sense; however, I am still very torn if AsyncStream should be like that. AsyncChannel does handle cancellation better since it only removes the continuations for that specific Task whereas AsyncStream cancels the whole thing which makes it very hard to use safely.

But we are learning all of this along the way so we should just come up with some rules around what to expect!

W.r.t. your package. I really like it! I also have a rudimentary implementation of share lying around; however, as @phausler said we first need to stabilise the algos that are in here and then add more.

@FranzBusch I've addressed all the comments that could be addressed without an external opinion.
Thanks again for your time.

BTW, speaking of AsyncChannel cancellation, I have on ongoing PR regarding cancellation when several producers in different tasks -> #184

Sources/AsyncAlgorithms/AsyncWithLatestFrom2Sequence.swift

FranzBusch · 2022-09-12T09:17:23Z

Sources/AsyncAlgorithms/AsyncWithLatestFromSequence.swift

+    var onBaseElement: (@Sendable (Base.Element) -> Void)?
+    var onOtherElement: (@Sendable (Other.Element?) -> Void)?


I strongly believe we shouldn't insert code into these things just for testing. The behaviour should be fully observable from the public APIs.

@phausler do you have an opinion about that ?

the more exposed APIs we have the more potential bugs/bad-interactions there could be. If we need some sort of mechanism for testing then we should figure out a way to observe the issue via the tests. Since the tests can control the stepping of the global executors, and we also control both the inputs AND the demand, we should be able to observe everything that is of importance in the tests.

FranzBusch · 2022-09-12T09:33:23Z

Sources/AsyncAlgorithms/AsyncWithLatestFrom2Sequence.swift

+    var onBaseElement: (@Sendable (Base.Element) -> Void)?
+    var onOther1Element: (@Sendable (Other1.Element?) -> Void)?
+    var onOther2Element: (@Sendable (Other2.Element?) -> Void)?


Sources/AsyncAlgorithms/AsyncWithLatestFromSequence.swift

Sources/AsyncAlgorithms/AsyncWithLatestFrom2Sequence.swift

twittemb · 2022-09-12T09:52:06Z

Just left some smaller comments here already that I spotted

Hi @FranzBusch thanks for the review. I'll address them soon. As for the "ad-hoc" internal testing properties, I followed some advices given in this thread by Joe Groff (https://forums.swift.org/t/reliably-testing-code-that-adopts-swift-concurrency/57304/32). Testing concurrently executing code involving a time line is really challenging. In the case of withLatest(from:) we have to have a precise coordination that is deterministic because we have no control on the inner task execution for the other sequence.

What would you suggest to improve that ?

FranzBusch · 2022-09-12T10:50:15Z

Hi @FranzBusch thanks for the review. I'll address them soon. As for the "ad-hoc" internal testing properties, I followed some advices given in this thread by Joe Groff (https://forums.swift.org/t/reliably-testing-code-that-adopts-swift-concurrency/57304/32). Testing concurrently executing code involving a time line is really challenging. In the case of withLatest(from:) we have to have a precise coordination that is deterministic because we have no control on the inner task execution for the other sequence.

What would you suggest to improve that ?

I am really torn on that suggestion, but I feel your pain points because I experienced the same in the merge and debounce implementation where certain scenarios can't be tested since you are not controlling the flow in the spawned Tasks. I think if we go forward with hooks for tests we should #if DEBUG to only have them in the tests but even that produces problems since we want to run the benchmarking tests in release mode.

I would like to get @phausler opinion here about test hooks for simulating timings.

phausler · 2022-09-12T16:24:02Z

So we can control the spawning of tasks! https://github.com/apple/swift-async-algorithms/blob/main/Sources/AsyncSequenceValidation/Test.swift#L313 is where the tests do it today - and I could imagine that we could add either extra functionality to that to adjust the way we need... or we can use that dark magic to build some other really neat tools to control that in different ways.

What I think would be a good plan is to favor testing via what is public surface area as much as possible and then collect info on exactly what we need such that either a) we can get effort from a language/runtime layer or b) (ab)use some of the runtime functions to manipulate the execution into what we want to test. But that really requires some really good outlines of precisely what needs to occur in the most general sense.

twittemb force-pushed the zipLatestFrom branch from 8241e2d to 4b42fef Compare April 15, 2022 14:50

phausler reviewed Apr 15, 2022

View reviewed changes

phausler added the enhancement New feature or request label Apr 15, 2022

twittemb force-pushed the zipLatestFrom branch from 4b42fef to b0fc952 Compare April 20, 2022 12:28

parkera added the v1.1 Post-1.0 work label May 4, 2022

twittemb force-pushed the zipLatestFrom branch from b0fc952 to ab8084f Compare May 4, 2022 20:28

twittemb force-pushed the zipLatestFrom branch from ab8084f to 5c7ca84 Compare June 11, 2022 19:17

twittemb mentioned this pull request Jul 26, 2022

Add an introduction and formatting for the combineLatest algorithm into a proposal. #177

Merged

twittemb force-pushed the zipLatestFrom branch 3 times, most recently from 60bb5c4 to 39e3ec1 Compare September 12, 2022 08:10

FranzBusch reviewed Sep 12, 2022

View reviewed changes

twittemb changed the title ~~operators: add zipLatest(from:)~~ operators: add withLatest(from:) Sep 12, 2022

operators: add withLatest(from:)

ddd5969

twittemb force-pushed the zipLatestFrom branch from 39e3ec1 to ddd5969 Compare September 12, 2022 15:36

twittemb changed the title ~~operators: add withLatest(from:)~~ [Operators] add withLatest(from:) Nov 12, 2022

Keithbreadley approved these changes Jan 4, 2023

View reviewed changes

twittemb changed the title ~~[Operators] add withLatest(from:)~~ [WithLatestFrom] implement operator for arity 1 and 2 Jan 5, 2023

		var onBaseElement: (@Sendable (Base.Element) -> Void)?
		var onOtherElement: (@Sendable (Other.Element?) -> Void)?

[WithLatestFrom] implement operator for arity 1 and 2 #147

Are you sure you want to change the base?

[WithLatestFrom] implement operator for arity 1 and 2 #147

Uh oh!

Conversation

twittemb commented Apr 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

phausler left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

twittemb Apr 20, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

twittemb Apr 20, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

parkera commented May 4, 2022

Uh oh!

FranzBusch left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

twittemb Sep 12, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

twittemb commented Apr 15, 2022 •

edited

Loading

twittemb Apr 20, 2022 •

edited

Loading

twittemb Apr 20, 2022 •

edited

Loading

twittemb Sep 12, 2022 •

edited

Loading

twittemb commented Sep 12, 2022 •

edited

Loading

phausler commented Sep 12, 2022 •

edited

Loading