
Optimize interop.flow.StreamSubscriber.onNext #3387

Merged: 6 commits from optimize-flow-interop into typelevel:main on Oct 23, 2024

Conversation


@BalmungSan BalmungSan commented Feb 10, 2024

Since the reactive-streams spec requires that all calls be made "serially", these changes optimize the tight loop of multiple consecutive onNext calls.
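
To make the intuition concrete, here is a minimal sketch of the technique (hypothetical names, not the actual fs2 implementation): because the spec guarantees a Subscriber's signals are never invoked concurrently (rule 1.3), onNext can fill a plain, unsynchronized buffer and pay for cross-fiber coordination once per chunk instead of once per element.

```scala
import java.util.concurrent.Flow

// Hypothetical sketch, not the fs2 implementation: the spec guarantees
// onSubscribe/onNext/onError/onComplete are signaled serially, so this
// buffer needs no synchronization, and coordination with the consumer
// (`emit`) happens only once per full chunk.
final class ChunkingSubscriber[A](chunkSize: Int)(emit: Vector[A] => Unit)
    extends Flow.Subscriber[A] {
  private[this] var subscription: Flow.Subscription = null
  private[this] var buffer = Vector.newBuilder[A]
  private[this] var size = 0

  override def onSubscribe(s: Flow.Subscription): Unit = {
    subscription = s
    s.request(chunkSize.toLong) // ask for a whole chunk up front
  }

  override def onNext(a: A): Unit = {
    buffer += a
    size += 1
    if (size == chunkSize) {
      emit(buffer.result()) // hand off the whole chunk in one step
      buffer = Vector.newBuilder[A]
      size = 0
      subscription.request(chunkSize.toLong)
    }
  }

  override def onError(t: Throwable): Unit = ()

  override def onComplete(): Unit =
    if (size > 0) emit(buffer.result()) // flush the final partial chunk
}
```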


Benchmark results

Before the optimization

```
Benchmark                           (fibers)  (iterations)   Mode  Cnt   Score   Error  Units
FlowInteropBenchmark.fastPublisher      1000          1024  thrpt   20  25.938 ± 0.306  ops/s
FlowInteropBenchmark.fastPublisher      1000          5120  thrpt   20   6.519 ± 0.015  ops/s
FlowInteropBenchmark.fastPublisher      1000         10240  thrpt   20   3.201 ± 0.168  ops/s
FlowInteropBenchmark.fastPublisher      1000         51200  thrpt   20   0.680 ± 0.013  ops/s
FlowInteropBenchmark.fastPublisher      1000        512000  thrpt   20   0.062 ± 0.001  ops/s
```

After the optimization

```
Benchmark                           (fibers)  (iterations)   Mode  Cnt   Score   Error  Units
FlowInteropBenchmark.fastPublisher      1000          1024  thrpt   20  63.846 ± 0.094  ops/s
FlowInteropBenchmark.fastPublisher      1000          5120  thrpt   20  22.314 ± 0.107  ops/s
FlowInteropBenchmark.fastPublisher      1000         10240  thrpt   20  12.662 ± 0.028  ops/s
FlowInteropBenchmark.fastPublisher      1000         51200  thrpt   20   2.736 ± 0.012  ops/s
FlowInteropBenchmark.fastPublisher      1000        512000  thrpt   20   0.275 ± 0.002  ops/s
```

Analysis

For only 2 chunks of 512 elements (1024 iterations), the improvement was almost 2.5 times.
For 20 chunks (10240 iterations), it was almost 4 times.
And for 1000 chunks (512000 iterations), it plateaued at roughly 4.4 times.

It seems, then, that for a fully sequential and CPU-bound Publisher the new Subscriber is roughly 4 times more efficient, as long as the chunk size is adequate and there are enough chunks to amortize the overhead of the general machinery.
Overall, this optimization looks great: even if realistic use cases won't see such a big increase, the effects should still be noticeable.
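
For context, here is a hedged sketch of the user-facing path these benchmarks exercise. `fromPublisher` and its chunk-size parameter come from fs2's `fs2.interop.flow` module, but double-check the exact signature against the fs2 version you use; the demo wiring (names, sleep-based coordination) is mine, not from this PR.

```scala
import cats.effect.{IO, IOApp}
import java.util.concurrent.SubmissionPublisher
import scala.concurrent.duration._

// Hedged usage sketch: drives a JDK Flow.Publisher through fs2's
// interop.flow layer, the code path whose onNext loop this PR optimizes.
object FlowInteropDemo extends IOApp.Simple {
  def run: IO[Unit] = {
    val publisher = new SubmissionPublisher[Int]()
    val consume =
      fs2.interop.flow
        .fromPublisher[IO, Int](publisher, 512) // buffer up to 512 elements per request
        .evalMap(i => IO.println(i))
        .compile
        .drain
    // Naive coordination for the sketch: give the consumer a moment to
    // subscribe, then publish and close to signal completion.
    val produce = IO.sleep(100.millis) *> IO.blocking {
      (1 to 10).foreach(publisher.submit)
      publisher.close()
    }
    consume.both(produce).void
  }
}
```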


Lies, Damn Lies, and Benchmarks

@BalmungSan BalmungSan force-pushed the optimize-flow-interop branch from 72b0be9 to 8234158 on February 12, 2024 17:14
```diff
@@ -109,6 +109,7 @@ private[flow] final class StreamSubscription[F[_], A] private (
         // if we were externally canceled, this is handled below
         F.unit
       }
+      .mask
```
@BalmungSan BalmungSan commented Feb 12, 2024

Related to #3384
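
For readers following along: fs2's `Stream#mask` behaves like the identity function but halts the stream on an error without returning it. A small illustration of my own, not code from this PR:

```scala
import cats.effect.IO
import fs2.Stream

// Stream#mask: on error the stream simply ends instead of failing,
// so the failure does not propagate to whoever runs the stream.
val masked: Stream[IO, Int] =
  (Stream(1, 2) ++ Stream.raiseError[IO](new RuntimeException("boom"))).mask

// masked.compile.toList evaluates to IO(List(1, 2)) rather than a failed IO.
```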

@BalmungSan BalmungSan force-pushed the optimize-flow-interop branch from 05418a2 to d565f2e on October 19, 2024 18:38
@BalmungSan BalmungSan requested a review from armanbilge October 20, 2024 13:33
@BalmungSan BalmungSan force-pushed the optimize-flow-interop branch from d565f2e to 1c0ba9a on October 21, 2024 21:33
@BalmungSan BalmungSan requested a review from armanbilge October 21, 2024 21:33
@BalmungSan BalmungSan force-pushed the optimize-flow-interop branch from 1c0ba9a to fe97a74 on October 21, 2024 21:56
@BalmungSan BalmungSan force-pushed the optimize-flow-interop branch from fe97a74 to 58bfa07 on October 21, 2024 22:28
@BalmungSan BalmungSan requested a review from armanbilge October 22, 2024 03:39
@armanbilge armanbilge changed the title Optimize interop.flow.StreamSubscriber.onNext Oct 23, 2024
@armanbilge armanbilge merged commit 8d11163 into typelevel:main Oct 23, 2024
15 checks passed
@BalmungSan BalmungSan deleted the optimize-flow-interop branch October 23, 2024 00:25