Add experimental alternative fetch strategies #970
Conversation
This PR adds two experimental fetch strategies:

1. `ManyPartitionsQueueSizeBasedFetchStrategy`, a variation on the default `QueueSizeBasedFetchStrategy` which limits total memory usage.
2. `PredictiveFetchStrategy`, an improved predictive fetching strategy (compared to the predictive strategy from zio-kafka 2.3.x) which uses history to calculate the average number of polls the stream needed to process its data, and uses that to estimate when the stream needs more data.
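A rough sketch of the idea behind the first strategy: resume fetching for a partition only while the projected total number of queued records across all partitions stays under a global cap (a proxy for memory usage). The names, parameters, and selection order below are illustrative assumptions, not the actual zio-kafka implementation.

```scala
object ManyPartitionsFetchSketch {
  // Hypothetical state: how many records are currently queued per partition.
  final case class PartitionQueue(partition: Int, queuedRecords: Int)

  /** Select partitions to resume fetching for. A partition qualifies when its
    * queue is below the per-partition target, but only as long as the projected
    * total (records already queued plus records we still intend to fetch)
    * stays under the global maximum. Partitions with the smallest queues are
    * considered first.
    */
  def partitionsToFetch(
      queues: List[PartitionQueue],
      perPartitionTarget: Int,
      globalMax: Int
  ): Set[Int] = {
    var budget   = globalMax - queues.map(_.queuedRecords).sum
    val selected = Set.newBuilder[Int]
    for (q <- queues.sortBy(_.queuedRecords) if q.queuedRecords < perPartitionTarget) {
      val need = perPartitionTarget - q.queuedRecords
      if (budget >= need) {
        selected += q.partition
        budget -= need
      }
    }
    selected.result()
  }
}
```

For example, with a per-partition target of 10 and a global cap of 25, two partitions holding 2 and 8 queued records both qualify; lowering the cap to 15 leaves budget only for the partition that needs the fewest additional records. The point of the variation is that the limit scales with the whole consumer, not with the number of assigned partitions.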
Number of polls is quite a discrete measurement. What do you think of converting this into a running average (such as an exponentially weighted moving average) of the number of records dequeued on each poll?
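The suggestion could look like the following sketch: an exponentially weighted moving average over the per-poll dequeue counts, where each new sample contributes `alpha` and the running value contributes `1 - alpha`. The smoothing factor and surrounding names are assumptions for illustration only.

```scala
object EwmaSketch {
  /** Exponentially weighted moving average. Recent samples dominate;
    * older samples decay geometrically.
    */
  final case class Ewma(alpha: Double, value: Double) {
    def update(sample: Double): Ewma =
      copy(value = alpha * sample + (1.0 - alpha) * value)
  }

  // Fold the number of records dequeued on each poll into the average.
  def averageDequeued(samples: Seq[Int], alpha: Double): Double =
    samples.foldLeft(Ewma(alpha, 0.0))((e, s) => e.update(s.toDouble)).value
}
```

With `alpha = 0.5` and starting from zero, a single poll that dequeues 10 records moves the average to 5.0, and a second identical poll moves it to 7.5, smoothly approaching the steady-state rate.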
That will work against people who use something like Grafana. First you need to collect and sum the raw counters from each instance of your service. Then, and only then, can you calculate a running average, an integral, or whatever other operation you need.
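To illustrate the point with made-up numbers: once each instance pre-averages its own metric, the averages can no longer be combined correctly across instances, whereas raw counters can be summed first and divided afterwards. The instances and figures below are hypothetical.

```scala
object CounterAggregationSketch {
  // Raw counters exported by two hypothetical service instances.
  final case class Counters(polls: Long, recordsDequeued: Long)

  val instanceA = Counters(polls = 100, recordsDequeued = 1000) // 10 records/poll
  val instanceB = Counters(polls = 10, recordsDequeued = 500)   // 50 records/poll

  // Correct: sum the raw counters across instances, then divide once.
  val correctAverage: Double = {
    val totalRecords = instanceA.recordsDequeued + instanceB.recordsDequeued
    val totalPolls   = instanceA.polls + instanceB.polls
    totalRecords.toDouble / totalPolls // 1500 / 110
  }

  // Wrong: averaging the per-instance averages gives the rarely-polling
  // instance the same weight as the busy one.
  val naiveAverage: Double = {
    val avgA = instanceA.recordsDequeued.toDouble / instanceA.polls
    val avgB = instanceB.recordsDequeued.toDouble / instanceB.polls
    (avgA + avgB) / 2.0 // (10 + 50) / 2
  }
}
```

Here the counter-based average is roughly 13.6 records per poll while the average of averages is 30, which is why monitoring systems prefer exporting raw counters and deriving rates at query time.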
@erikvanoosten Shall we close this PR? It's well over a year old now.
Sure. I have put a lot of effort into it.