Documents the different benchmarks
erikvanoosten committed Nov 23, 2024
1 parent b713170 commit a699f94
Showing 2 changed files with 55 additions and 4 deletions.
57 changes: 54 additions & 3 deletions zio-kafka-bench/README.md
@@ -2,17 +2,67 @@

## Results

The benchmarks are run from a GitHub action on every commit. The results are published
on https://zio.github.io/zio-kafka/dev/bench/.

The results are automatically pruned by [a Scala script](https://github.com/zio/zio-kafka/blob/gh-pages/scripts/prune-benchmark-history.sc) on the `gh-pages` branch.

## The consumer benchmarks

When comparing the zio-kafka benchmarks against the regular Kafka clients, keep in mind that these benchmarks
represent the worst possible case for zio-kafka: the consumers only count the received records and do no further
processing. This puts zio-kafka at a disadvantage because zio-kafka programs normally process records in parallel,
while other Kafka consumers process records serially.

All consumer benchmarks send 50k records of ~512 bytes each per run.

#### zio.kafka.bench.ZioKafkaConsumerBenchmark.throughput

Uses zio-kafka's `plainStream` with a topic subscription. The offsets of the consumed records are _not_ committed.
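
For illustration, a minimal sketch of such a stream, assuming a hypothetical topic name, byte-array serdes, and record
count (not the benchmark's actual configuration):

```scala
import zio._
import zio.kafka.consumer.{ Consumer, Subscription }
import zio.kafka.serde.Serde

// Stream records from a topic subscription and stop after a fixed number of
// records; no offsets are committed anywhere in this pipeline.
def consumeAndCount(consumer: Consumer, expectedCount: Long): Task[Unit] =
  consumer
    .plainStream(Subscription.topics("bench-topic"), Serde.byteArray, Serde.byteArray)
    .take(expectedCount)
    .runDrain
```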

#### zio.kafka.bench.ZioKafkaConsumerBenchmark.throughputWithCommits

Same as above, but now the offsets of the consumed records are committed.
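
A sketch of the committing variant, using zio-kafka's usual offset-batch commit pattern (again with assumed names):

```scala
import zio._
import zio.kafka.consumer.{ Consumer, Subscription }
import zio.kafka.serde.Serde

// Same stream, but the offsets of the consumed records are batched and committed.
def consumeCountAndCommit(consumer: Consumer, expectedCount: Long): Task[Unit] =
  consumer
    .plainStream(Subscription.topics("bench-topic"), Serde.byteArray, Serde.byteArray)
    .take(expectedCount)
    .map(_.offset)
    .aggregateAsync(Consumer.offsetBatches)
    .mapZIO(_.commit)
    .runDrain
```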

#### zio.kafka.bench.comparison.KafkaClientBenchmarks.kafkaClients

The simplest possible Kafka client that subscribes to a topic. It directly calls the poll method in a tight loop.
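
A sketch of such a poll loop, using the plain Kafka consumer from Scala (properties, topic name, and record count are
assumptions):

```scala
import java.time.Duration
import java.util.Properties
import org.apache.kafka.clients.consumer.KafkaConsumer
import org.apache.kafka.common.serialization.ByteArrayDeserializer

// Subscribe to the topic and poll in a tight loop, counting records until the
// expected number has been seen.
def pollLoop(props: Properties, expectedCount: Int): Unit = {
  val consumer =
    new KafkaConsumer[Array[Byte], Array[Byte]](props, new ByteArrayDeserializer, new ByteArrayDeserializer)
  try {
    consumer.subscribe(java.util.List.of("bench-topic"))
    var count = 0
    while (count < expectedCount)
      count += consumer.poll(Duration.ofMillis(100)).count()
  } finally consumer.close()
}
```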

#### zio.kafka.bench.comparison.KafkaClientBenchmarks.manualKafkaClients

Same as above, but now using partition assignment instead of topic subscription.
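
The only difference is the assignment step; roughly (the partition count is an assumption):

```scala
import scala.jdk.CollectionConverters._
import org.apache.kafka.clients.consumer.KafkaConsumer
import org.apache.kafka.common.TopicPartition

// Replace consumer.subscribe(...) in the previous sketch with a direct assignment.
def assignAllPartitions(consumer: KafkaConsumer[Array[Byte], Array[Byte]], partitionCount: Int): Unit =
  consumer.assign((0 until partitionCount).map(p => new TopicPartition("bench-topic", p)).asJava)
```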

#### zio.kafka.bench.comparison.ZioKafkaBenchmarks.zioKafka

Does the same as `zio.kafka.bench.ZioKafkaConsumerBenchmark.throughput`.

#### zio.kafka.bench.comparison.ZioKafkaBenchmarks.manualZioKafka

Does the same as `zio.kafka.bench.ZioKafkaConsumerBenchmark.throughput`, but uses a partition assignment instead of a
topic subscription.
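
In zio-kafka the only change is the subscription type; a sketch with assumed topic and partition indices:

```scala
import zio.kafka.consumer.Subscription

// A manual partition assignment instead of Subscription.topics("bench-topic").
val manualSubscription: Subscription =
  Subscription.manual("bench-topic" -> 0, "bench-topic" -> 1, "bench-topic" -> 2)
```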

## The producer benchmarks

#### zio.kafka.bench.ZioKafkaProducerBenchmark.produceChunkSeq

Sequentially produces 30 batches, where each batch contains 500 small records.
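
A sketch of what that looks like with zio-kafka's producer (topic name and record contents are assumptions):

```scala
import zio._
import zio.kafka.producer.Producer
import zio.kafka.serde.Serde
import org.apache.kafka.clients.producer.ProducerRecord

// One batch of 500 small records.
val batch: Chunk[ProducerRecord[String, String]] =
  Chunk.fromIterable((1 to 500).map(i => new ProducerRecord("bench-topic", s"key-$i", s"value-$i")))

// Produce 30 of these batches, one after another (1 produce + 29 repeats).
val produceChunkSeq: RIO[Producer, Unit] =
  Producer.produceChunk(batch, Serde.string, Serde.string).repeatN(29).unit
```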

#### zio.kafka.bench.ZioKafkaProducerBenchmark.produceChunkPar

Produces the same batches as the above, but from 4 fibers.
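
Continuing the previous sketch, one way to spread the batches over 4 fibers (the exact split per fiber is an
assumption):

```scala
// Produce the 30 batches with at most 4 producing fibers at a time.
val produceChunkPar: RIO[Producer, Unit] =
  ZIO
    .foreachParDiscard(1 to 30)(_ => Producer.produceChunk(batch, Serde.string, Serde.string))
    .withParallelism(4)
```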

#### zio.kafka.bench.ZioKafkaProducerBenchmark.produceSingleRecordSeq

Sequentially produces 100 small records.
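
This corresponds to the single line changed in the diff further below; roughly, with the same imports as above and an
assumed topic name:

```scala
// Produce 100 records sequentially (1 produce + 99 repeats).
val produceSingleRecordSeq: RIO[Producer, Unit] =
  Producer.produce("bench-topic", "key", "value", Serde.string, Serde.string).repeatN(99).unit
```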

#### zio.kafka.bench.ZioKafkaProducerBenchmark.produceSingleRecordPar

Produces 100 small records from 4 fibers.
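
A sketch of the parallel single-record variant (the split over fibers is an assumption):

```scala
// 4 fibers, each producing 25 of the 100 records (1 produce + 24 repeats per fiber).
val produceSingleRecordPar: RIO[Producer, Unit] =
  ZIO.foreachParDiscard(1 to 4) { _ =>
    Producer.produce("bench-topic", "key", "value", Serde.string, Serde.string).repeatN(24)
  }
```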

## How to run the benchmarks

To run these "comparison" benchmarks, in an sbt console, run:

```scala
clean
Test/compile
```

@@ -27,6 +77,7 @@

Here, we're only selecting the ones living in the `comparison` package.
To list all possible options and understand these configurations, run `sbt "zioKafkaBench/Jmh/run -h"`.

Meaning of the options used:

- "-wi 10": 10 warmup iterations
- "-i 10": 10 benchmark iterations
- "-r 1": Minimum time to spend at each measurement iteration. 1 second

```diff
@@ -50,7 +50,7 @@ class ZioKafkaProducerBenchmark extends ProducerZioBenchmark[Kafka with Producer
   @BenchmarkMode(Array(Mode.Throughput))
   @OutputTimeUnit(TimeUnit.SECONDS)
   def produceSingleRecordSeq(): Any = runZIO {
-    // Produce 50 records sequentially
+    // Produce 100 records sequentially
     Producer.produce(topic1, "key", "value", Serde.string, Serde.string).repeatN(99)
   }
```
