in_kafka: boost throughput #9625

coreidcc · 2024-11-20T16:56:45Z

We have a Kafka cluster with about 40k messages and 25MB of data per second. Fluent Bit stands no chance of keeping up with this load in its current state, where it

a) commits each message individually
b) uses a poll timeout of just 1ms (this completely overrides fetch.wait.max.ms from Kafka)

Even Logstash is faster, and Vector consumes all these messages with ease.

Probably related to "Batch processing is required in in_kafka. #8030"

Enter [N/A] in the box, if an item is not applicable to your change.

Testing
Before we can approve your change; please submit the following in a comment:

To activate the changes, one needs to set:

[INPUT]
Name kafka
threaded true -> sets timeout so that it will be limited by fetch.wait.max.ms in any practical scenario
enable_auto_commit true -> disable explicit commit call

-> The change doesn't do any dynamic allocations at all and therefore can't introduce any memory leaks
-> The change has no impact on packaging at all
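
For reference, a fuller configuration sketch. The broker address, topic and group id are placeholders. The `rdkafka.fetch.wait.max.ms` line uses in_kafka's pass-through for librdkafka properties, with 500 chosen purely as an example value. `enable_auto_commit` is the option added by this PR.

```
[INPUT]
    # Placeholder connection details, replace with your own
    Name                      kafka
    Brokers                   localhost:9092
    Topics                    example-topic
    Group_Id                  fluent-bit-example
    # Run the input in its own thread so the longer poll timeout applies
    threaded                  true
    # Option introduced by this PR: skip the explicit per-message commit
    enable_auto_commit        true
    # Example value only, passed through to librdkafka
    rdkafka.fetch.wait.max.ms 500
```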

Documentation

Documentation is prepared and will follow shortly.

Backporting

Minor change that doesn't need to wait for a major release.

Fluent Bit is licensed under Apache 2.0, by submitting this pull request I understand that this code will be released under the terms of that license.

coreidcc · 2024-11-20T17:04:26Z

Documentation is here: fluent/fluent-bit-docs#1520

patrick-stephens

I think the auto-commit default is not right to keep current behaviour. Plus we probably should make the poll rate configurable if we're changing this all anyway.

Can we add some unit tests as well?

patrick-stephens · 2024-11-22T11:25:51Z

plugins/in_kafka/in_kafka.c

-        rd_kafka_commit(ctx->kafka.rk, NULL, 0);
+
+        if(!ctx->enable_auto_commit) {
+            /* TO-DO: commit the record based on `ret` */


I think we need to sort all TODOs, we can't just leave them in there. I know it was in the original but we should attempt to sort or add more info.

coreidcc · 2024-11-22

Well, I didn't introduce the TO-DO. It existed before; it just moved into the if statement. Unfortunately GitHub shows too little context, so you can't see it.


patrick-stephens · 2024-11-22T11:27:09Z

plugins/in_kafka/in_kafka.c

+   {
+    FLB_CONFIG_MAP_BOOL, "enable_auto_commit", FLB_IN_KAFKA_ENABLE_AUTO_COMMIT,
+    0, FLB_TRUE, offsetof(struct flb_in_kafka_config, enable_auto_commit),
+    "Relay on kafka auto-commit and commit messages in batches"


I think this is a typo and should be "rely".


patrick-stephens · 2024-11-22T11:28:35Z

plugins/in_kafka/in_kafka.c

-        /* TO-DO: commit the record based on `ret` */
-        rd_kafka_commit(ctx->kafka.rk, NULL, 0);
+
+        if(!ctx->enable_auto_commit) {


Current behaviour is to have auto-commit false but the default below is now true which means we are not maintaining the old behaviour by default.

Polling every 1ms and committing each message individually results in rather poor performance in high-volume Kafka clusters. Committing in batches (relying on Kafka's auto-commit) drastically improves performance. Signed-off-by: CoreidCC <sws-github@coreid.cc>

Having a 1ms timeout might make sense if the input plugin is running in the main thread (not introducing delay for others). But if we run in our very own thread, then we should not override the fetch.wait.max.ms configuration value from the Kafka consumer. This, in conjunction with using auto-commit, again boosts the throughput significantly. Signed-off-by: CoreidCC <sws-github@coreid.cc>

Signed-off-by: CoreidCC <sws-github@coreid.cc>

coreidcc requested review from edsiper, leonardo-albertovich, fujimotos and koleini as code owners November 20, 2024 16:56

github-actions bot added the docs-required label Nov 20, 2024

coreidcc force-pushed the master branch from c87cf60 to f452640 Compare November 20, 2024 17:01

patrick-stephens reviewed Nov 22, 2024


coreidcc force-pushed the master branch 2 times, most recently from 56a331a to 713453e Compare November 22, 2024 18:59

coreidcc temporarily deployed to pr November 23, 2024 17:22 — with GitHub Actions Inactive

coreidcc temporarily deployed to pr November 23, 2024 17:44 — with GitHub Actions Inactive

coreidcc temporarily deployed to pr November 23, 2024 17:45 — with GitHub Actions Inactive

coreidcc changed the title ~~Boost in_kafa throughput~~ Boost in_kafka throughput Nov 23, 2024

coreidcc changed the title ~~Boost in_kafka throughput~~ in_kafka: boost throughput Nov 23, 2024

coreidcc force-pushed the master branch from 713453e to de547bb Compare November 23, 2024 18:45

coreidcc requested a review from patrick-stephens November 25, 2024 17:30

coreidcc temporarily deployed to pr November 25, 2024 18:38 — with GitHub Actions Inactive

coreidcc temporarily deployed to pr November 25, 2024 19:01 — with GitHub Actions Inactive

coreidcc added 3 commits November 28, 2024 17:12

in_kafka: fix type in help text

f94cf72

Signed-off-by: CoreidCC <sws-github@coreid.cc>

coreidcc force-pushed the master branch from de547bb to f94cf72 Compare November 28, 2024 16:12

coreidcc temporarily deployed to pr November 28, 2024 16:19 — with GitHub Actions Inactive

coreidcc temporarily deployed to pr November 28, 2024 16:41 — with GitHub Actions Inactive
