
[MID-164] Process kafka messages sequentially and commit manually #135

Merged · 16 commits into master from remove-concurrent-reads · Jul 12, 2022

Conversation

@ulich (Contributor) commented on Jul 11, 2022

This ensures that message processing is reliable. Before this change, the handling of a message was pushed off to a goroutine and the next message was read immediately. As soon as the next message was read, the previously read message was implicitly marked as successfully processed, because the offset was committed automatically (relying on the default enable.auto.commit=true, which commits every 5 seconds by default). After this change, make sure that you either do not set enable.auto.commit at all (this library will set it to false) or set enable.auto.commit=false explicitly.
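For illustration only, here is a minimal sketch of what a raw confluent-kafka-go consumer configuration with auto-commit disabled looks like. The broker address and group id are placeholders, and in practice this library sets enable.auto.commit=false for you:

```go
package main

import (
	"github.com/confluentinc/confluent-kafka-go/kafka"
)

// newConsumer builds a consumer with auto-commit disabled, so offsets are only
// advanced by an explicit commit after the handler has finished. A message that
// was read but not yet committed is redelivered after a crash or redeployment.
func newConsumer() (*kafka.Consumer, error) {
	return kafka.NewConsumer(&kafka.ConfigMap{
		"bootstrap.servers":  "localhost:9092", // placeholder
		"group.id":           "my-service",     // placeholder
		"enable.auto.commit": false,
		"auto.offset.reset":  "earliest",
	})
}
```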

If there was a crash or redeployment between reading a message and the handler function finishing (especially a problem when the retry middleware is used, which can lead to very long handler execution times), the message would have been lost and would not be reprocessed when the k8s pod came back up.

Additionally, if thousands of messages were pushed to a topic, the consumer read all of them in quick succession, pushing each one onto its own goroutine and essentially processing all messages in parallel. This caused problems with memory consumption and CPU usage.

This change will make the processing sequential. One consumer will read one message at a time. There will also be an explicit commit after the message handler is finished processing.
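Continuing the sketch above (add "log" to its imports), this is roughly what such a sequential read-handle-commit loop looks like with confluent-kafka-go. The handle callback and the error logging are stand-ins for this library's internal handler wiring and errFn, not its actual implementation:

```go
// run reads one message at a time, runs the handler to completion, and only
// then commits the offset. Nothing is read ahead, so a crash before the commit
// means the message is redelivered instead of being lost.
func run(consumer *kafka.Consumer, handle func(*kafka.Message) error) error {
	for {
		msg, err := consumer.ReadMessage(-1) // block until a message arrives
		if err != nil {
			return err
		}
		if err := handle(msg); err != nil {
			// Stand-in for the library's errFn: surface the handler error
			// instead of swallowing it.
			log.Printf("handler failed at offset %v: %v", msg.TopicPartition.Offset, err)
		}
		// Explicit commit: the message counts as processed only after the handler returns.
		if _, err := consumer.CommitMessage(msg); err != nil {
			return err
		}
	}
}
```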

Now that each message is processed sequentially, delivery is always ordered, so the config.WithDeliveryOrder method is no longer needed and has been removed.

This change will reduce the throughput of message processing if you don't modify your application source code, as one consumer will only process one message at a time. If you expect a high volume of messages, you can change your service to start multiple consumers instead of scaling to more k8s pods.
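As one possible way to do that (an assumption, not something prescribed by this library), a service could start several consumers in the same consumer group within one process, each running its own sequential loop. This continues the sketches above, reusing the hypothetical newConsumer and run helpers (add "sync" to the imports); the topic name and consumer count are placeholders:

```go
// startConsumers runs n consumers in parallel; each one still processes its
// own messages strictly one at a time and commits after each handler run.
func startConsumers(n int, handle func(*kafka.Message) error) {
	var wg sync.WaitGroup
	for i := 0; i < n; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			c, err := newConsumer() // consumer sketch from above
			if err != nil {
				log.Fatalf("create consumer: %v", err)
			}
			defer c.Close()
			if err := c.SubscribeTopics([]string{"my-topic"}, nil); err != nil { // placeholder topic
				log.Fatalf("subscribe: %v", err)
			}
			if err := run(c, handle); err != nil {
				log.Printf("consumer stopped: %v", err)
			}
		}()
	}
	wg.Wait()
}
```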

Extra stuff:

  • When a handler returns an error, this is now passed to the errFn
  • Use latest confluent-kafka-go v1.9.1

This ensures that message processing is more reliable. By pushing the message handling off to a goroutine, we immediately "acknowledged" the kafka message, and if a redeployment or an app crash happened while a message was being processed, that message would have been lost.

Additionally, when there were thousands of messages in the kafka topic, they would all have been read into memory and processed (more or less) simultaneously, leading to very high memory consumption.
@jcyamacho previously approved these changes Jul 12, 2022
@ulich marked this pull request as ready for review on July 12, 2022 10:35
@jcyamacho (Contributor) commented:

drone is all green; there is an issue with the hook:
[screenshot]

@ulich merged commit 66a4aaa into master on Jul 12, 2022
@ulich deleted the remove-concurrent-reads branch on July 12, 2022 10:43