Add new transactional producer #130
Conversation
Some design questions I could use feedback on.
Thanks a lot for this, already looking really good!
@vlovgr I think I covered everything.
Codecov Report

@@           Coverage Diff            @@
##           0.20.x     #130     +/-  ##
==========================================
- Coverage   93.75%   93.62%   -0.13%
==========================================
  Files          40       48       +8
  Lines        1153     1208      +55
  Branches       78       93      +15
==========================================
+ Hits         1081     1131      +50
- Misses         72       77       +5

Continue to review full report at Codecov.
I've been working to get a 0.19.x-compatible version of this PR set up in my project at work as a stop-gap while we work out the rough edges of the API here. The first problem I hit was picking a transactional ID. Alpakka asks for a user-defined ID, but I found that Kafka Streams auto-generates IDs for users. When I dug into the implementation, I found that Kafka Streams uses a unique transactional ID per "task" (topic/partition). Other projects like spring-kafka have followed the same pattern. From what I've read, it's effectively the only way to ensure messages don't get double-processed when partitions are rebalanced. Do you think this should change our design here at all? I could see a setup where we remove the
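For illustration, a per-task transactional ID along the lines of the Kafka Streams pattern could be derived as below. This is a hypothetical helper, not code from this PR, and the exact ID format is an assumption; the point is only that each group/topic/partition combination gets a stable, unique ID.

```java
// Hypothetical helper (not part of this PR): derive a unique transactional ID
// per "task", i.e. per consumer-group/topic/partition combination, mirroring
// the pattern Kafka Streams uses. The exact format below is illustrative.
public class TransactionalIds {
    public static String forTask(String groupId, String topic, int partition) {
        // One ID per topic-partition means a zombie instance that loses a
        // partition in a rebalance gets fenced off from committing stale
        // transactions for that partition.
        return groupId + "-" + topic + "-" + partition;
    }

    public static void main(String[] args) {
        System.out.println(forTask("my-group", "orders", 3)); // my-group-orders-3
    }
}
```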
If I understand correctly, the mentioned topic-partition strategy also means one producer per topic-partition, right? That might be a bit much in the single-instance scenario, but for multiple instances it sounds acceptable. Perhaps this is even behaviour we can toggle. (I think the trickiest part of the producer-per-topic-partition approach is managing the creation and closing of producers as partitions are assigned and revoked. Maybe we could do something clever with rebalance listeners to get this working nicely.)
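As a sketch of the bookkeeping described above, assuming plain strings for partitions and a generic handle type in place of a real producer (both assumptions for self-containment):

```java
import java.util.*;
import java.util.function.Function;

// Illustrative sketch: keep one producer handle per assigned partition,
// creating handles on assignment and discarding them on revocation, as a
// rebalance listener would. "P" stands in for a real transactional producer;
// partitions are plain "topic-partition" strings here.
public class PartitionProducers<P> {
    private final Map<String, P> producers = new HashMap<>();
    private final Function<String, P> create;

    public PartitionProducers(Function<String, P> create) {
        this.create = create;
    }

    // Would be called from ConsumerRebalanceListener#onPartitionsAssigned.
    public void assigned(Collection<String> partitions) {
        for (String tp : partitions) {
            producers.computeIfAbsent(tp, create);
        }
    }

    // Would be called from ConsumerRebalanceListener#onPartitionsRevoked;
    // a real implementation would also close each producer before removal.
    public void revoked(Collection<String> partitions) {
        for (String tp : partitions) {
            producers.remove(tp);
        }
    }

    public Set<String> active() {
        return producers.keySet();
    }
}
```

The tricky parts a real implementation would still need to handle are flushing in-flight transactions before closing a revoked producer, and doing all of this without blocking the rebalance callback for too long.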
👍 I'm ok with merging this and setting up a new PR for improvements. Thanks for all the help and review! Is there anything I can do to help with adding docs & tests on this one?
Great! If there's anything you feel is missing in terms of docs or tests, then feel free to add it. Otherwise, I'll just have a final look through this tomorrow and then merge it. 👍
Thanks a lot for this @danxmoran! 👍
Fixes #128.
Here's an implementation which (mostly) avoids touching existing classes. There's a bunch of code copy-pasted from the existing KafkaProducer and ProducerMessage which might be nice to consolidate 😄
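For context, the lifecycle such a producer wraps is Kafka's transactional protocol. A rough sketch of the state discipline is below; the method names match org.apache.kafka.clients.producer.KafkaProducer, but the state machine itself is illustrative and simplified (it omits fatal-error and fenced states, for example):

```java
// Rough sketch (not this PR's code) of the state discipline Kafka's
// transactional producer API enforces: initTransactions is one-time setup,
// then each transaction is a begin/commit (or begin/abort) pair.
public class TxLifecycle {
    public enum State { UNINITIALIZED, READY, IN_TRANSACTION }

    private State state = State.UNINITIALIZED;

    public void initTransactions() {
        require(state == State.UNINITIALIZED); // one-time setup per producer
        state = State.READY;
    }

    public void beginTransaction() {
        require(state == State.READY);
        state = State.IN_TRANSACTION;
    }

    public void commitTransaction() {
        require(state == State.IN_TRANSACTION);
        state = State.READY;
    }

    public void abortTransaction() {
        require(state == State.IN_TRANSACTION);
        state = State.READY;
    }

    public State state() {
        return state;
    }

    private void require(boolean ok) {
        if (!ok) throw new IllegalStateException("invalid transition from " + state);
    }
}
```

Part of what the PR's wrapper has to guarantee is that user code can never observe these transitions out of order, even when produce effects are run concurrently.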