
Long-lived transactions #1156

Open · wants to merge 6 commits into base: series/2.x

Conversation

vitoracle

Enhancement

We've had to use Kafka transactions at work and, fortunately, fs2-kafka has a pretty nice API through TransactionalKafkaProducer (thanks :D).

However, in our use case, we had to hold the Kafka transaction open for a long period of time, since we were producing substantial amounts of records into Kafka. We also wanted the whole operation to be atomic.

Unfortunately, it seems to me that TransactionalKafkaProducer's produce method begins and commits/aborts a transaction around only a single chunk of ProducerRecords. This is not enough for us, as we receive chunks periodically (from a stream).

Proposal

API

An API such as:

for {
  producer <- TransactionalKafkaProducer.stream { ... }
  _        <- Stream.eval(producer.beginTransaction)
  _        <- Stream.eval(producer.produce { ... })
  _        <- Stream.eval(producer.commitTransaction)
} yield ()

would suffice.

However, this can easily introduce invalid state: the consumer of this API could, for example, attempt to commit or abort a transaction that does not even exist. The user could also forget to commit/abort the transaction, intentionally or not.

A better approach would be to create a Transaction class, similar to a TransactionalKafkaProducer except that it can only be obtained within a Resource context. This ensures that the transaction is always finalized when the resource is released.

This is what is achieved here with:

/**
  * Creates a new [[Transaction]] in the `Resource` context. This operation will block until the
  * previous transaction, if any, is finished. Once this resource is released, the transaction and
  * offsets will be committed, or aborted if an exception is thrown.
  */
def createTransaction: Resource[F, Transaction.WithoutOffsets[F, K, V]]

Example:

(for {
  producer    <- TransactionalKafkaProducer.resource { ... }
  transaction <- producer.createTransaction // blocks until acquired 
} yield transaction).use { transaction => 
  transaction.produce { ... } *> transaction.produce { ... }
}
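To make the release semantics above concrete, here is a minimal plain-Scala sketch (not fs2-kafka's actual implementation) of what the Resource finalizer does: begin the transaction on acquisition, then commit on successful completion or abort if the body raised. The `runTransaction` helper and its callback parameters are hypothetical names for illustration only.

```scala
// Hypothetical sketch of a Resource-style commit-or-abort release step,
// written with try/catch instead of cats-effect's Resource for clarity.
def runTransaction[A](begin: () => Unit, commit: () => Unit, abort: () => Unit)(
    body: => A
): A = {
  begin()
  try {
    val a = body
    commit() // normal release: commit the transaction (and offsets)
    a
  } catch {
    case e: Throwable =>
      abort() // release on error: abort the transaction
      throw e
  }
}
```

The key property is that exactly one of `commit` or `abort` always runs, which is what obtaining the Transaction only through a Resource guarantees.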

We also need to ensure the resource is only acquired when the Semaphore has a permit available. For that, I had to make an internal change, exposing WithTransactionalProducer's internal Semaphore:

private[kafka] sealed abstract class WithTransactionalProducer[F[_]] {
  ...

  def semaphore: Semaphore[F]
}

We do have ExclusiveAccess[F, A]; however, it does not suffice, since it only holds the permit for a single operation (F[A] => F[A]).
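The distinction can be sketched in plain Scala, using java.util.concurrent.Semaphore to stand in for cats.effect.std.Semaphore (the `TransactionGate` class and method names here are hypothetical, for illustration only): the permit must be held for the whole lifetime of the transaction, not just one operation.

```scala
import java.util.concurrent.Semaphore

// Hypothetical sketch: the single permit is acquired when the transaction
// resource is acquired and released only when it is finalized, so concurrent
// callers block until the previous transaction finishes.
final class TransactionGate {
  private val permit = new Semaphore(1)

  // Roughly Resource.make(semaphore.acquire)(_ => semaphore.release):
  def withTransaction[A](body: => A): A = {
    permit.acquire() // blocks until the previous transaction is finished
    try body
    finally permit.release() // always released, even on exception
  }

  def availablePermits: Int = permit.availablePermits()
}
```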

Batch committing

Within Transaction's implementation, a Ref[F, CommittableOffsetBatch[F]] is used to store all the batches to be committed once the resource is released. Batches are merged together in every call to produce.
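A simplified sketch of that accumulation, with a plain var and Map standing in for Ref[F, CommittableOffsetBatch[F]] (the `OffsetBatchSketch` class and its method names are hypothetical): every produce merges its offsets into one batch, committed in a single step when the resource is released.

```scala
// Hypothetical sketch of merging committable offsets across produce calls.
final class OffsetBatchSketch {
  // topic-partition -> highest offset seen so far
  private var batch: Map[String, Long] = Map.empty

  // Called on every produce: merge the new offsets into the pending batch.
  def mergeOffsets(offsets: Map[String, Long]): Unit =
    batch = offsets.foldLeft(batch) { case (acc, (tp, off)) =>
      acc.updated(tp, math.max(acc.getOrElse(tp, -1L), off))
    }

  // Called once on release: commit the accumulated batch in one go.
  def commitBatch(): Map[String, Long] = {
    val toCommit = batch
    batch = Map.empty
    toCommit
  }
}
```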

Transaction leaks

Leaks are still possible. Here is an example from a test that causes one:

for {
  globalStateRef <- IO
    .ref[Option[Transaction.WithoutOffsets[IO, String, String]]](None)
  makeProducer = TransactionalKafkaProducer.resource(
    TransactionalProducerSettings(
      s"id-$topic",
      producerSettings[IO]
        .withRetries(Int.MaxValue)
    )
  )
  _ <- makeProducer.flatMap(producer => producer.createTransaction).use {
    transaction =>
      globalStateRef.set(Some(transaction))
      // once this resource is released, the transaction will finalize
  }
  toProduce = ProducerRecords.one(ProducerRecord(topic, "key-0", "value-0"))
  _ <- globalStateRef.get.flatMap(_.get.produceWithoutOffsets(toProduce)) // by now, the transaction is already over, so this throws TransactionLeakedException
} yield ()

Within Transaction's implementation, a Ref[F, Boolean] is used to indicate whether or not a transaction has been closed. If there's a call to produce when the transaction is over, it throws TransactionLeakedException.
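That guard can be sketched in plain Scala, with an AtomicBoolean standing in for Ref[F, Boolean] (the `LeakGuardedTransaction` class and its members are hypothetical names): once the resource finalizer flips the flag, any later produce fails fast instead of writing outside the transaction.

```scala
import java.util.concurrent.atomic.AtomicBoolean

final class TransactionLeakedException
    extends RuntimeException("transaction has already been finalized")

// Hypothetical sketch of the closed-flag leak guard.
final class LeakGuardedTransaction {
  private val closed = new AtomicBoolean(false)

  def produce(record: String): String =
    if (closed.get()) throw new TransactionLeakedException
    else s"produced: $record"

  // Called by the Resource finalizer when the transaction ends.
  def close(): Unit = closed.set(true)
}
```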

It would be nice to hear any input you have on how to prevent this, if possible.

All in all, this approach is pretty much identical to what zio-kafka currently does with its transaction implementation. But I'd like to hear your opinion on how we can make this better.

bplommer requested review from vlovgr and LMnet on March 24, 2023 at 16:18
@bplommer (Member)

Thanks for this! It looks really thoughtful - much appreciated. I'm a bit (very, very) behind with core library maintenance so I want to catch up on that first, but I'll try to look at this properly soon.

@vitoracle (Author)

@bplommer No worries! We are already using this in production, but it'd be cool if it were integrated into the library so we don't have to maintain a fork.

Seems like the CI fails during header checks, is that something I can fix?

@bplommer (Member)

Seems like the CI fails during header checks, is that something I can fix?

I've updated it against the base branch, that should fix this.

@bplommer (Member)

Oh, there are some new files that the update didn't touch; you need to run `sbt headerCreate` to add the missing headers.

@vitoracle (Author)

@bplommer Should work now; at least it works locally, unless I'm missing something.
