KAFKA-12558: Do not prematurely mutate partiton state and provide con… by emilnkrastev · Pull Request #11818 · apache/kafka

emilnkrastev · 2022-02-27T12:58:42Z

This PR addresses the issue described here KAFKA-12558.

Additionally, The PR also allows to configure the max outstanding syncs in MirrorSourceTask because it is currently hardcoded.
A lot of offset syncs messages are lost during burst of messages in the source cluster or when the MirrorMaker has a lot to catch up (fist run or being inactive for a while). In such scenario it will take a while to sync the offsets in the destination cluster with partitions without regular activity even with reaching the maximum parallelism - 1 task per partition.

The PR tries to mitigate the issue by providing a way to change the maximum allowed concurrent offset syncs so that there are less offset syncs loses.

Here are my steps to reproduce the offset syncs issue because of the max outstanding syncs limited to 10:

Source topic with 12 partitions and 1400 messages with minimal activity. Messages are getting produced on daily basis
Run MirrorMaker2 process within the destination cluster network with offset syncs topic location set to target and 5 tasks
372 offset syncs messages arrived in the destination cluster offset syncs topic
9 out of 12 partitions are not synced correctly in the destination cluster
Waiting for hours and more for new messages to arrive in source Kafka cluster which will sync the correct offsets

gharris1727 · 2022-12-19T18:30:45Z

@emilnkrastev Thanks for the fix! Are you still interested in making this change?

If so, I would suggest removing the configuration changes, as those would require a KIP. We can much more easily merge a targeted bug fix.

Also I noticed that your name and email are not set in your commit. You may want to configure these so that the change can be attributed to you.

Thanks!

emilnkrastev · 2022-12-19T18:58:52Z

@gharris1727 I'm still interested in the change and I will update the MR in the next couple of days.

Thanks for the reply!

emilnkrastev · 2022-12-22T06:23:03Z

@gharris1727 The PR is updated. Could you please take a look?

gharris1727 · 2022-12-22T18:19:12Z

Thanks for the update @emilnkrastev.

It appears there are some CI failures that mention offsets translation that we need to look into, especially in MirrorConnectorsIntegrationSSLTest

Additionally, I'm trying to think of ways to reliably test this behavior. You've translated the existing test to work with the new update signature, but we haven't really targeted the semaphore balking behavior in a regression test. The lack of code coverage is partly why we didn't notice this before.

Here's a couple of ideas that might work, i'll let you choose whichever one you think makes more sense.

Mock the semaphore acquire/release and unit test to simulate balking in MirrorSourceTaskTest
Mock the producer and delay the send callback to cause the real semaphore to balk in MirrorSourceTaskTest
Refactor the MirrorSourceTask semaphore to make it more unit-testable.
Add a test to MirrorConnectorsIntegrationTestBase with a large contention on this semaphore and assert that the checkpoint offsets are sensible.

Thanks!

emilnkrastev · 2022-12-27T13:02:36Z

@gharris1727 The PR has been updated by adding additional unit test for testing partition state mutation and changing the update logic. The MirrorConnectorsIntegrationSSLTest Integration test was failing due to missing synced offsets and it was fix by always updating the previous upstream/downstream offset (the original logic is doing it).

There are still test failures but I believe there are not related to the PR changes. I can see similar test failures on the other PR's CI pipelines.

gharris1727

Thanks so much @emilnkrastev, this LGTM once some small nits are resolved.
The test looks great, and I'm glad that we're actually covering this code path. If all of the other tests are fine with those three instance variables being null, I'm wondering how much test coverage we're really missing right now.

gharris1727 · 2022-12-27T21:12:08Z

...st/java/org/apache/kafka/connect/mirror/integration/MirrorConnectorsIntegrationBaseTest.java


-        assertTrue(backupOffsets.containsKey(
-            new TopicPartition("primary.test-topic-1", 0)), "Offsets not translated downstream to backup cluster. Found: " + backupOffsets);
+        for (int i = 0; i < NUM_PARTITIONS; i++) {


Nice change!

connect/mirror/src/test/java/org/apache/kafka/connect/mirror/MirrorSourceTaskTest.java

gharris1727 · 2022-12-27T21:22:15Z

@C0urante could you take a look at this, please?

C0urante · 2022-12-29T15:24:19Z

Thanks for this fix, @emilnkrastev. I want to make sure I understand the goal here before commenting on the actual code changes (although I've made sure to read through them carefully before writing this).

It looks like incorrect updates to the partitionStates field (or its contents) might result in the starvation of offset syncs for some topic partitions. The task will believe that it has performed syncs that it hasn't, and as a result, may never see that the lag between the (true) last-synced offset and the offset for the last-replicated record has exceeded the value for the user-configured offset.lag.max property.

However, it looks like KAFKA-12468 and possibly KAFKA-12558 describe a different problem: it's not just a matter of offset syncs being delayed for longer than expected or even indefinitely; instead, the offset syncs that do make it to the downstream cluster actually contain incorrect values, which in some extreme cases can result in things like negative consumer lag.

I don't see how this PR would address an issue with correctness in the actual offsets used for syncs; instead, it seems like the value here is that it will allow offset syncs to take place in cases where they should be but are currently not.

Is that a fair assessment of the goal here? If so, I'm happy to review/merge a fix with that goal, and if not, would you mind shedding some light on what the goal is and where I might be missing something?

emilnkrastev · 2022-12-29T16:50:17Z

@C0urante your assessment of the goal is correct - the PR changes are aiming to allow syncs to take place where they should be.

I believe there are 2 different problems - one that results in delayed offset syncs and the other that results in negative lag in the destination cluster. I'm trying to mitigate the first problem (delayed offset syncs).

C0urante · 2022-12-30T16:38:45Z

Thanks @emilnkrastev, good to know we're on the same page!

Thinking about this a little more, I wonder if a slightly more-involved approach is warranted. With the current proposal, it looks like we might still miss some cases that are predicated on the last-seen (not last-synced) upstream/downstream offsets.

For example, if the downstream topic is deleted and then recreated, the downstreamOffset < previousDownstreamOffset part of the condition in shouldSyncOffsets will evaluate to true, and we'll try to send an offset sync as a result. If there are too many in-flight offset syncs already and we can't acquire access to the semaphore, we'll skip that sync--but since we'll also have already updated the previousDownstreamOffset field, the next call to shouldSyncOffsets won't (necessarily) return true, even though we should still attempt a sync in that case.

For a short-term fix, I think the PartitionState class could be adapted to "remember" whether syncs are necessary, and allow external callers to clear that state whenever an offset sync has actually been performed. That way, we never run the risk of "dropping" offset syncs that were supposed to be performed but were blocked by access to the semaphore.

How does that sound?

…nc event

emilnkrastev · 2022-12-30T19:57:30Z

@C0urante Your suggestion sounds perfect.

I have updated the PR.

C0urante

Thanks @emilnkrastev! The functional parts LGTM; just a few thoughts about the testing strategy.

connect/mirror/src/main/java/org/apache/kafka/connect/mirror/MirrorSourceTask.java

connect/mirror/src/test/java/org/apache/kafka/connect/mirror/MirrorSourceTaskTest.java

C0urante

Thanks @emilnkrastev! One final round of comments and this should be good to go.

connect/mirror/src/test/java/org/apache/kafka/connect/mirror/MirrorSourceTaskTest.java

C0urante

LGTM, thanks @emilnkrastev!

…rror Maker 2 (apache#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

* apache-github/trunk: KAFKA-14601: Improve exception handling in KafkaEventQueue apache#13089 KAFKA-14367; Add `OffsetCommit` to the new `GroupCoordinator` interface (apache#12886) KAFKA-14530: Check state updater more often (apache#13017) KAFKA-14304 Use boolean for ZK migrating brokers in RPC/record (apache#13103) KAFKA-14003 Kafka Streams JUnit4 to JUnit5 migration part 2 (apache#12301) KAFKA-14607: Move Scheduler/KafkaScheduler to server-common (apache#13092) KAFKA-14367; Add `OffsetFetch` to the new `GroupCoordinator` interface (apache#12870) KAFKA-14557; Lock metadata log dir (apache#13058) MINOR: Implement toString method for TopicAssignment and PartitionAssignment (apache#13101) KAFKA-12558: Do not prematurely mutate internal partition state in Mirror Maker 2 (apache#11818) KAFKA-14540: Fix DataOutputStreamWritable#writeByteBuffer (apache#13032) KAFKA-14600: Reduce flakiness in ProducerIdExpirationTest (apache#13087) KAFKA-14279: Add 3.3.x streams system tests (apache#13077) MINOR: bump streams quickstart pom versions and add to list in gradle.properties (apache#13064) MINOR: Update KRaft cluster upgrade documentation for 3.4 (apache#13063) KAFKA-14493: Introduce Zk to KRaft migration state machine STUBs in KRaft controller. (apache#12998) KAFKA-14570: Fix parenthesis in verifyFullFetchResponsePartitions output (apache#13072) MINOR: Remove public mutable fields from ProducerAppendInfo (apache#13091)

…master * apache-github/trunk: (23 commits) MINOR: Include the inner exception stack trace when re-throwing an exception (apache#12229) MINOR: Fix docs to state that sendfile implemented in `TransferableRecords` instead of `MessageSet` (apache#13109) Update ProducerConfig.java (apache#13115) KAFKA-14618; Fix off by one error in snapshot id (apache#13108) KAFKA-13709 (follow-up): Avoid mention of 'exactly-once delivery' or 'delivery guarantees' in Connect (apache#13106) KAFKA-14367; Add `TxnOffsetCommit` to the new `GroupCoordinator` interface (apache#12901) KAFKA-14568: Move FetchDataInfo and related to storage module (apache#13085) KAFKA-14612: Make sure to write a new topics ConfigRecords to metadata log iff the topic is created (apache#13104) KAFKA-14601: Improve exception handling in KafkaEventQueue apache#13089 KAFKA-14367; Add `OffsetCommit` to the new `GroupCoordinator` interface (apache#12886) KAFKA-14530: Check state updater more often (apache#13017) KAFKA-14304 Use boolean for ZK migrating brokers in RPC/record (apache#13103) KAFKA-14003 Kafka Streams JUnit4 to JUnit5 migration part 2 (apache#12301) KAFKA-14607: Move Scheduler/KafkaScheduler to server-common (apache#13092) KAFKA-14367; Add `OffsetFetch` to the new `GroupCoordinator` interface (apache#12870) KAFKA-14557; Lock metadata log dir (apache#13058) MINOR: Implement toString method for TopicAssignment and PartitionAssignment (apache#13101) KAFKA-12558: Do not prematurely mutate internal partition state in Mirror Maker 2 (apache#11818) KAFKA-14540: Fix DataOutputStreamWritable#writeByteBuffer (apache#13032) KAFKA-14600: Reduce flakiness in ProducerIdExpirationTest (apache#13087) ...

…rror Maker 2 (apache#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

…rror Maker 2 (#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

…rror Maker 2 (apache#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

KAFKA-12558: Do not prematurely mutate partiton state

36f691e

emilnkrastev force-pushed the KAFKA-12558 branch from c428a46 to 36f691e Compare December 21, 2022 09:06

KAFKA-12558: Added unit test and improve partition mutation logic

673b792

gharris1727 approved these changes Dec 27, 2022

View reviewed changes

emilnkrastev added 2 commits December 30, 2022 21:37

KAFKA-12558: Fix typo and remove new line in MirrorSourceTaskTest

27bb18f

KAFKA-12558: Update on the partitation state must result in offset sy…

d15ebcd

…nc event

emilnkrastev force-pushed the KAFKA-12558 branch from 67f6f71 to d15ebcd Compare December 30, 2022 19:55

C0urante reviewed Jan 3, 2023

View reviewed changes

KAFKA-12558: Increase test coverage

b73b226

C0urante reviewed Jan 9, 2023

View reviewed changes

KAFKA-12558: Update test for verifying offset sync events

126115f

C0urante approved these changes Jan 10, 2023

View reviewed changes

C0urante merged commit 6e7e2e0 into apache:trunk Jan 10, 2023

AnatolyPopov pushed a commit to aiven/kafka that referenced this pull request Jan 12, 2023

KAFKA-12558: Do not prematurely mutate internal partition state in Mi…

609986f

…rror Maker 2 (apache#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

guozhangwang pushed a commit to guozhangwang/kafka that referenced this pull request Jan 25, 2023

KAFKA-12558: Do not prematurely mutate internal partition state in Mi…

8d6e2d5

…rror Maker 2 (apache#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

AnatolyPopov pushed a commit to aiven/kafka that referenced this pull request Feb 22, 2023

KAFKA-12558: Do not prematurely mutate internal partition state in Mi…

92301f1

…rror Maker 2 (apache#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

C0urante pushed a commit that referenced this pull request Feb 23, 2023

KAFKA-12558: Do not prematurely mutate internal partition state in Mi…

55e69a0

…rror Maker 2 (#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

C0urante pushed a commit that referenced this pull request Feb 23, 2023

KAFKA-12558: Do not prematurely mutate internal partition state in Mi…

7c78f3f

…rror Maker 2 (#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

AnatolyPopov pushed a commit to aiven/kafka that referenced this pull request Mar 24, 2023

KAFKA-12558: Do not prematurely mutate internal partition state in Mi…

cca1bd3

…rror Maker 2 (apache#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

giuseppelillo pushed a commit to aiven/kafka that referenced this pull request Mar 29, 2023

KAFKA-12558: Do not prematurely mutate internal partition state in Mi…

342218b

…rror Maker 2 (apache#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

giuseppelillo pushed a commit to aiven/kafka that referenced this pull request Apr 6, 2023

KAFKA-12558: Do not prematurely mutate internal partition state in Mi…

58952a9

…rror Maker 2 (apache#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

jeqo pushed a commit to aiven/kafka that referenced this pull request Sep 28, 2023

KAFKA-12558: Do not prematurely mutate internal partition state in Mi…

e922b66

…rror Maker 2 (apache#11818) Reviewers: Greg Harris <greg.harris@aiven.io>, Chris Egerton <chrise@aiven.io>

Conversation

emilnkrastev commented Feb 27, 2022

Uh oh!

gharris1727 commented Dec 19, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

emilnkrastev commented Dec 19, 2022

Uh oh!

emilnkrastev commented Dec 22, 2022

Uh oh!

gharris1727 commented Dec 22, 2022

Uh oh!

emilnkrastev commented Dec 27, 2022

Uh oh!

gharris1727 left a comment

Choose a reason for hiding this comment

Uh oh!

gharris1727 Dec 27, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

gharris1727 commented Dec 27, 2022

Uh oh!

C0urante commented Dec 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

emilnkrastev commented Dec 29, 2022

Uh oh!

C0urante commented Dec 30, 2022

Uh oh!

emilnkrastev commented Dec 30, 2022

Uh oh!

C0urante left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

C0urante left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

C0urante left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gharris1727 commented Dec 19, 2022 •

edited

Loading

C0urante commented Dec 29, 2022 •

edited

Loading