KAFKA-12543: Change RawSnapshotReader ownership model #10431
Conversation
Force-pushed from c7a0a5c to c7567f2
@hachikuji @dengziming @mumrah This PR is ready for review. Thanks!
dengziming left a comment
Left some minor questions.
mumrah left a comment
Thanks for the patch @jsancio! A few questions and comments inline
```scala
latestSnapshotId().asScala match {
  case Some(snapshotId) if (snapshotId.epoch > latestEpoch ||
      (snapshotId.epoch == latestEpoch && snapshotId.offset > endOffset().offset)) =>
val (truncated, forgottenSnapshots) = latestSnapshotId().asScala match {
```
Should we grab the snapshots lock for this whole match expression like we do in deleteBeforeSnapshot? Is a race possible between this block and deleteBeforeSnapshot?
Synchronizing on snapshots is only needed when accessing that object. In deleteBeforeSnapshot the lock is grabbed because the match expression accesses snapshots in one of its case branches.
In this method I think it is safe to only grab the log lock where we currently do.
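To make that discipline concrete, here is a minimal sketch, assuming snapshots is the mutable.TreeMap[OffsetAndEpoch, Option[FileRawSnapshotReader]] field quoted later in this review; the helper names are hypothetical and the bodies simplified:

```scala
// Any read of `snapshots` takes its monitor...
private def oldestSnapshotId(): Option[OffsetAndEpoch] =
  snapshots synchronized {
    snapshots.headOption.map { case (snapshotId, _) => snapshotId }
  }

// ...and so does any mutation, which is why deleteBeforeSnapshot grabs the
// lock: its match expression touches `snapshots` in one of its branches.
private def forgetBefore(logStartSnapshotId: OffsetAndEpoch): Unit =
  snapshots synchronized {
    val forgotten = snapshots.keys.takeWhile(_.offset < logStartSnapshotId.offset).toSeq
    snapshots --= forgotten
  }
```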
junrao left a comment
@jsancio : Thanks for the PR. Just a few comments below.
```scala
// This object needs to be thread-safe because it is used by the snapshotting thread to notify the
// polling thread when snapshots are created.
snapshotIds: ConcurrentSkipListSet[OffsetAndEpoch],
snapshots: mutable.TreeMap[OffsetAndEpoch, Option[FileRawSnapshotReader]],
```
Is the above comment still accurate since snapshots is no longer thread safe?
No. I updated the comment. I'll push a commit tomorrow after a few other changes.
```java
try {
    Utils.atomicMoveWithFallback(immutablePath, deletedPath, false);
} catch (IOException e) {
    log.error("Error renaming snapshot file from {} to {}", immutablePath, deletedPath, e);
```
Should we just fail the controller on IOException?
@mumrah suggested converting all of the IOExceptions to UncheckedIOException. Kafka doesn't have a precedent for doing that, but maybe we should going forward. I filed https://issues.apache.org/jira/browse/KAFKA-12773, but I'll change it here to re-throw instead of logging this message.
By changing it to UncheckedIOException, this will unwind the stack for the polling thread. Tomorrow I'll look into how we handle that case, but it may already shut down the broker and controller.
Excuse the delay @junrao, but I looked into this in more detail today. I changed this code to throw an exception instead. This exception will be unhandled by the KafkaRaftClient polling thread in both the broker and controller. This will cause the thread to terminate, but I don't think it will cause the JVM process to terminate.
We have the following Jira to revisit our exception handling: https://issues.apache.org/jira/browse/KAFKA-10594. I added a comment there to document the issue you highlighted here. Do you mind if we tackle this problem holistically in that Jira?
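For concreteness, a minimal sketch of the rethrow being described, written in Scala to match the other sketches here even though the file in question is Java; immutablePath and deletedPath come from the diff above, and the wrapper type is the suggested UncheckedIOException:

```scala
import java.io.{IOException, UncheckedIOException}
import org.apache.kafka.common.utils.Utils

try {
  Utils.atomicMoveWithFallback(immutablePath, deletedPath, false)
} catch {
  case e: IOException =>
    // Rethrow so the failure unwinds the polling thread's stack instead of
    // being logged and silently dropped.
    throw new UncheckedIOException(
      s"Error renaming snapshot file from $immutablePath to $deletedPath", e)
}
```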
```java
try {
    fileRecords.close();
} catch (IOException e) {
    throw new RuntimeException(e);
```
Should we throw KafkaStorageException?
I am not sure; I could use some guidance here. I read the documentation for KafkaStorageException (https://github.com/apache/kafka/blob/trunk/clients/src/main/java/org/apache/kafka/common/errors/KafkaStorageException.java#L19-L30), and it looks like Kafka uses KafkaStorageException when the IO error is visible to the client.
On the server (broker and controller) this code will be called asynchronously by the same scheduler used for deleting log segments. In that case CoreUtils.swallow is used, which logs a WARN message. I think we should do the same here.
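A minimal sketch of that log-and-continue behavior, assuming Kafka's Logging trait is in scope for warn and that a snapshotId is available for the message (both assumptions for illustration):

```scala
import java.io.IOException

// Mirror how the scheduler treats failed log-segment deletions: log a WARN
// and keep going rather than surfacing a storage error to clients.
try {
  fileRecords.close()
} catch {
  case e: IOException =>
    warn(s"Failed to close FileRecords for snapshot $snapshotId", e)
}
```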
junrao left a comment
mumrah left a comment
LGTM. Agree that we should revisit the exception handling later on.
The Kafka networking layer doesn't close `FileRecords` and assumes that they are already open when sending them over a channel. To support this pattern, this commit changes the ownership model for `FileRawSnapshotReader` so that they are owned by `KafkaMetadataLog`. This includes:

1. Changing `KafkaMetadataLog`'s `snapshotIds` from a `Set[OffsetAndEpoch]` to a `TreeMap[OffsetAndEpoch, Option[FileRawSnapshotReader]]`. This map contains all of the known snapshots; the value is `Some` if a snapshot reader has been opened in the past (see the sketch after this list).
2. Splitting and changing the functionality in `KafkaMetadataLog::removeSnapshotFilesBefore` so that a) `forgetSnapshotsBefore` removes any snapshot less than the given snapshot id from `snapshots`, and b) `removeSnapshots` deletes the snapshots enumerated by `forgetSnapshotsBefore`.
3. Changing the interface `RawSnapshotReader` to not extend `Closeable`, since only `KafkaMetadataLog` is responsible for closing snapshots. `FileRawSnapshotReader` implements `AutoCloseable`.
4. Fixing the implementation of `handleFetchSnapshotRequest` in `KafkaRaftClient` so that the `RawSnapshotReader` and the associated `FileRecords` are not closed before they are sent to the network layer.
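A condensed sketch of the resulting ownership model, assuming a `FileRawSnapshotReader.open(logDir, snapshotId)` factory, a `logDir` field, and an implicit `Ordering[OffsetAndEpoch]`; the real `KafkaMetadataLog` on trunk differs in detail:

```scala
import scala.collection.mutable

// Every known snapshot id is tracked; the value becomes Some once a reader has
// been opened, and only KafkaMetadataLog ever closes those readers.
private val snapshots =
  mutable.TreeMap.empty[OffsetAndEpoch, Option[FileRawSnapshotReader]]

def readSnapshot(snapshotId: OffsetAndEpoch): Option[FileRawSnapshotReader] =
  snapshots synchronized {
    snapshots.get(snapshotId).map {
      case Some(reader) => reader // already opened on an earlier read
      case None =>
        val reader = FileRawSnapshotReader.open(logDir, snapshotId) // assumed factory
        snapshots.put(snapshotId, Some(reader)) // cached so the log can close it later
        reader
    }
  }
```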