MINOR: shutdown KafkaScheduler at appropriate time by rondagostino · Pull Request #10538 · apache/kafka

rondagostino · 2021-04-14T20:43:15Z

Both the ZooKeeper-based and KRaft brokers invoke KafkaScheduler.shutdown() too early -- before LogManager.shutdown() is invoked. So it is possible for LogManager to try to use the scheduler after the scheduler has been shutdown, which results in an exception. This patch moves the shutdown of the scheduler to a point after the shutdown of LogManager.

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

cmccabe

LGTM

core/src/main/scala/kafka/server/BrokerServer.scala

…an shutdown (#11351) This also fixes KAFKA-13070. We have seen a problem caused by shutting down the scheduler before shutting down LogManager. When LogManager was closing partitions one by one, the scheduler called to delete old segments due to retention. However, the old segments could have been closed by the LogManager, which caused an exception and subsequently marked logdir as offline. As a result, the broker didn't flush the remaining partitions and didn't write the clean shutdown marker. Ultimately the broker took hours to recover the log during restart. This PR essentially reverts #10538 Reviewers: Ismael Juma <ismael@juma.me.uk>, Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>

…an shutdown (apache#11351) This also fixes KAFKA-13070. We have seen a problem caused by shutting down the scheduler before shutting down LogManager. When LogManager was closing partitions one by one, the scheduler called to delete old segments due to retention. However, the old segments could have been closed by the LogManager, which caused an exception and subsequently marked logdir as offline. As a result, the broker didn't flush the remaining partitions and didn't write the clean shutdown marker. Ultimately the broker took hours to recover the log during restart. This PR essentially reverts apache#10538 Reviewers: Ismael Juma <ismael@juma.me.uk>, Kowshik Prakasam <kprakasam@confluent.io>, Jun Rao <junrao@gmail.com>

MINOR: shutdown KafkaScheduler at appropriate time

79ce250

cmccabe approved these changes Apr 14, 2021

View reviewed changes

ijuma reviewed Apr 14, 2021

View reviewed changes

core/src/main/scala/kafka/server/BrokerServer.scala Show resolved Hide resolved

cmccabe added the kraft label Apr 14, 2021

Add comment as per review

c90b60b

cmccabe merged commit 2e25a1d into apache:trunk Apr 15, 2021

ccding mentioned this pull request Sep 21, 2021

KAFKA-13315: log layer exception during shutdown that caused an unclean shutdown #11351

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

MINOR: shutdown KafkaScheduler at appropriate time#10538

MINOR: shutdown KafkaScheduler at appropriate time#10538
cmccabe merged 2 commits intoapache:trunkfrom
rondagostino:minor_scheduler_shutdown

rondagostino commented Apr 14, 2021

Uh oh!

cmccabe left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

rondagostino commented Apr 14, 2021

Committer Checklist (excluded from commit message)

Uh oh!

cmccabe left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants