[improve][broker] Explicitly close LB internal topics when playing a follower (ExtensibleLoadManagerImpl only) #23144

Merged
lhotari merged 1 commit into apache:master on Aug 10, 2024

Conversation

heesung-sn
Contributor

@heesung-sn heesung-sn commented Aug 8, 2024

Motivation

Ownership conflicts over internal topics are a concern when leadership changes frequently; we should make sure that only the current leader owns these internal topics.

Persistent internal topics can self-resolve this ownership conflict through the ledger fencing logic.

However, I think non-persistent internal topics can be a problem, because the fencing logic doesn't apply to non-persistent topics.

To gracefully release the previous ownership of these internal topics, I think we had better make followers explicitly close them. When leadership changes, some clients may reconnect to the old leader if the zk they are connected to and the old leader's zk are slow. Then, when the old leader's zk catches up (that is, delivers the follower signal to the old leader), the old leader should make sure to close any internal topics and their client connections, if any.

Modifications

  • call closeInternalTopics() inside playFollower()
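
As a rough illustration of the idea (not the actual `ExtensibleLoadManagerImpl` code), the sketch below models a broker that drops its internal topics whenever it is told to play the follower role. The `Broker` class, the `Role` enum, the `onClientConnect` helper, and the topic names are all hypothetical and exist only for this example; only the method names `playFollower()` and `closeInternalTopics()` come from this PR.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

/** Toy model only: none of these types are Pulsar classes. */
public class FollowerCloseSketch {

    enum Role { LEADER, FOLLOWER }

    static class Broker {
        // Hypothetical internal topic names, used for illustration only.
        static final Set<String> INTERNAL_TOPICS = Set.of(
                "non-persistent://example/internal-a",
                "persistent://example/internal-b");

        // Topics this broker currently holds open.
        private final Set<String> openTopics = ConcurrentHashMap.newKeySet();
        private volatile Role role = Role.FOLLOWER;

        void playLeader() {
            role = Role.LEADER;
            openTopics.addAll(INTERNAL_TOPICS);
        }

        void playFollower() {
            role = Role.FOLLOWER;
            // The change sketched here: a follower explicitly releases
            // whatever internal topics it may still be holding.
            closeInternalTopics();
        }

        void closeInternalTopics() {
            // Safe to call repeatedly; removing absent entries is a no-op.
            openTopics.removeAll(INTERNAL_TOPICS);
        }

        // Models a client that (re)connects to this broker for a topic.
        void onClientConnect(String topic) {
            openTopics.add(topic);
        }

        Set<String> openTopics() {
            return Set.copyOf(openTopics);
        }

        Role role() {
            return role;
        }
    }

    public static void main(String[] args) {
        Broker oldLeader = new Broker();
        oldLeader.playLeader();

        // A client reconnects to the old leader before the old leader
        // has learned that it lost leadership.
        oldLeader.onClientConnect("non-persistent://example/internal-a");

        // The follower signal finally arrives; the stale topic is closed.
        oldLeader.playFollower();
        System.out.println(oldLeader.role() + " open topics: " + oldLeader.openTopics());
    }
}
```

In this toy model the close is idempotent, so receiving the follower signal more than once is harmless. The real broker would also have to close client connections, which the sketch does not model.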

Verifying this change

  • Make sure that the change passes the CI checks.

Does this pull request potentially affect one of the following parts:

If the box was checked, please highlight the changes

  • Dependencies (add or upgrade a dependency)
  • The public API
  • The schema
  • The default values of configurations
  • The threading model
  • The binary protocol
  • The REST endpoints
  • The admin CLI options
  • The metrics
  • Anything that affects deployment

Documentation

  • doc
  • doc-required
  • doc-not-needed
  • doc-complete

Matching PR in forked repository

PR in forked repository:


github-actions bot commented Aug 8, 2024

@heesung-sn Please add the following content to your PR description and select a checkbox:

- [ ] `doc` <!-- Your PR contains doc changes -->
- [ ] `doc-required` <!-- Your PR changes impact docs and you will update later -->
- [ ] `doc-not-needed` <!-- Your PR changes do not impact docs -->
- [ ] `doc-complete` <!-- Docs have been already added -->

@github-actions github-actions bot added doc-not-needed Your PR changes do not impact docs and removed doc-label-missing labels Aug 8, 2024
@heesung-sn heesung-sn force-pushed the improve-role-change branch from 3cf89c4 to 80d1187 Compare August 8, 2024 22:05
@lhotari
Member

lhotari commented Aug 9, 2024

so there could be fewer internal topic conflicts.

What is an internal topic conflict? What is the impact of it? How does it get resolved if the topic wouldn't be closed (before this PR)?

@codecov-commenter

Codecov Report

Attention: Patch coverage is 81.81818% with 2 lines in your changes missing coverage. Please review.

Project coverage is 74.54%. Comparing base (bbc6224) to head (80d1187).
Report is 504 commits behind head on master.

| Files | Patch % | Lines |
|---|---|---|
| ...dbalance/extensions/ExtensibleLoadManagerImpl.java | 81.81% | 2 Missing ⚠️ |

Additional details and impacted files

@@             Coverage Diff              @@
##             master   #23144      +/-   ##
============================================
+ Coverage     73.57%   74.54%   +0.96%     
- Complexity    32624    34097    +1473     
============================================
  Files          1877     1919      +42     
  Lines        139502   144252    +4750     
  Branches      15299    15773     +474     
============================================
+ Hits         102638   107532    +4894     
+ Misses        28908    28485     -423     
- Partials       7956     8235     +279     

| Flag | Coverage Δ |
|---|---|
| inttests | 27.78% <45.45%> (+3.19%) ⬆️ |
| systests | 24.77% <9.09%> (+0.44%) ⬆️ |
| unittests | 73.90% <81.81%> (+1.05%) ⬆️ |

Flags with carried forward coverage won't be shown. Click here to find out more.

| Files | Coverage Δ |
|---|---|
| ...dbalance/extensions/ExtensibleLoadManagerImpl.java | 80.03% <81.81%> (-0.05%) ⬇️ |

... and 499 files with indirect coverage changes

@heesung-sn
Contributor Author

Ownership conflicts over internal topics are a concern when leadership changes frequently; we should make sure that only the current leader owns these internal topics.

Persistent internal topics can self-resolve this ownership conflict through the ledger fencing logic.

However, I think non-persistent internal topics can be a problem, because the fencing logic doesn't apply to non-persistent topics.

To gracefully release the previous ownership of these internal topics, I think we had better make followers explicitly close them. When leadership changes, some clients may reconnect to the old leader if the zk they are connected to and the old leader's zk are slow. Then, when the old leader's zk catches up (that is, delivers the follower signal to the old leader), the old leader should make sure to close any internal topics and their client connections, if any.

Member

@lhotari lhotari left a comment

LGTM

@lhotari lhotari merged commit 1b43b9d into apache:master Aug 10, 2024
61 checks passed
@lhotari lhotari added this to the 3.4.0 milestone Aug 10, 2024
lhotari pushed a commit that referenced this pull request Aug 10, 2024
…follower (ExtensibleLoadManagerImpl only) (#23144)

(cherry picked from commit 1b43b9d)
lhotari pushed a commit that referenced this pull request Aug 10, 2024
…follower (ExtensibleLoadManagerImpl only) (#23144)

(cherry picked from commit 1b43b9d)
nikhil-ctds pushed a commit to datastax/pulsar that referenced this pull request Aug 14, 2024
…follower (ExtensibleLoadManagerImpl only) (apache#23144)

(cherry picked from commit 1b43b9d)
(cherry picked from commit 1e1860a)
srinath-ctds pushed a commit to datastax/pulsar that referenced this pull request Aug 20, 2024
…follower (ExtensibleLoadManagerImpl only) (apache#23144)

(cherry picked from commit 1b43b9d)
(cherry picked from commit 1e1860a)
grssam pushed a commit to grssam/pulsar that referenced this pull request Sep 4, 2024
4 participants