Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Jobs and tests for common maintenance operations #95

Merged
merged 13 commits into from
Feb 3, 2018
Merged

Conversation

solsson
Copy link
Contributor

@solsson solsson commented Nov 10, 2017

No description provided.

as long as you have one up-to-date replica.

The use case is broker outages that are longer than mere re-scheduling,
such as zone outage in a multi-zone cluster.
@solsson solsson force-pushed the ops-jobs branch 2 times, most recently from 914029e to 59fb4b9 Compare November 10, 2017 08:55
@solsson
Copy link
Contributor Author

solsson commented Nov 10, 2017

This is tagged automation but so far it only aids manual maintenance, so I suggest merge to v.3.1.

My work with this PR reinforced my position that production topics should have three replicas but producers should require no more than two acks (assuming a three-broker cluster over three availability zones).

@solsson
Copy link
Contributor Author

solsson commented Nov 10, 2017

Got a pretty good test case in #98. Maybe it would be a sensible default for the reassign job to list all topics and all Ready brokers, if we can somehow prompt for human confirmation after echoing the proposal.

@solsson
Copy link
Contributor Author

solsson commented Feb 3, 2018

There's other tooling for stuff like increasing replication factor, and bash scripts can be more elegant than mine, but I found 49649e1 immensely useful and it's quite by the (definitive guide) book.

Please be careful with these scripts in production though. They're not tested to the degree that automation should be.

@solsson solsson merged commit 467fbb9 into master Feb 3, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant