-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add feature to wait on ready replicas on scaling up #91
Add feature to wait on ready replicas on scaling up #91
Conversation
Signed-off-by: Matej Gera <matejgera@gmail.com>
Signed-off-by: Matej Gera <matejgera@gmail.com>
Hi, I have build an image from this PR, and unfortunately the controller doesnt seems to behave as expected. My architecture looks like this: My controller config:
I did a scale up from 15 to 18 replicas, so directly 3 replicas in one update and my service went down with the errors below:
The configmap generated by the controller has been update with all the pods including the not ready ones and also the not created ones. Did I miss something in my test? |
My controller's serviceaccount didnt have read permission on the pod resources.
|
So I have tried again and this Pr doesn't work. |
Hey @lud97x thanks for taking the time to try this out! So after you adjusted the service account permissions, did it work? The reason for the difference is stated in the README update I have added as part of this PR. I removed that part because scaling on every pod (un)readiness could potentially lead to a frequent hashring changes, see the explanation in the README in my PR. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Based on work done in #89
This change adds a flag
--allow-only-ready-replicas
that changes the behavior of controller on a scale up - if enabled, the controller will first wait on all replicas to be ready before adding them to the hashring. The feature is documented as well.