-
Notifications
You must be signed in to change notification settings - Fork 155
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GKE nodes going to NotReady state after installing capsule #597
Comments
Hi @MeghanaSrinath, thanks for reporting this! This is expected behavior and the reason is for the following webhook: This webhook controls specific actions that a tenant owner could issue against nodes of their tenant. It's a requirement for the BYOD scenario, along with Capsule Proxy, section For your use case, the Let me know if I can help you further with any change you'd like to propose, and feel free to close the issue. |
This fix worked perfectly. Thank you! |
We have a GKE cluster on which we have deployed capsule. We have used this values.yaml file during installation via helm. But once the cluster is scaled to 0 nodes and when we try to scale up to bring new nodes, all the nodes are going to NotReady state. Even if a new node tries to come up due to autoscaling, that goes to NotReady as well. We could observe the below errors when we describe the node.
Also, few of the pods in kube-system namespace were also in Pending state and kube-proxy pod was crashing continuously on each node.
Calico pods were having this error on describing:
We could observe that, on autoscaling, the new VM instances had indeed come up in GKE console, but they were unable to join the cluster and exisitng nodes were going to NotReady on cluster restart.
If we delete the
validatingwebhookconfiguration
-capsule-validating-webhook-configuration
, all these errors are resolved and cluster also works perfectly fine. We are even able to create tenants and the restrictions are also working fine.Please let us know why is this webhook causing this problem and if this can be fixed.
The text was updated successfully, but these errors were encountered: