-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Flanneld doesn't reconnect to the apiserver #1272
Comments
I have reproduced this as well in 0.12.0. When rolling updating master nodes, it will cause this to go into panic |
Any update on this issue? It's bad enough so 0.12.0 it's not production ready in a windows env. Thanks |
@DerrickMartinez you have any logs you can post? i have to look into something i think is related. |
Windows Update KB4551853 screws up my flannel connections. Could be related. Is not only related to flannel but also affects other CNI (ex Docker swarm). |
I have a workaround for this issue, see KnicKnic@023f21b |
This looks to be an open standing issue with the golang client: kubernetes/client-go#374 It looks like this type of failure is handled directly in kubelet as a workaround: kubernetes/kubernetes#78016 and other CNI's handle it as well: AliyunContainerService/terway#87 |
With kubernetes/kubernetes#95981 merged, I believe all that needs to be done to resolve this is bumping k8s.io/go-client to 1.20 after release or the 1.19 backport. |
And the cherrypick PR for 1.19 is now open - kubernetes/kubernetes#96778 1.19.5 is slated for release on 12/9 per https://github.com/kubernetes/sig-release/blob/master/releases/patch-releases.md. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
When flanneld 0.12 on Windows worker node lost the connection with the apiserver, it doesn't retry to connect but continues to log:
reflector.go:201] github.com/coreos/flannel/subnet/kube/kube.go:307: Failed to list *v1.Node: Get https://10.128.0.12:6443/api/v1/nodes?resourceVersion=0: http2: no cached connection was
available`Expected Behavior
Flanneld should reconnect to the apiserver
Steps to Reproduce (for bugs)
In our environment the easiest way to reproduce the issue is to move the floating IP to a different master node.
Your Environment
The text was updated successfully, but these errors were encountered: