
Flanneld doesn't reconnect to the apiserver #1272

Closed
angeloxx opened this issue Mar 24, 2020 · 9 comments
@angeloxx

angeloxx commented Mar 24, 2020

When flanneld 0.12 on a Windows worker node loses its connection to the apiserver, it never retries the connection and just keeps logging:

reflector.go:201] github.com/coreos/flannel/subnet/kube/kube.go:307: Failed to list *v1.Node: Get https://10.128.0.12:6443/api/v1/nodes?resourceVersion=0: http2: no cached connection was available
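The "no cached connection was available" error comes from Go's HTTP/2 client handing back a cached connection that is already dead. A commonly cited stopgap, assuming nothing about flannel's own code, is to disable Go's HTTP/2 client via `GODEBUG` so requests fall back to HTTP/1.1 and never touch the stale http2 connection cache:

```shell
# Stopgap sketch (an assumption, not an official flannel recommendation):
# GODEBUG=http2client=0 disables Go's HTTP/2 client, so each request uses
# HTTP/1.1 and cannot reuse a dead connection from the http2 cache.
GODEBUG=http2client=0 flanneld --kube-subnet-mgr --kubeconfig-file=<kubelet-file.conf>
```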

Expected Behavior

Flanneld should reconnect to the apiserver

Steps to Reproduce (for bugs)

In our environment the easiest way to reproduce the issue is to move the floating IP to a different master node.

Your Environment

  • Flannel version: 0.12.0
  • Startup Option: --kube-subnet-mgr --kubeconfig-file=<kubelet-file.conf>
  • Backend used: vxlan
  • Kubernetes version (if used): 1.16.3 on-premise, 3 master nodes; flanneld reaches the apiserver through a floating IP managed by keepalived
  • Operating System and version:
    • Master: Redhat Linux 7
    • Worker: Windows 2019 10.0.0.17763.934
@DerrickMartinez

I have reproduced this as well on 0.12.0. A rolling update of the master nodes causes flanneld to go into a panic.

@DerrickMartinez

Any update on this issue? It's bad enough that 0.12.0 is not production-ready in a Windows environment. Thanks

@luthermonson
Contributor

@DerrickMartinez do you have any logs you can post? I have to look into something I think is related.

@EagleIJoe

Windows Update KB4551853 breaks my flannel connections; could be related. It doesn't only affect flannel but other CNIs as well (e.g. Docker Swarm).

@KnicKnic

I have a workaround for this issue, see KnicKnic@023f21b

@jsturtevant

This looks to be a long-standing open issue with the golang client: kubernetes/client-go#374

It looks like this type of failure is handled directly in kubelet as a workaround: kubernetes/kubernetes#78016, and other CNIs handle it as well: AliyunContainerService/terway#87

@rhockenbury

With kubernetes/kubernetes#95981 merged, I believe all that needs to be done to resolve this is bumping k8s.io/client-go to 1.20 after release, or to the 1.19 backport.

@rhockenbury

And the cherrypick PR for 1.19 is now open - kubernetes/kubernetes#96778

1.19.5 is slated for release on 12/9 per https://github.com/kubernetes/sig-release/blob/master/releases/patch-releases.md.
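For context, kubernetes/kubernetes#95981 adds an HTTP/2 ping-based health check to the client transport, so a dead cached connection is detected and closed instead of being reused. Once flannel is built against a client-go that includes it, the check should be tunable via environment variables; the names below reflect my reading of that change, so treat them as an assumption:

```shell
# Assumed tunables introduced alongside kubernetes/kubernetes#95981:
# send an HTTP/2 ping after 30s of read inactivity, and close the
# connection if the ping is not answered within 15s.
export HTTP2_READ_IDLE_TIMEOUT_SECONDS=30
export HTTP2_PING_TIMEOUT_SECONDS=15
flanneld --kube-subnet-mgr --kubeconfig-file=<kubelet-file.conf>
```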

@stale

stale bot commented Jan 25, 2023

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the wontfix label Jan 25, 2023
@stale stale bot closed this as completed Feb 15, 2023