Calico in eBPF mode has bug and should be upgraded to 3.27.3 for Kops 1.28.5 and below #16589

zied-jt · 2024-05-24T09:15:00Z

/kind bug

1. What kops version are you running? The command kops version, will display
this information.
Client version: 1.28.5 (git-v1.28.5)

2. What Kubernetes version are you running? kubectl version will print the
version if a cluster is running or provide the Kubernetes version specified as
a kops flag.

v1.28.10

3. What cloud provider are you using?
AWS

4. What commands did you run? What is the simplest way to reproduce this issue?

Migrate from kube-proxy to Calico with eBPF

5. What happened after the commands executed?

For a fresh node, no traffic routed for loadBalancer service type like ingress-controller With externalTrafficPolicy=Local

By default, kops 1.28.5 provides calico 3.25.2 This version have the described bug projectcalico/calico#8112 fixed in projectcalico/calico#8313 available at calico 3.27.3

The issue reports for our specific config that :

kube-proxy frontend that we use in our kubeproxy does not expect to be shut down in any other way that hard stop of the process, while we "restart" the kubeproxy when the host ip changes as it was an easy way to reconcile the NAT tables. However, the webservers that handle the health checks don't shut down. So we need to be more careful about how we handle that without control of the k8s part of the code.

Steps to Reproduce (copied from same calico bug report)

1.Kubernetes Cluster with calico cni with eBPF dataplane
2.Create Kubernetes service type LoadBalancer with externalTrafficPolicy: Local
3.reboot the node where endpoints of the service are located
4.see logs in calico-node and curl HealtCheckPort on this node like:

err="listen tcp :30904: bind: address already in use" node="i-xxxxxxxxxxxxxxxxxx" service="nginx-controllers/nginx-ingress-controller"

6. What did you expect to happen?
Calico version up and running for eBPF mode.

7. Please provide your cluster manifest. Execute
kops get --name my.example.com -o yaml to display your cluster manifest.
You may want to remove your cluster name and other sensitive information.

networking:
  calico:
    bpfEnabled: true
    awsSrcDstCheck: Disable
    encapsulationMode: vxlan
kubeProxy:
  enabled: false

8. Please run the commands with most verbose logging by adding the -v 10 flag.
Paste the logs into this report, or in a gist and provide the gist link here.

err="listen tcp :30904: bind: address already in use" node="i-xxxxxxxxxxxxxxxxxx" service="nginx-controllers/nginx-ingress-controller"

9. Anything else do we need to know?
These kops PR have already the code for the upgrade of calico and could help fixing the issue by backporting to kops <= 1.28.5 :

PS: This bug report was written with the help of @rasta-rocket, @rsicart, @sgendrot-jobteaser, @yelaissaoui

The text was updated successfully, but these errors were encountered:

rsicart · 2024-06-12T10:22:42Z

Hi there!

Some news about that issue?

Thanks in advance!

hakman · 2024-06-13T05:00:22Z

Resolved via #16613.
/close

k8s-ci-robot · 2024-06-13T05:00:26Z

@hakman: Closing this issue.

In response to this:

Resolved via #16613.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

rsicart · 2024-07-02T10:09:39Z

Hello!

Thanks a lot for your work and reactivity!

Do you know if there's a scheduled release for that cherry-pick mentioned above soon?

Thanks again, great job!

hakman · 2024-07-05T06:46:41Z

@rsicart there is a release planned soon

k8s-ci-robot added the kind/bug Categorizes issue or PR as related to a bug. label May 24, 2024

rifelpet mentioned this issue Jun 13, 2024

Automated cherry pick of #16192: Update Calico to v3.27.0 #16363: Update Calico to v3.27.3 #16613

Merged

k8s-ci-robot closed this as completed Jun 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Calico in eBPF mode has bug and should be upgraded to 3.27.3 for Kops 1.28.5 and below #16589

Calico in eBPF mode has bug and should be upgraded to 3.27.3 for Kops 1.28.5 and below #16589

zied-jt commented May 24, 2024

rsicart commented Jun 12, 2024 •

edited

Loading

hakman commented Jun 13, 2024

k8s-ci-robot commented Jun 13, 2024

rsicart commented Jul 2, 2024

hakman commented Jul 5, 2024

Calico in eBPF mode has bug and should be upgraded to 3.27.3 for Kops 1.28.5 and below #16589

Calico in eBPF mode has bug and should be upgraded to 3.27.3 for Kops 1.28.5 and below #16589

Comments

zied-jt commented May 24, 2024

rsicart commented Jun 12, 2024 • edited Loading

hakman commented Jun 13, 2024

k8s-ci-robot commented Jun 13, 2024

rsicart commented Jul 2, 2024

hakman commented Jul 5, 2024

rsicart commented Jun 12, 2024 •

edited

Loading