-
Notifications
You must be signed in to change notification settings - Fork 6.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Calico-node crashed on Debian 11 #8186
Comments
I'm facing something similar in debugging #8175 |
This issue seems to be related to Using either Containerd deployment: https://gitlab.com/kargo-ci/kubernetes-sigs-kubespray/-/jobs/1793953991 but multiple runs yield different failures. |
After some tinkering it seems Debian 11 switch to cgroup v2 and containerd is not yet ready for this. As a result you need to revert to cgroup v1 by adding We need to address this in CI, we already have similar mitigations for fedora31+ (roles/container-engine/containerd/tasks/main.yml), it is interesting how come docker is happy with this. |
I just concluded a test and it seems with using |
On CentOS 8.5 I have containerd 1.5.7 / k8s 1.22.3 / cgroup_driver systemd / cgroupv2
|
This is how we configure containerd out of the box: version = 2
root = "/var/lib/containerd"
state = "/run/containerd"
oom_score = 0
[grpc]
max_recv_message_size = 16777216
max_send_message_size = 16777216
[debug]
level = "info"
[metrics]
address = ""
grpc_histogram = false
[plugins]
[plugins."io.containerd.grpc.v1.cri"]
sandbox_image = "k8s.gcr.io/pause:3.3"
max_container_log_line_size = -1
[plugins."io.containerd.grpc.v1.cri".containerd]
default_runtime_name = "runc"
snapshotter = "overlayfs"
[plugins."io.containerd.grpc.v1.cri".containerd.runtimes]
[plugins."io.containerd.grpc.v1.cri".containerd.runtimes.runc]
runtime_type = "io.containerd.runc.v2"
runtime_engine = ""
runtime_root = ""
[plugins."io.containerd.grpc.v1.cri".containerd.runtimes.runc.options]
systemCgroup = true
[plugins."io.containerd.grpc.v1.cri".registry]
[plugins."io.containerd.grpc.v1.cri".registry.mirrors]
[plugins."io.containerd.grpc.v1.cri".registry.mirrors."docker.io"]
endpoint = ["https://registry-1.docker.io"] I'm guessing the |
Seems like we introduced a typo in #8123 which broke containerd with systemd cgroups driver. /cc @pasqualet |
You are right, #8123 introduced the typo but I'm not sure if it's the reason for this issue. Anyway I've created the #8206 to fix the typo. I think we need better integration tests and #6400 seems could be the starting point to work on it. |
Environment:
Cloud provider or hardware configuration:
VM based on box generic/debian11
OS (
printf "$(uname -srm)\n$(cat /etc/os-release)\n"
):Linux 5.10.0-9-amd64 x86_64
PRETTY_NAME="Debian GNU/Linux 11 (bullseye)"
NAME="Debian GNU/Linux"
VERSION_ID="11"
VERSION="11 (bullseye)"
VERSION_CODENAME=bullseye
ID=debian
HOME_URL="https://www.debian.org/"
SUPPORT_URL="https://www.debian.org/support"
BUG_REPORT_URL="https://bugs.debian.org/"
Version of Ansible (
ansible --version
):ansible 2.10.11
Version of Python (
python --version
):Python 3.9.6
Kubespray version (commit) (
git rev-parse --short HEAD
):0d0468e
Network plugin used:
calico
Full inventory with variables (
ansible -i inventory/sample/inventory.ini all -m debug -a "var=hostvars[inventory_hostname]"
):https://termbin.com/76wp
Command used to invoke ansible:
ansible-playbook -i inventory/hosts.yaml -b cluster.yml
Output of ansible run:
Anything else do we need to know:
kubectl describe po -n kube-system calico-node-q2c7r
On Debian 10 work fine
The text was updated successfully, but these errors were encountered: