There is a Talos cluster with three nodes. One of the nodes froze: it stopped responding on port 50000 (the Talos API port), the network card was still responsive, but VNC showed that the system was completely stuck. After a reboot, the node came back up and rejoined the cluster. However, etcd was then dead on the second node, and the logs indicated that the API server was not working either.
CNI logs revealed widespread issues. Linstor on this node was also not functioning. After a reboot, the node came up in maintenance mode. I applied the configuration, and everything started working again, but what could have caused this behavior?
The main question is: why did the Talos node come up in maintenance mode?
Talos v1.8.1
Logs
Environment
Talos version: v1.8.1
Kubernetes version: 1.30
Platform: