Recovering from node failures #10745

aedades-cw · 2024-10-21T16:46:43Z

Hello,

My team and I simulated a node failure by cordoning a node and force-killing the Kafka broker pod running on it. Kubernetes was then unable to schedule the pod on any other node because of the PVC.

The KafkaNodePool is configured to use a storage class that relies on local persistent volumes that have a node affinity. Snippet of kubectl describe pv:

Status:            Bound
Reclaim Policy:    Retain
Node Affinity:     
  Required Terms:  
    Term 0:        topology.topolvm.io/node in [{node ID}]

We tried to create a KafkaRebalance resource as described in https://strimzi.io/docs/operators/latest/deploying#proc-scaling-down-node-pools-str to remove the broker from the cluster and reassign its partition replicas, but Cruise Control (KCC) was unable to do so because the Kafka cluster was not in a healthy state.

Workaround:

We deleted the pod and PVC by applying an annotation to the pod as described in https://strimzi.io/docs/operators/0.22.1/full/using#proc-manual-delete-pod-pvc-kafka-str, and both were recreated on a different node. Partitions were then replicated to the broker on the new node.

In the event of an actual node failure, we would not want to delete the PVC until the cluster had recovered with all partitions replicated, so that we could recover data from the disk if necessary. We weren’t able to reschedule the broker pod on another node without first deleting the PVC. What is the recommended action in this situation?

*I saw a similar thread (though the brokers there used ephemeral storage)

Answered by scholzj

scholzj · 2024-10-21T17:28:39Z

Maintainer

You have to delete the PVC. However, the PVC just provides a link between the Pod and the PV. So, deleting the PVC does not mean deleting your PV / your data. If you configure your storage class to retain the PVs when PVCs are deleted, you will keep the PV and the data and you can use them if needed.
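As a rough pre-check before deleting any PVCs, the following Python sketch lists the PVs bound to a set of PVCs whose reclaim policy is not Retain. It assumes the input is the parsed JSON from kubectl get pv -o json; the function name, the namespace kafka, and the PVC and PV names are hypothetical. Note also that, as far as standard Kubernetes behaviour goes, a retained PV becomes Released after its PVC is deleted and is not rebound automatically.

```python
def pvs_not_retained(pv_list, pvc_names, namespace):
    """Names of PVs bound to the given PVCs that would not survive PVC deletion."""
    risky = []
    for pv in pv_list.get("items", []):
        spec = pv.get("spec", {})
        claim = spec.get("claimRef") or {}
        # Skip volumes that are not bound to one of the PVCs we plan to delete.
        if claim.get("namespace") != namespace:
            continue
        if claim.get("name") not in pvc_names:
            continue
        # Anything other than Retain means the volume may be cleaned up.
        if spec.get("persistentVolumeReclaimPolicy") != "Retain":
            risky.append(pv["metadata"]["name"])
    return risky


# Hypothetical data shaped like `kubectl get pv -o json`.
example_pvs = {
    "items": [
        {
            "metadata": {"name": "pv-broker-0"},
            "spec": {
                "persistentVolumeReclaimPolicy": "Retain",
                "claimRef": {"namespace": "kafka", "name": "data-broker-0"},
            },
        },
        {
            "metadata": {"name": "pv-broker-1"},
            "spec": {
                "persistentVolumeReclaimPolicy": "Delete",
                "claimRef": {"namespace": "kafka", "name": "data-broker-1"},
            },
        },
    ]
}

at_risk = pvs_not_retained(example_pvs, {"data-broker-0", "data-broker-1"}, "kafka")
```

An empty result suggests that every affected volume is set to Retain, as in the PV shown in the question.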

2 replies

aedades-cw Oct 22, 2024
Author

Thank you for confirming! Thinking more about failure modes in general, can Cruise Control currently help with recovery in any failure scenario (broker, node, etc.), or is it purely for standard administrative cluster balancing?

scholzj Oct 22, 2024
Maintainer

I don't think Cruise Control has anything to do with this.
