Don't consider Crashloopbackoff pods as part of PDB #684

himanshu-kun · 2022-02-28T10:28:59Z

How to categorize this issue?

/area performance
/area robustness
/area usability
/kind enhancement
/priority 3

What would you like to be added:
Currently upstream treats Crashloopbackoff pods as Unavailable and so if a PDB is configured with maxUnavailable=1 with 2 pods , 1 pod Pending and other Crashloopbackoff, then the pod eviction request is denied for such pod and node draining can't proceed.
There is a discussion upstream to deal with this kubernetes/kubernetes#72320 and a PR to ignore Crashloopbackoff pods from PDB is raised kubernetes/kubernetes#105296

Testing is required after the PR gets merged and MCM starts using the corresponding k8s version
Why is this needed:
So that MCM draining is not stuck till drainTimeout for CrashLoopbackoff pods with PDB.

The text was updated successfully, but these errors were encountered:

himanshu-kun · 2023-02-23T09:47:19Z

Post grooming discussion

The onus is on the customer now to configure the PDB in a way which allows to drain the CrashLoopBackoff pods also .
This PR introduced spec.unhealthyPodEvictionPolicy recently. It is currently in alpha and needs to be enabled via feature gate PDBUnhealthyPodEvictionPolicy.

We need to update the gardener docs after testing this feature, to tell the customers how to do use this. Also need to update the DOD playbook for operators.

timuthy · 2023-11-16T08:19:11Z

FYI: gardener/gardener#8821

himanshu-kun added the kind/enhancement Enhancement, improvement, extension label Feb 28, 2022

gardener-robot added area/performance Performance (across all domains, such as control plane, networking, storage, etc.) related area/robustness Robustness, reliability, resilience related area/usability Usability related priority/3 Priority (lower number equals higher priority) labels Feb 28, 2022

himanshu-kun changed the title ~~Don't consider Crashloop backoff pods as part of PDB~~ Don't consider Crashloopbackoff pods as part of PDB Feb 28, 2022

timebertt mentioned this issue May 19, 2022

Drop PodDisruptionBudget for calico-kube-controllers gardener/gardener-extension-networking-calico#183

Merged

gardener-robot added the lifecycle/stale Nobody worked on this for 6 months (will further age) label Aug 28, 2022

gardener-robot added the lifecycle/stale Nobody worked on this for 6 months (will further age) label Nov 2, 2023

gardener-robot added lifecycle/rotten Nobody worked on this for 12 months (final aging stage) and removed lifecycle/stale Nobody worked on this for 6 months (will further age) labels Jul 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't consider Crashloopbackoff pods as part of PDB #684

Don't consider Crashloopbackoff pods as part of PDB #684

himanshu-kun commented Feb 28, 2022

himanshu-kun commented Feb 23, 2023

timuthy commented Nov 16, 2023

Don't consider Crashloopbackoff pods as part of PDB #684

Don't consider Crashloopbackoff pods as part of PDB #684

Comments

himanshu-kun commented Feb 28, 2022

himanshu-kun commented Feb 23, 2023

Post grooming discussion

timuthy commented Nov 16, 2023