You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is similar to previous issues where an evicted pod that was not deleted can causes a newly spin up pod waiting for the lock #1874
We encounter an issue where a pod state becomes unknown due to a crashed node and a newly spin up pod cannot obtain the lock.
While we agree the node issue should be handled by a system administrator, the new operator pod should be able to become the leader.
What did you do?
A crashed node caused the operator pod state become unknown.
A new pod spinned up. What did you expect to see?
New pod should become the leader What did you see instead? Under which circumstances?
New pod cannot acquire the lock
Environment
operator-sdk version:
v0.11.0
go version:
1.13
Kubernetes version information:
16
Kubernetes cluster kind:
Are you writing your operator in ansible, helm, or go?
Possible Solution
Add an additional check to the leader package for pod state is unknown and its last transition time was a X time-window ago
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered:
Bug Report
This is similar to previous issues where an evicted pod that was not deleted can causes a newly spin up pod waiting for the lock #1874
We encounter an issue where a pod state becomes unknown due to a crashed node and a newly spin up pod cannot obtain the lock.
While we agree the node issue should be handled by a system administrator, the new operator pod should be able to become the leader.
What did you do?
A crashed node caused the operator pod state become unknown.
A new pod spinned up.
What did you expect to see?
New pod should become the leader
What did you see instead? Under which circumstances?
New pod cannot acquire the lock
Environment
v0.11.0
1.13
16
Kubernetes cluster kind:
Are you writing your operator in ansible, helm, or go?
Possible Solution
Add an additional check to the leader package for pod state is unknown and its last transition time was a X time-window ago
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: