
Skip drain on NotReady Nodes #450

Merged
1 commit merged into gardener:master on Apr 28, 2020

Conversation

@zuzzas (Contributor) commented Apr 21, 2020

What this PR does / why we need it:

Sometimes (mainly due to incorrect requests/limits in our case) a Node's kubelet might become unavailable. Let's skip the drain completely on such Nodes.

Which issue(s) this PR fixes:
Fixes #448

Special notes for your reviewer:

This article is useful for understanding the Node NotReady status. I've come to the conclusion that checking the current Node Conditions should be enough.

Release note:

The drain is skipped entirely if the node has been NotReady for the timeout period. The default timeout is 5 minutes.
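
For illustration, the behavior described in the release note roughly corresponds to a check like the one sketched below. This is a hypothetical helper using the upstream core/v1 Node types, not the PR's actual code; the name shouldSkipDrain and the hard-coded timeout are assumptions.

```go
// Hypothetical helper (not the PR's actual code) illustrating the check the
// release note describes: skip the drain when the node's Ready condition has
// not been True for at least the timeout. Assumes k8s.io/api/core/v1 types.
package drain

import (
	"time"

	corev1 "k8s.io/api/core/v1"
)

// notReadyTimeout mirrors the 5-minute default mentioned in the release note.
const notReadyTimeout = 5 * time.Minute

// shouldSkipDrain reports whether the drain can be skipped for the given node.
func shouldSkipDrain(node *corev1.Node, now time.Time) bool {
	for _, cond := range node.Status.Conditions {
		if cond.Type != corev1.NodeReady {
			continue
		}
		if cond.Status == corev1.ConditionTrue {
			return false // kubelet reports Ready; drain normally
		}
		// NotReady or Unknown: skip only if it has persisted for the timeout.
		return now.Sub(cond.LastTransitionTime.Time) >= notReadyTimeout
	}
	// No Ready condition reported at all; be conservative and drain.
	return false
}
```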

@zuzzas zuzzas requested review from ggaurav10 and a team as code owners April 21, 2020 17:09
pkg/controller/drain.go: review thread (outdated, resolved)
@hardikdr (Member) left a comment

Thanks @zuzzas for the PR.

Fine with the current approach. Not concerned, just curious: should we force-delete the machine if the kubelet is only temporarily unavailable and may come back soon? Force-deletion also implies violating the PDB during the roll-out.

Can we override the drain-timeout if NotReady is set during the drain? WDYT?

@rfranzke (Member)

> Can we override the drain-timeout if NotReady is set during the drain? WDYT?

Not sure, what would you set it to? If the kubelet is down or not responding, then the drain timeout will always elapse, I would assume. Does it make sense then?

@hardikdr (Member)

I would set it to a value equal to the health-check timeout [~10 mins] or even less [~5 mins].

The idea would be to consider the following possibilities:

  • the kubelet might not actually be completely unhealthy and may come back soon, hence giving the pods one chance to be evicted.
  • we do sometimes see nodes flipping between Ready and NotReady, possibly due to unknown networking issues.

@rfranzke (Member)

Hm okay, the second point is valid I guess, thanks for bringing it up. But then even a small timeout might not be a good idea, right? What about starting with a very small timeout, ~5 min, and if it elapses, reevaluating whether the node has meanwhile become ready again? Only if it hasn't, terminate it forcefully; otherwise drain again with the normal timeout?

pkg/controller/drain.go: review thread (outdated, resolved)
@hardikdr (Member)

> Hm okay, the second point is valid I guess, thanks for bringing it up. But then even a small timeout might not be a good idea, right? What about starting with a very small timeout, ~5 min, and if it elapses, reevaluating whether the node has meanwhile become ready again? Only if it hasn't, terminate it forcefully; otherwise drain again with the normal timeout?

That would be a pretty reliable solution.
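
For readers following the thread, a rough sketch of the two-phase flow proposed above might look like the following. This is purely illustrative and is not the code that was merged; the helper names (drainNode, nodeIsReady, forceDeleteMachine) and both timeout values are assumptions.

```go
// Hypothetical sketch of the two-phase idea discussed above: try a short drain
// first for a NotReady node, re-check readiness, then either force-delete or
// fall back to a normal drain. Not the merged implementation.
package drain

import (
	"context"
	"time"
)

const (
	shortDrainTimeout  = 5 * time.Minute // small first attempt, as proposed
	normalDrainTimeout = 2 * time.Hour   // regular drain timeout (placeholder value)
)

// drainFuncs abstracts the operations the flow needs; all of them are assumed helpers.
type drainFuncs struct {
	drainNode          func(ctx context.Context, nodeName string, timeout time.Duration) error
	nodeIsReady        func(ctx context.Context, nodeName string) (bool, error)
	forceDeleteMachine func(ctx context.Context, nodeName string) error
}

// handleNotReadyNode drains with a short timeout, re-evaluates readiness, and
// only force-deletes if the node is still NotReady afterwards.
func (f drainFuncs) handleNotReadyNode(ctx context.Context, nodeName string) error {
	if err := f.drainNode(ctx, nodeName, shortDrainTimeout); err == nil {
		return nil // drain finished within the short window
	}
	ready, err := f.nodeIsReady(ctx, nodeName)
	if err != nil {
		return err
	}
	if ready {
		// Node recovered in the meantime; retry with the normal timeout.
		return f.drainNode(ctx, nodeName, normalDrainTimeout)
	}
	// Still NotReady after the short window: terminate forcefully.
	return f.forceDeleteMachine(ctx, nodeName)
}
```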

@zuzzas (Contributor, Author) commented Apr 22, 2020

OK, let's try it from this angle. We don't really have to track (time) anything ourselves, since there is a handy lastTransitionTime field in the Conditions.

Signed-off-by: Andrey Klimentyev <andrey.klimentyev@flant.com>
@hardikdr (Member)

> OK, let's try it from this angle. We don't really have to track (time) anything ourselves, since there is a handy lastTransitionTime field in the Conditions.

The approach looks good; I'll take a further look tomorrow. I mainly want to check what happens if the node has been NotReady for less than 5 mins and the drain starts. Would it then wait until the complete drain-timeout elapses?

@hardikdr (Member)

The drain-timeout still seems to be effective if the drain starts within 5 mins of the node becoming NotReady. I think more changes and complexity would be introduced if we targeted the perfect solution discussed above.

Still, the current solution is valuable for most cases, since NotReady nodes will be skipped during the drain.
Should we merge this as a short-term solution and keep the related issue open?

WDYS?

@hardikdr hardikdr added the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 26, 2020
@gardener-robot-ci-1 gardener-robot-ci-1 added needs/ok-to-test Needs approval for testing (check PR in detail before setting this label because PR is run on CI/CD) and removed reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) labels Apr 26, 2020
@zuzzas (Contributor, Author) commented Apr 26, 2020

@hardikdr
One of the most prominent cases where this issue has been hitting us is when a Node has a lot of Pods with badly configured resource requests scheduled on it. If they collectively allocate lots of memory, the kernel's memory management stalls, and the Node becomes completely stuck with no chance of recovery. It is imperative that MCM swiftly removes it and replaces it with a healthy one.

Let's call this PR "a band-aid" that fixes one of the ugliest problems right now. And merge it, of course. :)

@hardikdr hardikdr merged commit 8ec2a7f into gardener:master Apr 28, 2020
@zuzzas zuzzas deleted the skip-drain-if-not-ready branch April 29, 2020 07:44