-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Revert "Enable NodeKiller on ClusterLoader presubmits" #16847
Conversation
/assign @jprzychodzen |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: mm4tt The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/lgtm |
/hold |
This reverts commit 53f19ac. Here is an example of PR when presubmit failed twice due to NK - kubernetes/perf-tests#1126 An example presubmit run https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/pull/perf-tests/1126/pull-perf-tests-clusterloader2/1240479871608557568/ In the above run, NK killed 5 nodes during density which in the end resulted in test timing out after 1h40min. Here is the last NK log: ``` W0319 05:03:07.667] I0319 05:03:07.667437 12947 nodes.go:121] NodeKiller: Node e2e-1126-62db2-minion-group-sszg removed from killing. Runs pod prometheus-k8s-0 W0319 05:03:07.668] I0319 05:03:07.667536 12947 nodes.go:135] NodeKiller: 94 nodes available, wants to fail 1 nodes W0319 05:03:07.668] I0319 05:03:07.667547 12947 nodes.go:140] NodeKiller: Node "e2e-1126-62db2-minion-group-ngh6" schedule for failure W0319 05:03:07.668] I0319 05:03:07.667643 12947 nodes.go:154] NodeKiller: Stopping docker and kubelet on "e2e-1126-62db2-minion-group-ngh6" to simulate failure ``` We should change the logic to not kill the nodes so many times.
7d95a61
to
0529666
Compare
Done, ptal |
/lgtm |
Thanks! |
@mm4tt: Updated the
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
This reverts commit 53f19ac.
Here is an example of PR when presubmit failed twice due to NK - kubernetes/perf-tests#1126
An example presubmit run https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/pull/perf-tests/1126/pull-perf-tests-clusterloader2/1240479871608557568/
In the above run, NK killed 6 nodes during density which in the end resulted in test timing out after 1h40min.
We should change the logic to not kill the nodes so many times.