-
Notifications
You must be signed in to change notification settings - Fork 299
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[PartialAdmission] Fix preemption while partially admitting. #2826
[PartialAdmission] Fix preemption while partially admitting. #2826
Conversation
✅ Deploy Preview for kubernetes-sigs-kueue ready!
To edit notification comments on pull requests, go to your Netlify site configuration. |
/cc @gabesaba |
8530e5a
to
f8b7234
Compare
Please add a release note describing what's changed from the user's perspective. |
done |
/assign |
742725f
to
03a772d
Compare
I support the ask in comment. |
@trasc Could you describe what the scenario is fixed here, and what is the consequence from the user perspective when it happens? Please update the release note with a short description of the problematic scenario. It will also help us to guide the decision about cherry-picking. |
The scenario is |
Can you provide a little bit more details? In particular: is the preemption not working at all / erroring / panicing / or working sub-optimally. |
/reopen |
@mimowo: Reopened this PR. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
Add an unit test for preemption while partially admitting.
03a772d
to
6fa2dca
Compare
/cc |
I'm testing locally the impact of the bug and it seems to go beyond preemption. For example:
I'm yet going to determine the scope of the problem and fix, but it seems serious enough to justify cherry-picking. |
I think this is a different issue. Just open the issue in GH and feel free to assign it to me. |
Ok, there you go: #3108 |
Regarding the fix here I played a bit more, and observed, that the feature generally worked before (no panics, no errors), but there are differences in some scenarios, and they are really hard for me to briefly (also the other issue is interfering to test properly, because the other issue is also present with preemption enabled). Still, IIUC the culprit for the issue is wrong calculation of resources requested by the incoming workload. I would like to capture this in the release note as below.
Going forward it would be super helpful to have an integration test(s) for this feature, otherwise it is hard to judge how the unit tests translate into the e2e behavior, imo. Regarding the cherry-pick, no strong view here, but I think we don't need to, given that we don't have customers / users affected by the issue, or demanding the fix. Additionally, the other issue opened above still interferes to fully evaluate the fix. We can re-evaluate cherry-picking them together, or when we have users demanding the fix. |
LGTM, except for the point raised here, but I can accept as is, just waiting a bit for @alculquicondor in case he has something to add or other ideas to improve. |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: mimowo, trasc The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
LGTM label has been added. Git tree hash: bfea979511531e090af34379b6573419f2bde216
|
/cherrypick release-0.8 |
@trasc: new pull request created: #3205 In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
…tes-sigs#2826) * [PartialAdmission] Fix preemption while partially admitting. Add an unit test for preemption while partially admitting. * Review Remarks * Review remarks * Review remarks.
What type of PR is this?
/kind bug
What this PR does / why we need it:
Which issue(s) this PR fixes:
Fixes #2799
Special notes for your reviewer:
Does this PR introduce a user-facing change?