-
Notifications
You must be signed in to change notification settings - Fork 977
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Gang scheduling job with high-priority not preempting lower priority jobs #2337
Comments
Thanks for your reporting, It seems the same issue with #2034, we are dealing with it. |
@talcoh2x Would you like to provide the job yaml? |
|
/assign @waiterQ |
@william-wang: GitHub didn't allow me to assign the following users: waiterQ. Note that only volcano-sh members, repo collaborators and people who have commented on this issue/PR can be assigned. Additionally, issues/PRs can only have 10 assignees at the same time. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
@talcoh2x @snirkop89 I'm also taking a test about this bug. Can you provide your scheduler configuration? |
sure:
|
@snirkop89 Hi, Snir. I've taken a look at the bug and preemption was broken indeed. There are several reasons about that. Firstly, the podgroup for job with high priority cannot convert from actions: "enqueue, allocate, backfill, preempt"
tiers:
- plugins:
- name: priority
- name: gang
- name: conformance
- plugins:
- name: overcommit
enableJobEnqueued: false ## disable jobEnqueued function for overcommit plugin
- name: drf
- name: predicates
- name: proportion
enableJobEnqueued: false ## disable jobEnqueued function for proportion plugin
- name: nodeorder
- name: binpack As what I tested locally, it can make podgroup with high priority enter |
@Thor-wl Hi, I'm also studying the preemption behavior of Volcano, and found the same problem. It seems that the JobStarvingFn of the gang plugin forbids preemption from a job where |
Thanks for the feedback. That's what I also found yesterday. IMO, it's not something as expected. I'm tracking which commit and when this behavior is introduced. |
volcano/pkg/scheduler/actions/preempt/preempt.go Lines 124 to 126 in 1b26306
Pods with Preemptable = false will not be preempted, but it seems that task.Preemptable is false by default if we don't set annotation or label.volcano/pkg/scheduler/api/pod_info.go Line 101 in 1b26306
@Thor-wl I don't know if this could be the problem. Similar reclaim action may have problem as well, #2340 volcano/pkg/scheduler/actions/reclaim/reclaim.go Lines 135 to 137 in 1b26306
|
In order to keep compatible with the former versions, volcano/pkg/scheduler/api/pod_info.go Line 76 in 2bb5ac7
@wpeng102 It seems that TDM plugin introduced the behavior. Let's take a review. Thanks! |
That's great to hear. Thank you for the fast response and feedback. |
No worries. The fix is under discussion. |
Hi, is there an update about this? |
@william-wang @Thor-wl |
Hi, there is something new to update ? |
@zhypku Hi, can you share with us the Volcano configuration you have and worked for you ? I mean the preemption flow |
Hello 👋 Looks like there was no activity on this issue for last 90 days. |
Closing for now as there was no activity for last 60 days after marked as stale, let us know if you need this to be reopened! 🤗 |
we run gang scheduling job with high-priority but we dont see that the default priory jobs releasing once we don't enough resources.
expected:
we expect that in such cases lower priority jobs are getting deleted.
Volcano version 1.6.0
K8s version 1.22/1.21
The text was updated successfully, but these errors were encountered: