Partial preemption of workloads #975

ahg-g · 2023-07-11T17:25:03Z

What would you like to be added:
Partial preemption of workloads. Currently preemption is performed for the whole workload, for example when giving back borrowed capacity. This is too aggressive for workloads that tolerate downscaling (e.g., a Ray cluster).

We can come up with a heuristic to select which podset to downscale, could be as simple as going by their order in addition to having a flag indicating which ones can downscale and which can't (and so at the extreme just preempt the whole workload).

Why is this needed:
To limit disruptions caused by preemption.

Completion requirements:

This enhancement requires the following artifacts:

Design doc
API change
Docs update

The artifacts should be linked in subsequent comments.

kerthcet · 2023-07-19T02:54:50Z

/cc

k8s-triage-robot · 2024-01-24T20:07:03Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle stale
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

tenzen-y · 2024-01-25T16:57:09Z

/remove-lifecycle stale

gu-san · 2024-02-21T23:26:19Z

Thanks for the great project. We have a very similar requirement to what @ahg-g outlines.
We are keen to learn if there have been any design discussions on this topic, and if we could help contribute to the implementation/discussion?

tenzen-y · 2024-02-22T02:57:45Z

Thanks for the great project. We have a very similar requirement to what @ahg-g outlines. We are keen to learn if there have been any design discussions on this topic, and if we could help contribute to the implementation/discussion?

We have yet to discuss this feature, but we're open to discussion.

k8s-triage-robot · 2024-05-22T03:38:24Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle stale
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

tenzen-y · 2024-05-22T03:43:21Z

/remove-lifecycle stale

k8s-triage-robot · 2024-08-20T04:29:21Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle stale
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

tenzen-y · 2024-08-20T15:33:33Z

/remove-lifecycle stale

k8s-triage-robot · 2024-11-18T16:31:54Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle stale
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

tenzen-y · 2024-11-27T13:16:37Z

/remove-lifecycle stale

mimowo · 2024-12-06T16:29:08Z

A related, but more specialized issue: #3762

ahg-g added the kind/feature Categorizes issue or PR as related to a new feature. label Jul 11, 2023

ahg-g mentioned this issue Jul 11, 2023

☂️ Requirements for v0.5 #974

Closed

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 24, 2024

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 25, 2024

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 22, 2024

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 22, 2024

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 20, 2024

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 20, 2024

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 18, 2024

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 27, 2024

mimowo mentioned this issue Dec 6, 2024

Serving-aware partial preemption of workloads #3762

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Partial preemption of workloads #975

Partial preemption of workloads #975

ahg-g commented Jul 11, 2023

kerthcet commented Jul 19, 2023

k8s-triage-robot commented Jan 24, 2024

tenzen-y commented Jan 25, 2024

gu-san commented Feb 21, 2024

tenzen-y commented Feb 22, 2024

k8s-triage-robot commented May 22, 2024

tenzen-y commented May 22, 2024

k8s-triage-robot commented Aug 20, 2024

tenzen-y commented Aug 20, 2024

k8s-triage-robot commented Nov 18, 2024

tenzen-y commented Nov 27, 2024

mimowo commented Dec 6, 2024

Partial preemption of workloads #975

Partial preemption of workloads #975

Comments

ahg-g commented Jul 11, 2023

kerthcet commented Jul 19, 2023

k8s-triage-robot commented Jan 24, 2024

tenzen-y commented Jan 25, 2024

gu-san commented Feb 21, 2024

tenzen-y commented Feb 22, 2024

k8s-triage-robot commented May 22, 2024

tenzen-y commented May 22, 2024

k8s-triage-robot commented Aug 20, 2024

tenzen-y commented Aug 20, 2024

k8s-triage-robot commented Nov 18, 2024

tenzen-y commented Nov 27, 2024

mimowo commented Dec 6, 2024