feature: Support choosing Kubernetes QoS class through the decorator #2155

saikonen · 2024-11-28T00:17:52Z

enables choosing the Kubernetes QoS class out of Guaranteed/Burstable/BestEffort through the decorator.

metaflow/plugins/kubernetes/kubernetes_decorator.py

savingoyal · 2024-12-02T17:19:17Z

metaflow/plugins/kubernetes/kube_utils.py

+        # Guaranteed - has both cpu/memory limits. requests not required, as these will be inferred.
+        qos_limits = {
+            "cpu": str(cpu),
+            "memory": "%sM" % str(memory),


what about storage?

storage limit/request does not affect the QoS class from what I could tell, so I kept it out of this portion. Same withgpu requests. I can pre-emptively move these into the same function for introducing custom QoS classes, but for now it shouldn't have any effect.

should I make the change so ephemeral-storage and gpu requests/limits get the same treatment as cpu/memory?

just ephemeral-storage. gpu's always need to specified with both requests and limits. also, can you verify airflow too?

handling storage as well now. For the BestEffort case, I figure we keep the ephemeral-storage in the requests, or do we omit it completely?

metaflow/plugins/kubernetes/kubernetes_decorator.py

…alue.

metaflow/plugins/kubernetes/kube_utils.py

savingoyal · 2024-12-05T04:04:40Z

metaflow/plugins/kubernetes/kube_utils.py

        # Guaranteed - has both cpu/memory limits. requests not required, as these will be inferred.
        qos_limits = {
            "cpu": str(cpu),
            "memory": "%sM" % str(memory),
            "ephemeral-storage": "%sM" % str(storage),
        }
-    elif qos == "BestEffort":
+    elif qos == "besteffort":
        # BestEffort - no limit or requests for cpu/memory
        qos_requests = {"ephemeral-storage": "%sM" % str(storage)}


should we even support best effort at the moment? there doesn't seem to be a use case for it at the moment

removed this and left a comment on adding support once there is a proper use case.

… happy

savingoyal · 2024-12-05T17:02:31Z

lgtm! can you verify airflow works as expected?

saikonen added 5 commits November 28, 2024 01:28

initial QoS selection working

378d659

add validation to qos_class

9dece08

refactor into a QoS helper, enable QoS for jobsets as well

9325255

fix docstring

c17ebd0

add QoS support for argo as well

91ec8ad

savingoyal reviewed Dec 2, 2024

View reviewed changes

savingoyal mentioned this pull request Dec 2, 2024

Support resource limits in Kubernetes. #1544

Closed

savingoyal reviewed Dec 2, 2024

View reviewed changes

metaflow/plugins/kubernetes/kubernetes_decorator.py Outdated Show resolved Hide resolved

saikonen marked this pull request as ready for review December 3, 2024 10:43

rename deco attribute from qos_class to qos. add config for default v…

216dce0

…alue.

savingoyal reviewed Dec 4, 2024

View reviewed changes

metaflow/plugins/kubernetes/kube_utils.py Outdated Show resolved Hide resolved

saikonen added 2 commits December 4, 2024 22:34

add ephemeral-storage as part of the QoS effects.

3da42f0

case-insensitive matching for qos attribute

4080672

savingoyal reviewed Dec 5, 2024

View reviewed changes

saikonen added 2 commits December 5, 2024 12:05

remove support for BestEffort as unnecessary.

a0af234

specify requests explicitly for "guaranteed" case to make K8S tooling…

31b4725

… happy

add QoS support for airflow as well

43e99df

saikonen merged commit 0469eeb into master Dec 5, 2024
29 checks passed

saikonen deleted the feature/kubernetes-QoS-support branch December 5, 2024 18:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature: Support choosing Kubernetes QoS class through the decorator #2155

feature: Support choosing Kubernetes QoS class through the decorator #2155

saikonen commented Nov 28, 2024

savingoyal Dec 2, 2024

saikonen Dec 4, 2024

savingoyal Dec 4, 2024

saikonen Dec 4, 2024

savingoyal Dec 5, 2024

saikonen Dec 5, 2024

savingoyal commented Dec 5, 2024

feature: Support choosing Kubernetes QoS class through the decorator #2155

feature: Support choosing Kubernetes QoS class through the decorator #2155

Conversation

saikonen commented Nov 28, 2024

savingoyal Dec 2, 2024

Choose a reason for hiding this comment

saikonen Dec 4, 2024

Choose a reason for hiding this comment

savingoyal Dec 4, 2024

Choose a reason for hiding this comment

saikonen Dec 4, 2024

Choose a reason for hiding this comment

savingoyal Dec 5, 2024

Choose a reason for hiding this comment

saikonen Dec 5, 2024

Choose a reason for hiding this comment

savingoyal commented Dec 5, 2024