
Advertising specific GPU types as separate extended resource #424

Open
deepanker-s opened this issue Jul 21, 2023 · 14 comments
Labels
lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale.

Comments

@deepanker-s

Hello,
I am working at Uber.

1. Feature description

Advertising special hardware (specific GPU types say A100) as a separate extended resource.

As of now, we have a single blanket resource, "nvidia.com/gpu", for all GPU types that this plugin supports. If we want our pods to run on specific GPU types, then we need to be able to request such a resource.

There are 2 ways to request such a specific resource -

  1. [Existing] Using nodeLabels/nodeSelectors
  2. [New] Advertising the same directly as a new resource such as "nvidia.com/gpu-A100-...."

This added functionality can be enabled based upon a configuration flag and can use gpu-feature-discovery labels to extract the SKU/GPU type.
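To illustrate the two request styles, the pod specs might look roughly like the following. This is only a sketch: the `nvidia.com/gpu.product` label value and the per-type resource name `nvidia.com/gpu-A100` are assumptions for illustration, not names the plugin currently advertises.

```yaml
# Option 1 (existing): pin the pod to A100 nodes via a node label
# published by gpu-feature-discovery (label value is illustrative).
apiVersion: v1
kind: Pod
metadata:
  name: a100-via-nodeselector
spec:
  nodeSelector:
    nvidia.com/gpu.product: NVIDIA-A100-SXM4-40GB
  containers:
  - name: cuda
    image: nvcr.io/nvidia/cuda:12.3.1-base-ubuntu22.04
    resources:
      limits:
        nvidia.com/gpu: 1
---
# Option 2 (proposed): request a per-type extended resource directly.
# "nvidia.com/gpu-A100" is a hypothetical resource name.
apiVersion: v1
kind: Pod
metadata:
  name: a100-via-extended-resource
spec:
  containers:
  - name: cuda
    image: nvcr.io/nvidia/cuda:12.3.1-base-ubuntu22.04
    resources:
      limits:
        nvidia.com/gpu-A100: 1
```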

2. Why

  1. There is already similar resource advertising being done for MIG-enabled devices -
nvidia.com/gpu
nvidia.com/mig-1g.5gb
nvidia.com/mig-2g.10gb
nvidia.com/mig-3g.20gb
nvidia.com/mig-7g.40gb
  2. Another reason is that the use of nodeLabels/nodeSelectors may not be possible due to some limitations.

3. Similar existing work

I found a design doc for "Custom Resource Naming and Supporting Multiple GPU SKUs on a Single Node in Kubernetes".

It advertises different GPU types as new resource names, but those different GPU cards must be on the same node. I am not sure whether the same will also work if the corresponding GPU cards/types are on different nodes.

4. Summary of queries

  1. Is the above feature request already supported by the "Similar existing work" mentioned above?
  2. If yes, when will that work be approved and become available?
@klueska
Contributor

klueska commented Jul 24, 2023

There is still no planned support for this in the k8s-device plugin. All of the functionality is there (as described in the link you provided), but it is explicitly disabled by this line in the code https://github.com/NVIDIA/k8s-device-plugin/blob/main/cmd/nvidia-device-plugin/main.go#L322

The future for supporting multiple GPU cards per node is via a new mechanism in Kubernetes called Dynamic Resource Allocation (DRA):
https://docs.google.com/document/d/1BNWqgx_SmZDi-va_V31v3DnuVwYnF2EmN7D-O_fB6Oo/edit
https://github.com/NVIDIA/k8s-dra-driver

@deepanker-s
Author

Hey Kevin,
Thanks for the info.

I was actually asking about specific GPU resource naming for GPUs on different nodes (not on the same node).
But it looks like the answer is the same: DRA can help achieve that as well.

@deepanker-s
Author

deepanker-s commented Aug 8, 2023

Hey Kevin,
I understand now that DRA can be used to specify GPU types (A100, H100) for different pods using "GpuClaimParameters".

Is there any functionality to advertise these specified resources/resourceClaims?

Example -
Using DRA "GpuClaimParameters" (as in the gpu-test6 example), if -

  • podA is scheduled on A100 GPU
  • podB is scheduled on H100 GPU

Will the device plugin advertise the resource usage details - how many A100 devices are being used?
Currently we advertise -
nvidia.com/gpu : 10

Will it provide details such as below in any manner?
nvidia.com/gpu-A100 : 5

@dimm0

dimm0 commented Sep 27, 2023

We're looking to install the yunikorn scheduler on the cluster, and having different resources for different GPUs will help a lot in prioritizing the use of more powerful (and less available) GPUs among users via fair share. It's impossible to do with just labels.

@sjdrc

sjdrc commented Oct 19, 2023

There is still no planned support for this in the k8s-device plugin

Is there a reason why this isn't planned to be implemented here? This seems like an essential feature for any cluster with more than one model of GPU, and there's currently no adequate workaround at all.

@klueska
Contributor

klueska commented Oct 19, 2023

It was a product decision, not an engineering one.

All of the code to support it is merged in the plugin and simply disabled by https://github.com/NVIDIA/k8s-device-plugin/blob/main/cmd/nvidia-device-plugin/main.go#L239.

The decision not to support this gets revisited periodically, but our product team is still not in favor of it, so our hands are tied.

If you want to enable it in a custom build of the plugin, just remove that line referenced above and it should work as described in https://docs.google.com/document/d/1dL67t9IqKC2-xqonMi6DV7W2YNZdkmfX7ibB6Jb-qmk/edit#heading=h.jw5js7865egx.
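Based on the linked design doc, such a custom build would honor a resource-renaming section in the plugin's config file, along these lines. The exact schema and pattern syntax are assumptions here and should be checked against the design doc and the plugin source.

```yaml
# Sketch of a per-SKU resource renaming config for a custom build of
# the k8s-device-plugin, per the "Custom Resource Naming" design doc.
# Field names and pattern syntax may differ from the actual schema.
version: v1
flags:
  migStrategy: none
resources:
  gpus:
  - pattern: "A100-SXM4-40GB"   # match on the GPU model name
    name: gpu-a100              # advertised as nvidia.com/gpu-a100
  - pattern: "Tesla T4"
    name: gpu-t4
```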

@yuzliu

yuzliu commented Oct 22, 2023

@klueska thanks for the explanation. We also explored the extended resource option, and we even have a component we wrote ourselves to patch nodes with GPU extended resources. Just curious, would you be open to adding a flag to turn this feature on/off so we don't have to deploy a customized version of the nvidia device plugin?

@klueska
Contributor

klueska commented Nov 1, 2023

@yuzliu Do you have multiple GPU types per node? If not, are node-labels from GFD / nodeSelectors not enough for your use case?

@yuzliu

yuzliu commented Nov 1, 2023

@klueska Thanks for the reply! We don't have multiple GPU types per node, but we do have multiple GPU types per cluster. We have already deployed GPU feature discovery and have the GPU product label on each GPU node, but that doesn't solve our problem because:

  1. We have clusters with multiple GPU types, e.g. A100 + T4 mixed in one cluster.
  2. We have a ResourceQuota on each namespace, and we want to achieve resource quota enforcement at the namespace level, e.g. namespace A can only use 1 A100 and 5 T4.
  3. We want to collect metrics accurately per GPU type. For example, we'd like to know that namespace A has 4 A100s available, 1 A100 was requested, and 3 are left.
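To make the quota use case concrete: if per-type extended resources were advertised, the namespace-level enforcement could be expressed with a standard ResourceQuota. The resource names below are hypothetical, since the plugin does not currently advertise them.

```yaml
# Hypothetical: namespace-level quota on per-GPU-type extended resources.
# "nvidia.com/gpu-A100" and "nvidia.com/gpu-T4" are illustrative names.
apiVersion: v1
kind: ResourceQuota
metadata:
  name: gpu-type-quota
  namespace: namespace-a
spec:
  hard:
    requests.nvidia.com/gpu-A100: "1"
    requests.nvidia.com/gpu-T4: "5"
```

Extended resources can only be quota-limited via the `requests.*` prefix, which is why the `hard` keys are written that way.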

@klueska
Contributor

klueska commented Nov 1, 2023

Got it -- labels from GPU feature discovery are sufficient for 1, but not for 2 and 3 -- for those you need a unique extended resource per GPU type.

@yuzliu

yuzliu commented Nov 1, 2023

Yep, we even have an internal component to advertise extended resources, e.g. V100, A100 and T4. But I'd really love to have less customized logic internally and instead rely on Nvidia's official component, to make our long-term maintenance easier.


This issue has become stale and will be closed automatically within 30 days if no activity is recorded.

@github-actions github-actions bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 27, 2024
@leoncamel

Any progress on this issue?

@ZDWWWWW

ZDWWWWW commented May 27, 2024

Any progress on this issue?
