NVIDIA Device Plugin Only Exposes One GPU Out of Two GPUs Installed on Single Node #1021
Replies: 2 comments 5 replies
-
Multiple GPU types per node is not supported with the standard device plugin. |
Beta Was this translation helpful? Give feedback.
-
Hey @klueska Some update: Following the discussion here: #424 and information here: https://docs.google.com/document/d/1dL67t9IqKC2-xqonMi6DV7W2YNZdkmfX7ibB6Jb-qmk/edit?tab=t.0 (Many thanks for writing this ! it really helps :) I downloaded the release version of 0.14.5 of Nvidia device plugin: Made the following changes:
Build with Push to dockerhub Edit gpu-operator helm in our cluster changes:
to:
work, and when I exec into the pod and run
New issue, however on gpu-feature-discovery pod that running on the mix-gpu-node, logs showing:
In addition the nvidia-operator-validator pod was in crash-loop-bach.
..
Greatly appreciate any help here. |
Beta Was this translation helpful? Give feedback.
-
Hey all,
"I have an on-premises Kubernetes cluster with multiple nodes. One of these nodes is equipped with two different GPU models:
NVIDIA GeForce RTX 3090 and NVIDIA GeForce RTX 4090
When I SSH into this node and run nvidia-smi, both GPUs are properly detected and displayed.
I have installed the NVIDIA Device Plugin using gpu-operator Helm chart (https://github.com/NVIDIA/gpu-operator/tree/main/deployments/gpu-operator).
However, only the RTX 4090 is being exposed as a resource to Kubernetes.
Here is my current configuration:
Any help ?
Beta Was this translation helpful? Give feedback.
All reactions