Rootless Podman Error: setting up CDI devices: failed to inject devices: failed to stat CDI host device "/dev/dri/renderD129": no such file or directory #885

Open
adi-lb-phoenix opened this issue Jan 30, 2025 · 0 comments

I installed nvidia-container-toolkit in an Ubuntu 22.04 container created with distrobox (using Podman). When I tried to enter the distrobox, I received the error below.

distrobox enter ubuntu 
Error: setting up CDI devices: failed to inject devices: failed to stat CDI host device "/dev/dri/renderD129": no such file or directory

Next, I ran the following command on the host and got the same error:

podman run --rm --device nvidia.com/gpu=all  ubuntu nvidia-smi
Error: setting up CDI devices: failed to inject devices: failed to stat CDI host device "/dev/dri/renderD129": no such file or directory

I then had to log in to an account with sudo access on the host system and run the following to generate the CDI spec:

sudo nvidia-ctk cdi generate --output=/etc/cdi/nvidia.yaml
INFO[0000] Using /usr/lib/x86_64-linux-gnu/libnvidia-ml.so.560.35.03 
INFO[0000] Auto-detected mode as 'nvml'                 
INFO[0000] Selecting /dev/nvidia0 as /dev/nvidia0       
INFO[0000] Selecting /dev/dri/card0 as /dev/dri/card0   
WARN[0000] Could not locate /dev/dri/controlD64: pattern /dev/dri/controlD64 not found 
INFO[0000] Selecting /dev/dri/renderD128 as /dev/dri/renderD128 
INFO[0000] Using driver version 560.35.03               
INFO[0000] Selecting /dev/nvidia-modeset as /dev/nvidia-modeset 
INFO[0000] Selecting /dev/nvidia-uvm-tools as /dev/nvidia-uvm-tools 
INFO[0000] Selecting /dev/nvidia-uvm as /dev/nvidia-uvm 
INFO[0000] Selecting /dev/nvidiactl as /dev/nvidiactl   
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-egl-gbm.so.1.1.1 as /usr/lib/x86_64-linux-gnu/libnvidia-egl-gbm.so.1.1.1 
WARN[0000] Could not locate libnvidia-egl-wayland.so.*.*: pattern libnvidia-egl-wayland.so.*.* not found
64-bit library libnvidia-egl-wayland.so.*.*: not found 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-allocator.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-allocator.so.560.35.03 
WARN[0000] Could not locate libnvidia-vulkan-producer.so.560.35.03: pattern libnvidia-vulkan-producer.so.560.35.03 not found
64-bit library libnvidia-vulkan-producer.so.560.35.03: not found 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/nvidia/xorg/nvidia_drv.so as /usr/lib/x86_64-linux-gnu/nvidia/xorg/nvidia_drv.so 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/nvidia/xorg/libglxserver_nvidia.so.560.35.03 as /usr/lib/x86_64-linux-gnu/nvidia/xorg/libglxserver_nvidia.so.560.35.03 
INFO[0000] Selecting /usr/share/glvnd/egl_vendor.d/10_nvidia.json as /usr/share/glvnd/egl_vendor.d/10_nvidia.json 
INFO[0000] Selecting /usr/share/egl/egl_external_platform.d/15_nvidia_gbm.json as /usr/share/egl/egl_external_platform.d/15_nvidia_gbm.json 
WARN[0000] Could not locate egl/egl_external_platform.d/10_nvidia_wayland.json: pattern egl/egl_external_platform.d/10_nvidia_wayland.json not found 
INFO[0000] Selecting /usr/share/nvidia/nvoptix.bin as /usr/share/nvidia/nvoptix.bin 
INFO[0000] Selecting /usr/share/X11/xorg.conf.d/10-nvidia.conf as /usr/share/X11/xorg.conf.d/10-nvidia.conf 
WARN[0000] Could not locate X11/xorg.conf.d/nvidia-drm-outputclass.conf: pattern X11/xorg.conf.d/nvidia-drm-outputclass.conf not found 
INFO[0000] Selecting /usr/share/vulkan/icd.d/nvidia_icd.json as /etc/vulkan/icd.d/nvidia_icd.json 
WARN[0000] Could not locate vulkan/icd.d/nvidia_layers.json: pattern vulkan/icd.d/nvidia_layers.json not found
pattern vulkan/icd.d/nvidia_layers.json not found 
INFO[0000] Selecting /usr/share/vulkan/implicit_layer.d/nvidia_layers.json as /etc/vulkan/implicit_layer.d/nvidia_layers.json 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libEGL_nvidia.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libEGL_nvidia.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libGLESv1_CM_nvidia.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libGLESv1_CM_nvidia.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libGLESv2_nvidia.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libGLESv2_nvidia.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libGLX_nvidia.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libGLX_nvidia.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libcuda.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libcuda.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libcudadebugger.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libcudadebugger.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvcuvid.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvcuvid.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-allocator.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-allocator.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-cfg.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-cfg.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-eglcore.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-eglcore.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-encode.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-encode.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-fbc.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-fbc.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-glcore.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-glcore.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-glsi.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-glsi.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-glvkspirv.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-glvkspirv.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-gpucomp.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-gpucomp.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-ml.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-ml.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-ngx.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-ngx.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-nvvm.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-nvvm.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-opencl.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-opencl.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-opticalflow.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-opticalflow.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-pkcs11-openssl3.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-pkcs11-openssl3.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-pkcs11.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-pkcs11.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-ptxjitcompiler.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-ptxjitcompiler.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-rtcore.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-rtcore.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvidia-tls.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvidia-tls.so.560.35.03 
INFO[0000] Selecting /usr/lib/x86_64-linux-gnu/libnvoptix.so.560.35.03 as /usr/lib/x86_64-linux-gnu/libnvoptix.so.560.35.03 
INFO[0000] Selecting /run/nvidia-persistenced/socket as /run/nvidia-persistenced/socket 
WARN[0000] Could not locate /nvidia-fabricmanager/socket: pattern /nvidia-fabricmanager/socket not found 
WARN[0000] Could not locate /tmp/nvidia-mps: pattern /tmp/nvidia-mps not found 
INFO[0000] Selecting /lib/firmware/nvidia/560.35.03/gsp_ga10x.bin as /lib/firmware/nvidia/560.35.03/gsp_ga10x.bin 
INFO[0000] Selecting /lib/firmware/nvidia/560.35.03/gsp_tu10x.bin as /lib/firmware/nvidia/560.35.03/gsp_tu10x.bin 
INFO[0000] Selecting /usr/bin/nvidia-smi as /usr/bin/nvidia-smi 
INFO[0000] Selecting /usr/bin/nvidia-debugdump as /usr/bin/nvidia-debugdump 
INFO[0000] Selecting /usr/bin/nvidia-persistenced as /usr/bin/nvidia-persistenced 
INFO[0000] Selecting /usr/bin/nvidia-cuda-mps-control as /usr/bin/nvidia-cuda-mps-control 
INFO[0000] Selecting /usr/bin/nvidia-cuda-mps-server as /usr/bin/nvidia-cuda-mps-server 
INFO[0000] Generated CDI spec with version 0.8.0        

Then I tried again, and this time the command succeeded:

 podman run --rm --device nvidia.com/gpu=all  ubuntu nvidia-smi
Thu Jan 30 11:34:50 2025       
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 560.35.03              Driver Version: 560.35.03      CUDA Version: 12.6     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce RTX 4070 ...    Off |   00000000:01:00.0 Off |                  N/A |
|  0%   42C    P8             11W /  285W |      90MiB /  16376MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+
                                                                                         
+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|  No running processes found                                                             |
+-----------------------------------------------------------------------------------------+

Why does this error occur when I try to enter the distrobox?
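
One way to narrow this down is to check whether every device node named in the CDI spec actually exists on the host. The following is a minimal sketch, assuming the spec is at /etc/cdi/nvidia.yaml (the path used above) and lists each node on its own `path: /dev/...` line. `missing_cdi_devices` is a hypothetical helper name, not part of nvidia-ctk or Podman.

```shell
#!/bin/sh
# Hypothetical helper (not part of nvidia-ctk or Podman): print every /dev
# path listed as `path:` in a CDI spec that does not exist on this host.
# Assumption: the spec is YAML with one `path: /dev/...` entry per line.
missing_cdi_devices() {
    spec="$1"
    # Extract the /dev/... value from each `path:` line and drop duplicates.
    sed -n 's|^[[:space:]]*-\{0,1\}[[:space:]]*path:[[:space:]]*"\{0,1\}\(/dev/[^"[:space:]]*\).*|\1|p' "$spec" |
        sort -u |
        while IFS= read -r dev; do
            # -e is true for any existing file, including device nodes.
            [ -e "$dev" ] || printf '%s\n' "$dev"
        done
}
```

Calling `missing_cdi_devices /etc/cdi/nvidia.yaml` prints nothing when all listed nodes exist. If a spec generated earlier still named /dev/dri/renderD129, that path would be printed here. Note that the generation log above selected /dev/dri/renderD128 instead.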
