What version of Apptainer (or Singularity) are you using? Run
Apptainer> apptainer version
1.1.6-1.el8
Expected behavior
When running nested containers, I would expect to be able to access the GPUs in the nested container, i.e. the GPUs are passed through from the host to Container 1 to Container 2.
Actual behavior
When running nvidia-smi in Container 2, you get the following error:
Apptainer> nvidia-smi
NVIDIA-SMI couldn't find libnvidia-ml.so library in your system. Please make sure that the NVIDIA Display Driver is properly installed and present in your system.
Please also try adding directory that contains libnvidia-ml.so to your system PATH.
This is caused by differences in /.singularity.d/libs between Container 1 and Container 2:
I believe the problem is that the --nv option loses the symlink connection between .so files and their corresponding underlying versioned library, because it does a bind mount for the .so. I think that if it instead created a symlink for .so files in /.singularity.d/libs, it would solve this problem.
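To check the two observations above (the library directory differs between the containers, and unversioned .so names may not be symlinks), something like the following could be run in each container. This is only a minimal sketch: `describe_libs` and `missing_libs` are hypothetical helper names, and the listing files in the usage comments are placeholders.

```shell
# Print one line per entry in a library directory:
#   "<name> symlink -> <target>"  or  "<name> file"
# The default path is the directory discussed in this report.
describe_libs() {
    dir="${1:-/.singularity.d/libs}"
    for f in "$dir"/*; do
        # Skip the unexpanded glob when the directory is empty or missing.
        [ -e "$f" ] || [ -L "$f" ] || continue
        if [ -L "$f" ]; then
            printf '%s symlink -> %s\n' "$(basename "$f")" "$(readlink "$f")"
        else
            printf '%s file\n' "$(basename "$f")"
        fi
    done
}

# Print names that appear in the first listing file but not in the second.
missing_libs() {
    grep -Fxv -f "$2" "$1"
}

# Usage (placeholder file names, assuming $HOME is visible in both containers):
#   In Container 1:  ls /.singularity.d/libs > ~/libs.c1.txt
#   In Container 2:  ls /.singularity.d/libs > ~/libs.c2.txt
#   Then:            missing_libs ~/libs.c1.txt ~/libs.c2.txt
```

If `describe_libs` reports an unversioned .so name as a plain file in the nested container while the host has it as a symlink, that would be consistent with the bind-mount explanation above, though it does not prove it.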
Steps to reproduce this behavior
I fixed this by bind mounting the missing library into container 2:
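The exact command used for this workaround is not shown here. As a rough sketch of what such a bind mount could look like, the hypothetical helper `bind_spec` below builds a comma-separated value for Apptainer's `--bind` option, mapping each file onto the same path inside the nested container. The library file name and the image name in the usage comment are assumptions, not taken from the report.

```shell
# Hypothetical helper: print a comma-separated --bind value that maps each
# given path onto the same path inside the nested container (src:dest).
bind_spec() {
    spec=""
    for path in "$@"; do
        # Append "path:path", adding a comma separator after the first entry.
        spec="${spec:+$spec,}$path:$path"
    done
    printf '%s\n' "$spec"
}

# Usage, run inside Container 1 (library name and image are placeholders):
#   apptainer exec --nv \
#       --bind "$(bind_spec /.singularity.d/libs/libnvidia-ml.so.1)" \
#       container2.sif nvidia-smi
```

Which library needs to be bound can be read from the nvidia-smi error or from a comparison of the two library directories.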
How did you install Apptainer
The first Apptainer comes from CVMFS (oasis.opensciencegrid.org/apptainer/mis/apptainer/bin); the second comes from the RPM.