shiftfs error on /usr/bin/nvidia-persistenced with NVIDIA GPU Operator #111
Comments
@deansheather @johnstcn do either of you have bandwidth to reproduce this and see if we can fix it this sprint? We're pulling it into this sprint because there is a customer request.
@bpmct I'll take a look at this. I'll need to gather some more information first in order to reproduce it.
Took a stab at this on GKE (kernel 5.15.0-1067, Ubuntu 22.04.5, NVIDIA Driver version 550.90.12, operator version v24.6.1, idmapped mounts disabled), but couldn't reproduce this exact issue with the same version of the operator. I did find some other issues we'll want to address, but can't be sure they're directly related to the original issue:
would end up becoming this:
So you'd be missing the
I figure we'd want to use the same dest prefix as
Got some more details, will attempt a bare metal repro.
After some clarification, no need to attempt a bare metal repro. We'll still want to address the above issues found on GKE, either through documentation or code.
The customer reporting this is using the Helm-installed "gpu-operator" chart from https://helm.ngc.nvidia.com/nvidia at version v24.6.1.
They are using ghcr.io/coder/envbox:latest as of last week.