Install the normal CUDA driver (as we do it in our packer script)
Reboot the instance
It hangs forever on reboot
Reproduced
Potential fix
Instead of the normal CUDA driver, install the GRID driver (e.g. by following Azure's guide)
Reproduced
Implementation notes
Azure's guide uses version 535.154.05, which appears to be compatible with the version we use (535.54.03-1). I couldn't find where to download the exact 535.54.03-1 GRID driver. Since both versions appear to be compatible, we could in theory follow the instructions in Azure's guide as they are.
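A minimal sketch of how the "same branch" check on the two version strings could be expressed, e.g. as a sanity check in an image build. It assumes that sharing the major release branch (535) is what makes the two drivers compatible, which is not confirmed here. The helper names `driver_branch` and `same_branch` are hypothetical and not part of dstack or any NVIDIA tooling.

```python
def driver_branch(version: str) -> int:
    """Return the major release branch of a driver version string.

    Assumption: the branch is the first dotted field, and anything after
    a "-" is a packaging suffix (as in "535.54.03-1").
    """
    upstream = version.split("-", 1)[0]  # drop the packaging suffix, if any
    return int(upstream.split(".")[0])   # first dotted field is the branch


def same_branch(a: str, b: str) -> bool:
    """True when both versions belong to the same major release branch."""
    return driver_branch(a) == driver_branch(b)


# GRID driver version from Azure's guide vs. the CUDA driver version we use.
print(same_branch("535.154.05", "535.54.03-1"))
```

Matching branches are only a hint; whether the GRID driver actually works with our CUDA setup still has to be verified on an A10 instance.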
It seems that we need to build a separate AMI for A10 instances on Azure.
Steps to reproduce:
dstack run . -b azure --gpu A10 --spot-auto
Running