-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DeepSpeed internal error on CPU #12607
Comments
Can i get this assigned to me please? |
@myxik Sure! Thank you :) |
The error still appears when using DeepSpeed in combination with Pytorch Lightning on azure |
@gabriead Can you share the error stacktrace? What PyTorch Lightning version are you using? |
Hi @carmocca sure, this is the stacktrace of the first exception:
and the second
Pytorch Lightning Version: latest, 1.6.5 |
I don't think this fix was included in any 1.6.x release. Would you mind trying master? |
Or better yet, the |
🐛 Bug
DeepSpeed raises an internal error when the
Trainer
runs on CPU. I imagine they don't support CPU training so we should raise a MisconfigurationException in that case.To Reproduce
Code
Stacktrace
Expected behavior
Better error message
Environment
cc @Borda @SeanNaren @awaelchli @rohitgr7 @akihironitta
The text was updated successfully, but these errors were encountered: