Run on windows #4
Hi, thanks for your interest! I tested the code on Ubuntu 20.04 with the NCCL backend. Windows does not support the NCCL backend, so you have to use gloo. However, this does not seem to be a Windows-related problem, since I got the same error when I ran with gloo on Ubuntu 😂 Until I figure this out, a temporary solution would be to use DP instead of DDP, which does not require a communication backend. To do this you have to change the parameter of trainer in |
Windows single-GPU training is now supported in my latest commit. Please try it with the same training command given in the README. It is possible to support multi-GPU training on Windows using DP, but that requires more code changes:
see https://pytorch-lightning.readthedocs.io/en/latest/accelerators/gpu_intermediate.html#dp-caveats for more details. The "Invalid scalar type" error encountered with the gloo backend is related to the bool-type parameters used in nerfacc. I'll try to fix this issue when I have time. |
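For anyone skimming the thread, the backend and strategy choice discussed above can be sketched roughly as follows. This is only an illustration, not code from this repository: `pick_backend` and `pick_strategy` are hypothetical helper names, and the strategy strings assume a PyTorch Lightning version that still accepts `"dp"`. Note that switching to gloo alone does not avoid the invalid scalar type error, as mentioned above.

```python
import sys
from typing import Optional


def pick_backend(platform: str = sys.platform) -> str:
    """Hypothetical helper: choose a torch.distributed backend name."""
    # NCCL is not supported on Windows, so gloo is the fallback there.
    return "gloo" if platform.startswith("win") else "nccl"


def pick_strategy(num_gpus: int, platform: str = sys.platform) -> Optional[str]:
    """Hypothetical helper: choose a multi-GPU strategy name, or None."""
    if num_gpus <= 1:
        return None  # single device: no communication backend involved
    # DP runs in a single process and needs no communication backend; DDP does.
    return "dp" if platform.startswith("win") else "ddp"


if __name__ == "__main__":
    print(pick_backend(), pick_strategy(num_gpus=1))
```

The backend string is what would be passed to `torch.distributed.init_process_group(backend=...)`, and the strategy string to the Lightning trainer's `strategy` argument; where exactly that is set in this project is not shown here.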
Thank you very much, that was quick. |
Hi there, thank you for sharing, good work.
I want to run the code on Windows, and it fails with an NCCL error.
So I changed the backend from NCCL to gloo, and an invalid scalar type error popped up.
Do you have any idea why? What environment are you running the code in? Mine is Python 3.10, cudatoolkit 11.3, with torch 1.12.1+cu113.
Much appreciated!