-
Notifications
You must be signed in to change notification settings - Fork 8.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cannot assign 'torch.cuda.LongTensor' as parameter 'step' (torch.nn.Parameter or None expected) #489
Comments
@shoegazerstella I was not able to test parallel GPU training during development since I don't have that kind of hardware. You can add this code to the top of synthesizer_train.py to make it only run on a single GPU for now.
|
Thanks a lot! |
@shoegazerstella I'd like to try fixing this. It will also make your training faster if it works. When you get a chance, could you try changing these lines in synthesizer/model/tacotron.py. Then comment out the If it doesn't fix the problem you should revert the change because I noticed a slight speed improvement with the current code. Old
New
I made this change since Corentin did something similar when he converted fatchord's vocoder, but now I am wondering if it breaks multi-GPU. |
Awesome! It seems it's working. There's just a new warning I am reporting if you need it for reference:
Note that if I load the model created with the old code I have this error, I had to train from scratch in order to make it work.
|
Nice! I pushed the fix. About the warning, you should compare training speed for single GPU and multi-GPU to make sure it is not adding too much overhead. Next, monitor the GPU and memory usage with I would recommend not getting too attached to your first few models, the time is better spent learning what works and adjusting the training schedule for maximum efficiency. Train it to 20k steps, listen to the wavs and look at the plots, and try it in the toolbox (set your expectations low until it gets to 100k steps). |
Going to close this issue, please share updates in #449. Thanks @shoegazerstella ! |
Hi,
I am trying to re-train the synthesizer model as discussed in #449 (comment), but I get this error below:
The text was updated successfully, but these errors were encountered: