-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The learning rate #4
Comments
Hello, I have the same question. Judging by the results of your experiment,How is the sound quality of avocodo compared with hifigan? Is there a suitable super parameter recommendation, such as learning_rate, 2e-3 or 2e-4? I am looking forward to your reply |
Hello, The learning rate of 2e-4 worked better at the end (the official implementation is also using this value). Perhaps there is a typo in the paper. The default setup did not worked well for my training setup. The following additions helped:
The sound quality of avocodo sounds is overall good and theres a reduction on artefacts. |
Thank you for your interest in our paper. Looking at your solutions, it seems that there exist cases that the discriminator failed at training. In our case, some of the discriminators failed to learn when the size of dataset was very small. It makes artifacts occur in outputs. We tried to prevent the failure by adjusting the discriminator's parameters. The solutions you have suggested also seems to be a good solution to the problem. Thanks for the suggestion. |
Thanks for your reply, I will try it. |
Hello,
Thank you for presenting awesome ideas with your work and addressing fundamental issues in previous works.
In the Training Setup section of your paper the learning rate is mentioned as 2e-3 whereas your implementation usws 2e-4.
2e-4 sounds more reasonable (due to hifigan baseline). However, I couldn't achieve a balanced training using this value, which always ended up with slight metallic artifact.
I am 1M steps in with 2e-3 and it looks better - but I still have doubts around it.
Can you explain the discrepancy?
Thank you
The text was updated successfully, but these errors were encountered: