About speechT5 is trainable? #4
Comments
Hi, in the inference code, speecht5 is loaded initially with public weights, but the parameters are overwritten again with state_dict from the VoiceLDM checkpoint.
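To illustrate the loading order described above, here is a minimal PyTorch sketch. It does not use the actual VoiceLDM or SpeechT5 code: `TinyTextEncoder`, `TinyModel`, and the in-memory `checkpoint` dict are hypothetical stand-ins, assumed only for illustration. It shows that a later `load_state_dict` call replaces whatever weights the text encoder was first initialized with.

```python
import torch
from torch import nn


class TinyTextEncoder(nn.Module):
    """Hypothetical stand-in for the SpeechT5 text encoder."""

    def __init__(self):
        super().__init__()
        self.proj = nn.Linear(4, 4)


class TinyModel(nn.Module):
    """Hypothetical stand-in for the full model that wraps the text encoder."""

    def __init__(self, text_encoder):
        super().__init__()
        self.text_encoder = text_encoder
        self.head = nn.Linear(4, 2)


# Step 1: the text encoder starts from "public" weights (simulated with a seed).
torch.manual_seed(0)
public_encoder = TinyTextEncoder()
public_weight = public_encoder.proj.weight.detach().clone()

# A separately trained model plays the role of the fine-tuned checkpoint.
torch.manual_seed(1)
trained = TinyModel(TinyTextEncoder())
checkpoint = {k: v.clone() for k, v in trained.state_dict().items()}

# Step 2: loading the full state_dict overwrites the encoder parameters too,
# because the checkpoint contains keys under the "text_encoder." prefix.
model = TinyModel(public_encoder)
model.load_state_dict(checkpoint)

overwritten = not torch.equal(model.text_encoder.proj.weight, public_weight)
matches_checkpoint = torch.equal(
    model.text_encoder.proj.weight, checkpoint["text_encoder.proj.weight"]
)
print(overwritten, matches_checkpoint)
```

In this sketch the public weights only serve as an initial placeholder; the values used at inference are the ones stored in the checkpoint.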
Thanks for your quick reply!
I have another question, if you don't mind answering.
No, using SpeechT5 isn't strictly necessary; any form of 'text encoder' would likely do the job.
Have you tried freezing the parameters of SpeechT5? Or is it necessary to update the text encoder parameters in this TTS modeling approach?
I have tried both, and found that updating the text encoder's parameters led to better performance.
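For readers who want to compare the two settings, the following is a minimal PyTorch sketch of freezing versus updating a text encoder. It is not taken from the VoiceLDM training code: `set_text_encoder_trainable` is a hypothetical helper, and the two `nn.Linear` layers are stand-ins for the text encoder and the rest of the model.

```python
import torch
from torch import nn


def set_text_encoder_trainable(text_encoder: nn.Module, trainable: bool) -> None:
    """Hypothetical helper: toggle gradient tracking for every encoder parameter."""
    for param in text_encoder.parameters():
        param.requires_grad_(trainable)


# Stand-ins for the text encoder and the rest of the model (assumptions).
text_encoder = nn.Linear(4, 4)
decoder = nn.Linear(4, 2)

# Frozen setting: pass True instead to fine-tune the encoder as well.
set_text_encoder_trainable(text_encoder, False)

# Only parameters that still require gradients are handed to the optimizer.
trainable_params = [
    p
    for module in (text_encoder, decoder)
    for p in module.parameters()
    if p.requires_grad
]
optimizer = torch.optim.AdamW(trainable_params, lr=1e-4)

# One dummy training step to confirm which parameters receive gradients.
loss = decoder(text_encoder(torch.randn(3, 4))).pow(2).mean()
loss.backward()
optimizer.step()

frozen_has_no_grad = all(p.grad is None for p in text_encoder.parameters())
decoder_has_grad = all(p.grad is not None for p in decoder.parameters())
print(frozen_has_no_grad, decoder_has_grad)
```

Switching the flag to `True` gives the fine-tuned setting that was reported above to perform better.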
Thanks for your reply! It's very helpful for my research!
I found that in the training code, speecht5 can be trained.
However, in the inference code, speecht5 is loaded with Microsoft's public weights.
Could you please clarify whether training speecht5 affects the results?