Recommendations for fine-tuning for custom voice #1208
Replies: 3 comments 17 replies
-
Hello Gurus & Friends, An update (not a very good one) on my above post is that I tried fine tuning tts_models--en--ljspeech--tacotron2-DDC_ph using the suggested method:
But the resulting best model did not perform at all. It only generated noise for every text. The training data and the config.json file can be accessed at https://cutt.ly/VOZ0wB5 I'll really appreciate if anyone can please tell what am I missing here? Thanks |
Beta Was this translation helpful? Give feedback.
-
@DesiKeki If you were able to fine-tune your model for a new voice, if possible, can we connect ? I'm facing issues in fine-tuning the model on my own voice, so looking for some guidance. |
Beta Was this translation helpful? Give feedback.
-
hi @DesiKeki, i am trying to perform the same but i am getting a similar outcome (only noise) and was curious to see the config.js you used to do your fine tuning with and the command used as well. You might have noticed i sent a request to get access to your google drive links. Thanks in advance! |
Beta Was this translation helpful? Give feedback.
-
Hello TTS Community,
I am trying to fine tune tts_models--en--ljspeech--tacotron2-DDC_ph for my own voice. I see that documentation https://tts.readthedocs.io/en/latest/finetuning.html gives some instructions to get it working, like using a smaller learning rate and a minimum of 100 audio samples etc.
But I would like to know from the experience of the community what are the recommended do's and don't's for good results. For example, having answers to following questions can be really helpful for newbies like me:
These are the doubts before starting the training. And while the training is happening, it continues for a while. So what are the things that should be noted/monitored to ensure that everything is going fine.
Eg in Evaluation performance metrics,.. what is the acceptable average loss or acceptable average align error etc.
Thanks in advance!
Keki
Beta Was this translation helpful? Give feedback.
All reactions