-
Notifications
You must be signed in to change notification settings - Fork 8.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Try to train Synthesizer #486
Comments
Do you have the LibriSpeech alignments? A link is on this page: https://github.com/CorentinJ/Real-Time-Voice-Cloning/wiki/Training If it can't find the alignment text files then it thinks there's nothing to process for LibriSpeech. |
Yep, see this, but how can I create own one for future fine tuning on my data? |
An alignment file is used to split long utterances into smaller ones. It is unnecessary for datasets like LibriTTS where you can discard samples that are too long and still have a lot of data remaining. See the violin plot below If you are making a custom dataset, just try to make your samples 2 to 7 seconds in length for training and don't bother with generating alignments. You can manually split long utterances yourself. If you have a very large number of files to work with and must automate it, use something like the Montreal Forced Aligner. For finetuning your data just make your dataset look like: #437 (comment) |
Can you help me with creating dataset for training? I use data from https://www.caito.de/2019/01/the-m-ailabs-speech-dataset/ for Ukrainian.
Mb I can do it automatically or something like that? |
@rlutsyshyn I suggest you write a script that does this:
After you do this, you can move files around to make it look like #437 (comment) and the command I provided there should work. |
Hey, how can I contact you? I have some more questions but here is not comfortable to ask it. |
I apologize, I am not available to provide consultation outside of the issues board here. For now, my priorities are 1) code development and 2) bug fixes. I answer support questions as time permits but that is not my purpose here. |
Okay, understood. I have created dataset for new training (just for testing used one speaker), when I start synthesizer_preprocess_audio.py first it seems good, but after I have an error like that:
|
This is in your traceback: Please check that for every |
Thank you for reporting the bug with librosa 0.8.0 @rlutsyshyn . |
Try to train synthesized on train-clean-100 data, but have the next one issue:
can you help me with this? In the next one steps I also want to try train vocoder on that data
The text was updated successfully, but these errors were encountered: