Amount of data to train own voice #165

Rachine · 2019-04-23T06:08:21Z

Hey, I am trying to build a model with your version of Tacotron and my own data in US english.

As I am collecting data and formatting it, I am wondering what is the amount of data necessary to start getting some good results for the target voice? Does anyone know any empirical experiments so that I can set a target.

I have around 3/4h right now.

Thanks for the hard work on the repo

erogol · 2019-04-23T13:18:30Z

Amazon has some papers about the amount of the data needed for TTS. I'd say you can try to finetune one of the released models with your own data. That'd be the easiest way to go. You can also start from scratch but I'd guess the data is not sufficient. In general, my personal estimation is around 15 hours for something reasonable.

Rachine · 2019-04-24T09:02:44Z

Alright thank you very much! Will try that, and tell you if that improves it 👍

erogol closed this as completed Apr 23, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Amount of data to train own voice #165

Amount of data to train own voice #165

Rachine commented Apr 23, 2019

erogol commented Apr 23, 2019

Rachine commented Apr 24, 2019

Amount of data to train own voice #165

Amount of data to train own voice #165

Comments

Rachine commented Apr 23, 2019

erogol commented Apr 23, 2019

Rachine commented Apr 24, 2019