You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Aug 10, 2022. It is now read-only.
Hi @marshonhuckleberry ,
Thanks for interesting in this work.
This is just a vocoder, not a full text-to-speech system, which converts audio features into sound. I worked on this repo in about 2018. At this time, vocoders were too slow to generate sound (i.e. wavenet). It's just a hobby project, and I'm no longer working on this anymore.
If you interest in tts, please use other repos like mozilla/tts or espnet,...
Thanks.
My vocoder needs input is the spectrogram of audio, so you need to generate it somehow (i.e. train neural network to predict spectrogram given text). After that, it's easy to follow the guide to generate audio:
No description provided.
The text was updated successfully, but these errors were encountered: