- Made by: Zahrizhal Ali
This project uses the VITS model (Variational Inference with adversarial learning for end-to-end Text-to-Speech) together with Coqui TTS to generate speech from text. VITS is an end-to-end deep learning model that maps text directly to a speech waveform; Coqui TTS provides the implementation and training tooling used in this project.
To install the Text-to-Speech project using the VITS model and Coqui TTS, follow these steps:
- Clone the repository:
git clone https://github.com/ZahrizhalAli/indonesian-tts-vits.git
- Navigate to the project directory:
cd indonesian-tts-vits
- Install the dependencies (running on Google Colab is recommended)
- Set up your dataset and transcript
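As a sketch of the "set up your dataset and transcript" step: Coqui TTS dataset formatters commonly expect an LJSpeech-style metadata file, with one `file_id|transcription` pair per line. The exact format this repository uses is not stated, so the file name and layout below are assumptions for illustration.

```python
# Hypothetical LJSpeech-style transcript content ("metadata.csv"):
# one pipe-delimited "file_id|transcription" pair per line.
sample_metadata = """\
wavs_0001|Selamat pagi, apa kabar?
wavs_0002|Terima kasih banyak.
"""

def parse_metadata(text):
    """Parse pipe-delimited metadata lines into (file_id, transcript) pairs."""
    pairs = []
    for line in text.strip().splitlines():
        # Split only on the first "|" so transcripts may contain the character.
        file_id, transcript = line.split("|", 1)
        pairs.append((file_id, transcript.strip()))
    return pairs

pairs = parse_metadata(sample_metadata)
print(pairs[0])
```

Each `file_id` is expected to match an audio file in the dataset's `wavs/` directory (again, an assumption based on the common LJSpeech layout).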
The Text-to-Speech project can be customized by modifying the following parameters:
- sample_rate: the sample rate of the output speech (default: 22050)
- epochs: the number of training iterations
- save_checkpoints: save checkpoints so an interrupted training run can be resumed
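The parameters above can be collected into a training configuration. The exact config keys depend on the Coqui TTS version in use, so the dictionary below is an illustrative sketch rather than the project's actual config object.

```python
# Illustrative training configuration; key names are assumptions, not the
# exact Coqui TTS config fields.
training_config = {
    "sample_rate": 22050,      # sample rate of the generated audio (default)
    "epochs": 1000,            # number of training iterations (example value)
    "save_checkpoints": True,  # keep checkpoints so interrupted runs can resume
}

def validate_config(cfg):
    """Basic sanity checks before launching a training run."""
    assert cfg["sample_rate"] > 0, "sample rate must be positive"
    assert cfg["epochs"] >= 1, "need at least one epoch"
    return cfg

validate_config(training_config)
```

Enabling `save_checkpoints` is particularly useful on Google Colab, where sessions can be interrupted and training must be resumed from the last saved checkpoint.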