Tacotron 2 and WaveGlow model TTS

Together, the Tacotron 2 and WaveGlow models form a text-to-speech (TTS) system that synthesises human-like speech from raw text without additional prosody information.

Tacotron 2

Tacotron 2 is a neural network architecture for natural speech synthesis. It consists of a recurrent sequence-to-sequence (seq2seq) feature prediction network that maps character embeddings to mel spectrograms, which are usually fed into a vocoder to generate speech.
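
Before characters can be embedded, the input text has to be turned into a sequence of integer IDs. The following is a minimal sketch of that step only. The symbol table `SYMBOLS` and the helper `text_to_ids` are hypothetical names invented for illustration; the pretrained model ships its own text-processing utilities and symbol set, which may differ.

```python
# Hypothetical symbol table for illustration only; the real model defines its own.
SYMBOLS = "_ abcdefghijklmnopqrstuvwxyz'.,!?"
SYMBOL_TO_ID = {ch: i for i, ch in enumerate(SYMBOLS)}


def text_to_ids(text):
    """Lowercase the text and map each known character to its integer ID.

    Characters that are not in SYMBOLS are silently dropped.
    """
    return [SYMBOL_TO_ID[ch] for ch in text.lower() if ch in SYMBOL_TO_ID]


# Each ID would index a row of a learned embedding matrix inside the network.
ids = text_to_ids("Hello, world!")
```

The resulting ID sequence is what an embedding layer consumes; the seq2seq network then predicts mel spectrogram frames from those embeddings.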

WaveGlow

WaveGlow is a flow-based network that generates speech from mel spectrograms.
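
"Flow-based" means the network is built from invertible transformations, so the same layers can be run in either direction. The sketch below shows one such building block, an affine coupling layer, in plain Python. It is a toy illustration of the idea, not WaveGlow's implementation: the function names are hypothetical, and the fixed lambdas stand in for the learned network that would normally produce the scale and shift (conditioned, in WaveGlow's case, on the mel spectrogram).

```python
import math


def coupling_forward(x, log_scale, shift):
    """Affine coupling: pass the first half through, transform the second half."""
    half = len(x) // 2
    xa, xb = x[:half], x[half:]
    # Scale and shift depend only on the untouched half, which keeps the layer invertible.
    s, t = log_scale(xa), shift(xa)
    yb = [b * math.exp(si) + ti for b, si, ti in zip(xb, s, t)]
    return xa + yb


def coupling_inverse(y, log_scale, shift):
    """Exact inverse of coupling_forward, recomputing s and t from the first half."""
    half = len(y) // 2
    ya, yb = y[:half], y[half:]
    s, t = log_scale(ya), shift(ya)
    xb = [(b - ti) * math.exp(-si) for b, si, ti in zip(yb, s, t)]
    return ya + xb


# Toy stand-ins for the learned conditioning network (assumption for illustration).
toy_log_scale = lambda xa: [0.5 * a for a in xa]
toy_shift = lambda xa: [a + 1.0 for a in xa]

x = [0.2, -0.4, 1.5, 3.0]
y = coupling_forward(x, toy_log_scale, toy_shift)
x_recovered = coupling_inverse(y, toy_log_scale, toy_shift)
```

Because every layer can be inverted exactly, a flow can be trained in one direction (audio to a simple latent distribution) and used in the other (latent samples to audio).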

In this project, pretrained Tacotron 2 and WaveGlow models are loaded from torch.hub.
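
As a rough sketch of what loading and running the two models can look like, the code below uses the entry points NVIDIA publishes on PyTorch Hub. The hub repository string, the entry-point names (`nvidia_tacotron2`, `nvidia_waveglow`, `nvidia_tts_utils`), the `fp16` setting and the 22050 Hz sample rate are assumptions taken from that public hub listing, not from this repository's notebook, which may differ. The helper names `load_models` and `synthesise` are hypothetical.

```python
# Assumed hub source and output sample rate; verify both against the notebook you run.
HUB_REPO = "NVIDIA/DeepLearningExamples:torchhub"
SAMPLE_RATE = 22050


def load_models(device="cuda"):
    """Download the pretrained Tacotron 2 and WaveGlow checkpoints via torch.hub."""
    import torch  # imported lazily so this sketch can be imported without PyTorch

    tacotron2 = torch.hub.load(HUB_REPO, "nvidia_tacotron2", model_math="fp16")
    waveglow = torch.hub.load(HUB_REPO, "nvidia_waveglow", model_math="fp16")
    # Weight normalisation is only needed during training; remove it for inference.
    waveglow = waveglow.remove_weightnorm(waveglow)
    utils = torch.hub.load(HUB_REPO, "nvidia_tts_utils")
    return tacotron2.to(device).eval(), waveglow.to(device).eval(), utils


def synthesise(text, tacotron2, waveglow, utils):
    """Text -> mel spectrogram (Tacotron 2) -> waveform (WaveGlow) as a NumPy array."""
    import torch

    sequences, lengths = utils.prepare_input_sequence([text])
    with torch.no_grad():
        mel, _, _ = tacotron2.infer(sequences, lengths)
        audio = waveglow.infer(mel)
    return audio[0].cpu().numpy()
```

A typical call would be `tacotron2, waveglow, utils = load_models()` followed by `audio = synthesise("Hello world", tacotron2, waveglow, utils)`. The first call downloads the checkpoints and, with the settings assumed here, expects a CUDA-capable GPU.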
