Overview

This Docker image is ready to run Nvidia's implementation of a Tacotron2+WaveGlow neural-network-based Text to Speech (TTS) engine. The image sets up PyTorch, conda, apex, gcc, and various Python dependencies on a debian-slim base.

Setup

Clone Nvidia's deep learning examples repo: GitHub - NVIDIA/DeepLearningExamples
Download the trained models listed below and place them in a folder at 'PyTorch/SpeechSynthesis/Tacotron2/models' inside the cloned repo.
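
The two setup steps can be sketched as shell functions. This is a minimal sketch, not the author's script: the clone URL is assumed from the repo name "NVIDIA/DeepLearningExamples", and the function names clone_repo and make_models_dir are hypothetical helpers introduced for illustration.

```shell
# Assumed clone URL for "GitHub - NVIDIA/DeepLearningExamples".
REPO_URL="https://github.com/NVIDIA/DeepLearningExamples.git"
# Path from the setup step, relative to the directory you clone into.
TACOTRON_DIR="DeepLearningExamples/PyTorch/SpeechSynthesis/Tacotron2"

clone_repo() {
  # Skip the clone when a checkout already exists in the current directory.
  [ -d DeepLearningExamples ] || git clone "$REPO_URL"
}

make_models_dir() {
  # Create the folder that will hold the downloaded trained models.
  mkdir -p "$TACOTRON_DIR/models"
}
```

Calling clone_repo followed by make_models_dir leaves an empty models folder; the downloaded model files still have to be copied into it by hand.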

Inference

Put text that you would like to perform TTS on in a file called speak1.txt in the Tacotron2 folder we've been working from. Use one line of text for each desired output.
Start a container from the 'PyTorch/SpeechSynthesis/Tacotron2' folder.

docker run --gpus all --ipc=host --ulimit memlock=-1 --ulimit stack=67108864 -it -v "${PWD}:/workspace" --rm relicx74/tacotron2-runner:latest
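
Before starting the container, the input file from the first step could be prepared along these lines. The sentences are placeholders chosen for illustration; only the file name speak1.txt and the one-line-per-output rule come from the instructions above. Run this from the Tacotron2 folder.

```shell
# Placeholder sentences; each line of the file yields one synthesized output.
printf '%s\n' \
  "The quick brown fox jumps over the lazy dog." \
  "Text to speech turns written words into audio." \
  > speak1.txt

# Print the line count, which should match the number of outputs you expect.
grep -c '' speak1.txt
```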

Run '/go' to begin inference using the text and models provided.

Synthesized wav files will show up in the 'result' folder.
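
A quick way to confirm that inference produced output is to count the files in that folder. This is a small sketch: count_wavs is a hypothetical helper, and the .wav extension is an assumption about how the output files are named.

```shell
count_wavs() {
  # $1: folder to inspect (defaults to ./result).
  # Counts regular files ending in .wav directly inside that folder.
  find "${1:-result}" -maxdepth 1 -type f -name '*.wav' | wc -l | tr -d ' '
}
```

Running count_wavs from the Tacotron2 folder after '/go' finishes should print one file per line of speak1.txt, if each input line produces exactly one file.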

Optional:
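You can run inference automatically, without an interactive shell, by overriding the entrypoint in your docker run command. Docker treats the --entrypoint value as a single executable, so add --entrypoint bash before the image name and pass /go as an argument after it, rather than quoting 'bash /go' as one value.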
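
As a sketch of the automatic variant, the function below reuses the flags of the interactive command and overrides the entrypoint. Docker takes the --entrypoint value as one executable name, so here bash is the entrypoint and /go is given after the image name, where Docker passes it to the entrypoint as an argument. The function name run_inference is a hypothetical wrapper, and the sketch assumes /go is a script that bash can run.

```shell
run_inference() {
  # Run from the 'PyTorch/SpeechSynthesis/Tacotron2' folder so that
  # speak1.txt, models and result are visible under /workspace.
  docker run --gpus all --ipc=host \
    --ulimit memlock=-1 --ulimit stack=67108864 \
    -it --rm -v "${PWD}:/workspace" \
    --entrypoint bash \
    relicx74/tacotron2-runner:latest \
    /go
}
```

Defining the function does not start anything; call run_inference once the models and speak1.txt are in place.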

Additional details are available from the source: Nvidia's Tacotron2 repo documentation

Trained models:

Credits

A modified Tacotron 2 model from Nvidia, based on "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions"
A flow-based neural network model from Nvidia: "WaveGlow: A Flow-based Generative Network for Speech Synthesis"
GitHub - NVIDIA/DeepLearningExamples
Voice recordings and text annotations used to train the model are from the LJ Speech Dataset

No - Warranty

The provider is not liable for any damages resulting from the use of this Docker image. No steps have been taken to harden the image, and it has not been locked down in any way. It relies on Docker Hub, conda-forge, and Debian GNU/Linux distribution sources, which are outside of my control.

SDShooter/tacotron2-runner
