FastSpeech-Pytorch

The Implementation of FastSpeech Based on Pytorch.

Update (2020/07/20)

Optimize the training process.
Optimize the implementation of length regulator.
Use the same hyper parameter as FastSpeech2.
The measures of the 1, 2 and 3 make the training process 3 times faster than before.
Better speech quality.

Model

My Blog

Prepare Dataset

Download and extract LJSpeech dataset.
Put LJSpeech dataset in data.
Unzip alignments.zip.
Put Nvidia pretrained waveglow model in the waveglow/pretrained_model and rename as waveglow_256channels.pt;
Run python3 preprocess.py.

Training

Run python3 train.py.

Evaluation

Run python3 eval.py.

Notes

In the paper of FastSpeech, authors use pre-trained Transformer-TTS model to provide the target of alignment. I didn't have a well-trained Transformer-TTS model so I use Tacotron2 instead.
I use the same hyper-parameter as FastSpeech2.
The examples of audio are in sample.
pretrained model.

Name		Name	Last commit message	Last commit date
Latest commit History 270 Commits
audio		audio
data		data
img		img
sample		sample
text		text
transformer		transformer
waveglow		waveglow
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
alignments.zip		alignments.zip
dataset.py		dataset.py
eval.py		eval.py
glow.py		glow.py
hparams.py		hparams.py
loss.py		loss.py
model.py		model.py
modules.py		modules.py
optimizer.py		optimizer.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FastSpeech-Pytorch

Update (2020/07/20)

Model

My Blog

Prepare Dataset

Training

Evaluation

Notes

Reference

Repository

Paper

About

Uh oh!

Releases

Packages

Languages

License

chenwaner/FastSpeech

Folders and files

Latest commit

History

Repository files navigation

FastSpeech-Pytorch

Update (2020/07/20)

Model

My Blog

Prepare Dataset

Training

Evaluation

Notes

Reference

Repository

Paper

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages