Add synthesizer preprocessing support for other datasets #441

ghost · 2020-07-23T12:31:16Z

Resolves #413 by adding preprocessing support for other datasets. This is designed with LibriTTS in mind but is compatible with other datasets (e.g. VCTK) by using the same directory structure and file naming convention. This change is designed to be almost transparent for those using LibriSpeech. The training documentation on the wiki requires no update.

These changes have been tested to ensure both LibriSpeech and LibriTTS can be processed and used in training without errors.

Changes

Remove hardcoding of dataset folder name and subfolders in synthesizer/preprocess.py. These are now specified with new command line arguments:
- --dataset_name (defaults to "LibriSpeech")
- --subfolders (defaults to train-clean-100, train-clean-360)
New command line argument "--no_alignments" (default False)
- If True, does not split audio clips. For a given wave file "filename.ext" the preprocessing script will look for the text in "filename.txt" or "filename.normalized.txt" in the same location.

synthesizer/preprocess.py

Co-authored-by: Corentin Jemine <corentin.jemine@gmail.com>

mbdash · 2020-07-23T15:35:14Z

@blue-fish, Would it be useful if I was to offer a GPU (2080 ti) for contributing on training a new model based on LibriTTS ?
I have yet to train any models and would gladly exchange GPU time for an opportunity to learn.
I wonder how long it would take on a single 2080 ti.

blue-fish added 2 commits July 23, 2020 05:05

Add synthesizer preprocessing support for other datasets

688de8e

Allow for spaces in --subfolders argument

2be9b55

ghost requested a review from CorentinJ July 23, 2020 12:31

Remove extraneous with_suffix()

0a1cf41

CorentinJ approved these changes Jul 23, 2020

View reviewed changes

synthesizer/preprocess.py Outdated Show resolved Hide resolved

CorentinJ approved these changes Jul 23, 2020

View reviewed changes

Update synthesizer/preprocess.py

fbe017f

Co-authored-by: Corentin Jemine <corentin.jemine@gmail.com>

ghost merged commit 054f16e into CorentinJ:master Jul 23, 2020

ghost deleted the 413_libritts_support branch July 23, 2020 13:35

This was referenced Jul 23, 2020

How i can train my audio files .to use indian assent . #429

Closed

Request: Add support for torchaudio.datasets #442

Closed

Training a new model based on LibriTTS #449

Closed

Single speaker fine-tuning process and results #437

Closed

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add synthesizer preprocessing support for other datasets #441

Add synthesizer preprocessing support for other datasets #441

ghost commented Jul 23, 2020

mbdash commented Jul 23, 2020

Add synthesizer preprocessing support for other datasets #441

Add synthesizer preprocessing support for other datasets #441

Conversation

ghost commented Jul 23, 2020

Changes

mbdash commented Jul 23, 2020