Add synthesizer preprocessing support for other datasets #441

Merged
4 commits merged into CorentinJ:master from the 413_libritts_support branch on Jul 23, 2020

@ghost ghost commented Jul 23, 2020

Resolves #413 by adding preprocessing support for other datasets. It is designed with LibriTTS in mind but works with other datasets (e.g. VCTK), provided they use the same directory structure and file naming convention. The change should be almost transparent for those using LibriSpeech, and the training documentation on the wiki requires no update.

These changes have been tested to ensure both LibriSpeech and LibriTTS can be processed and used in training without errors.

Changes

  1. Remove hardcoding of dataset folder name and subfolders in synthesizer/preprocess.py. These are now specified with new command line arguments:
    • --dataset_name (defaults to "LibriSpeech")
    • --subfolders (defaults to train-clean-100, train-clean-360)
  2. New command line argument "--no_alignments" (default False)
    • If True, does not split audio clips. For a given wave file "filename.ext" the preprocessing script will look for the text in "filename.txt" or "filename.normalized.txt" in the same location.
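
To illustrate the new options, here is a minimal sketch of how the three arguments and the transcript lookup could fit together. It is not the code from synthesizer/preprocess.py: the helper names `build_parser`, `split_subfolders` and `find_transcript` are hypothetical, and the comma-separated form of `--subfolders`, the `store_true` flag style and the order in which the two transcript names are tried are all assumptions.

```python
import argparse
from pathlib import Path
from typing import List, Optional


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser exposing the three options described above."""
    parser = argparse.ArgumentParser(description="Sketch of the new preprocessing options")
    # Dataset folder name, no longer hardcoded.
    parser.add_argument("--dataset_name", type=str, default="LibriSpeech")
    # Assumption: subfolders are passed as one comma-separated string.
    parser.add_argument("--subfolders", type=str,
                        default="train-clean-100,train-clean-360")
    # Assumption: a boolean switch that is False unless given.
    parser.add_argument("--no_alignments", action="store_true")
    return parser


def split_subfolders(value: str) -> List[str]:
    """Turn 'a, b' into ['a', 'b'], ignoring stray whitespace and empty items."""
    return [part.strip() for part in value.split(",") if part.strip()]


def find_transcript(wav_path: Path) -> Optional[Path]:
    """Look next to 'filename.ext' for 'filename.txt' or 'filename.normalized.txt'.

    The lookup order is an assumption; returns None when neither file exists.
    """
    for suffix in (".txt", ".normalized.txt"):
        candidate = wav_path.parent / (wav_path.stem + suffix)
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    args = build_parser().parse_args(["--dataset_name", "LibriTTS", "--no_alignments"])
    print(args.dataset_name, split_subfolders(args.subfolders), args.no_alignments)
```

With `--no_alignments` set, each wave file would be paired with whichever transcript `find_transcript` returns, and the clip would be processed whole rather than split.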

@ghost ghost requested a review from CorentinJ July 23, 2020 12:31
Co-authored-by: Corentin Jemine <corentin.jemine@gmail.com>
@ghost ghost merged commit 054f16e into CorentinJ:master Jul 23, 2020
@ghost ghost deleted the 413_libritts_support branch July 23, 2020 13:35
@mbdash
Collaborator

mbdash commented Jul 23, 2020

@blue-fish, would it be useful if I offered a GPU (2080 Ti) to help train a new model based on LibriTTS?
I have yet to train any models and would gladly exchange GPU time for an opportunity to learn.
I wonder how long it would take on a single 2080 Ti.

Development

Successfully merging this pull request may close these issues.

Updates for synthesizer training using LibriTTS