Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dev v0.0.10 #2

Merged
merged 254 commits into from
Mar 10, 2021
Merged

Dev v0.0.10 #2

merged 254 commits into from
Mar 10, 2021

Conversation

erogol
Copy link
Member

@erogol erogol commented Feb 22, 2021

🐸 v0.0.10

🐞Bug Fixes

  • Make synthesizer.py saving the output audio with the vocoder sampling rate. It is necessary if there is sampling rates of the tts and the vocoder models are different and interpolation is applied to the tts model output before running the vocoder. Practically, it fixes generated Spanish and French voices by tts or tts-server on the terminal.
  • Handling utf-8 on Windows. (by @adonispujols)
  • Fix Loading the last model when --continue_training. It was loading the best_model regardless.

💾 Code updates

  • Breaking Change: Update default set of characters in symbols.py. This might require you to set your character set in config.json if you like to use this version with your models trained with the previous version.
  • Chinese backend for text processing (Any tips on fixing alignment problems #654 by @kirianguiller)
  • Enable torch.hub integration for the released models.
  • First github release.
  • dep. version fixes. Using numpy > 1.17.5 breaks some tests.
  • WaveRNN fix (by @gerazov )
  • Big refactoring for the training scripts to share the init part of the code. (by @gerazov)
  • Enable ModelManager to download models from Github releases.
  • Add a test for compute_statistics.py
  • light-touch updates in tts and tts-server entry points. (thanks @thorstenMueller )
  • Define default vocoder models for each tts model in .models.json. tts and tts-server entry points use the default vocoder if the user does not specify.
  • find_unique_chars.py to find all the unique characters in a dataset.
  • A better way to handling best models through training. (thx @gerazov )
  • pass used characters to the model config.json at the beginning of the training. This prevents any code update later to affect the trained models.
  • Migration to Github Actions for CI.
  • Deprecate wheel based use of tts-server for the sake of the new design.
  • 🐸

🚶‍♀️ Operational Updates

  • Move released models to Github Releases and deprecate GDrive being the first option.

🏅 Model implementations

  • No updates 😓

🚀 New Pre-Trained Model Releases

  • English ek1 - Tacotron2 model and WaveGrad vocoder under .models.json. (huge THX!! to @nmstoker)
  • Russian Ruslan - Tacotron2-DDC model.
  • Dutch model. (huge THX!! to @r-dh )
  • Chinese Tacotron2 model. (huge THX!! to @kirianguiller)
  • English LJSpeech - SpeechSpeech with WaveNet decoder.

Eren Gölge and others added 30 commits January 28, 2021 13:10
Added info if model already downloaded in --list_models
this should fixe the device mismatch as seen here mozilla/TTS#622 (comment)
fix device mismatch wavegrad training
Set out_path to be required param in compute_statistics.py.
@erogol
Copy link
Member Author

erogol commented Mar 8, 2021

@r-dh would you mind sending your Dutch model PR here? And what vocoder did you use for this model ?

@erogol erogol merged commit 58b6f71 into master Mar 10, 2021
erogol added a commit that referenced this pull request Apr 8, 2021
eginhard referenced this pull request in idiap/coqui-ai-TTS Apr 2, 2024
CI: remove unused check_skip steps
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

10 participants