Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v0.8.0 #1810

Merged
merged 39 commits into from
Aug 22, 2022
Merged

v0.8.0 #1810

merged 39 commits into from
Aug 22, 2022

Commits on Jun 22, 2022

  1. Fix checkpointing GAN models (#1641)

    * checkpoint sae step crash fix
    
    * checkpoint save step crash fix
    
    * Update gan.py
    
    updated requested changes
    
    * crash fix
    manmay-nakhashi authored Jun 22, 2022
    Configuration menu
    Copy the full SHA
    577ec40 View commit details
    Browse the repository at this point in the history

Commits on Jun 27, 2022

  1. Fix the --model_name and --vocoder_name arguments need a <model_type>…

    … element (#1469)
    
    Co-authored-by: Eren Gölge <erogol@hotmail.com>
    camillem and erogol authored Jun 27, 2022
    Configuration menu
    Copy the full SHA
    5c821d9 View commit details
    Browse the repository at this point in the history

Commits on Jul 5, 2022

  1. Fix Publish CI (#1597)

    * Try out manylinux
    
    * temporary removal of useless pipeline
    
    * remove check and use only manylinux
    
    * Try --plat-name
    
    * Add install requirements
    
    * Add back other actions
    
    * Add PR trigger
    
    * Remove conditions
    
    * Fix sythax
    
    * Roll back some changes
    
    * Add other python versions
    
    * Add test pypi upload
    
    * Add username
    
    * Add back __token__ as username
    
    * Modify name of entry to testpypi
    
    * Set it to release only
    
    * Fix version checking
    WeberJulian authored Jul 5, 2022
    Configuration menu
    Copy the full SHA
    9e00e31 View commit details
    Browse the repository at this point in the history

Commits on Jul 6, 2022

  1. Configuration menu
    Copy the full SHA
    5cef6fa View commit details
    Browse the repository at this point in the history

Commits on Jul 11, 2022

  1. Remove redundant config field

    erogol committed Jul 11, 2022
    Configuration menu
    Copy the full SHA
    f1e3559 View commit details
    Browse the repository at this point in the history

Commits on Jul 12, 2022

  1. Fix SSIM loss

    erogol committed Jul 12, 2022
    Configuration menu
    Copy the full SHA
    c17ff17 View commit details
    Browse the repository at this point in the history
  2. Separate loss tests

    erogol committed Jul 12, 2022
    Configuration menu
    Copy the full SHA
    eefd482 View commit details
    Browse the repository at this point in the history
  3. Fix BCELoss adressing #1192

    erogol committed Jul 12, 2022
    Configuration menu
    Copy the full SHA
    a6f73a1 View commit details
    Browse the repository at this point in the history
  4. Make style

    erogol committed Jul 12, 2022
    Configuration menu
    Copy the full SHA
    2cf89b8 View commit details
    Browse the repository at this point in the history
  5. Add durations as aux input for VITS (#1694)

    * Add durations as aux input for VITS
    
    * Make style
    
    * Fix tts_tests
    
    * Fix test_get_aux_input
    WeberJulian authored Jul 12, 2022
    Configuration menu
    Copy the full SHA
    c614f21 View commit details
    Browse the repository at this point in the history
  6. Make lint

    erogol committed Jul 12, 2022
    Configuration menu
    Copy the full SHA
    48a4f36 View commit details
    Browse the repository at this point in the history
  7. feat: updated recipes and lr fix (#1718)

    - updated the recipes activating more losses for more stable training
    - re-enabling guided attention loss
    - fixed a bug about not the correct lr fetched for logging
    a-froghyar authored Jul 12, 2022
    Configuration menu
    Copy the full SHA
    34b80e0 View commit details
    Browse the repository at this point in the history
  8. Implement VitsAudioConfig (#1556)

    * Implement VitsAudioConfig
    
    * Update VITS LJSpeech recipe
    
    * Update VITS VCTK recipe
    
    * Make style
    
    * Add missing decorator
    
    * Add missing param
    
    * Make style
    
    * Update recipes
    
    * Fix test
    
    * Bug fix
    
    * Exclude tests folder
    
    * Make linter
    
    * Make style
    erogol authored Jul 12, 2022
    Configuration menu
    Copy the full SHA
    49bac72 View commit details
    Browse the repository at this point in the history
  9. Fix device allocation

    erogol authored Jul 12, 2022
    Configuration menu
    Copy the full SHA
    bc1f93c View commit details
    Browse the repository at this point in the history

Commits on Jul 13, 2022

  1. Fix SSIM loss correction

    erogol authored Jul 13, 2022
    Configuration menu
    Copy the full SHA
    f7587fc View commit details
    Browse the repository at this point in the history

Commits on Jul 19, 2022

  1. Fix aux tests (#1753)

    * Set n_jobs to 1 for resample script
    
    * Delete resample test
    
    * Set n_jobs 1 in vad test
    
    * delete vad test
    
    * Revert "Delete resample test"
    
    This reverts commit bb7c846.
    
    * Remove tests with resample
    WeberJulian authored Jul 19, 2022
    Configuration menu
    Copy the full SHA
    4f31402 View commit details
    Browse the repository at this point in the history

Commits on Jul 20, 2022

  1. Fix for FloorDiv Function Warning (#1760)

    * Fix for Floor Function Warning
    
    Fix for Floor Function Warning
    
    * Adding double quotes to fix formatting
    
    Adding double quotes to fix formatting
    
    * Update glow_tts.py
    
    * Update glow_tts.py
    iprovalo authored Jul 20, 2022
    Configuration menu
    Copy the full SHA
    903d9c7 View commit details
    Browse the repository at this point in the history
  2. Fix type in download_vctk.sh (#1739)

    typo in comment
    mengting7tw authored Jul 20, 2022
    Configuration menu
    Copy the full SHA
    9d32cbc View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2022

  1. Update decoder.py (#1792)

    Minor comment correction.
    p0p4k authored Jul 26, 2022
    Configuration menu
    Copy the full SHA
    10195c4 View commit details
    Browse the repository at this point in the history
  2. Update requirements.txt (#1791)

    Support for #1775
    p0p4k authored Jul 26, 2022
    Configuration menu
    Copy the full SHA
    669966d View commit details
    Browse the repository at this point in the history
  3. Update README.md (#1776)

    Fix typo in different and code sample
    yuripourre authored Jul 26, 2022
    Configuration menu
    Copy the full SHA
    1a065fa View commit details
    Browse the repository at this point in the history
  4. Fix & update WaveRNN vocoder model (#1749)

    * Fixes KeyError bug. Adding logging to dashboard.
    
    * Make pep8 compliant
    
    * Make style compliant
    
    * Still fixing style
    vanIvan authored Jul 26, 2022
    Configuration menu
    Copy the full SHA
    5094499 View commit details
    Browse the repository at this point in the history

Commits on Aug 1, 2022

  1. Configuration menu
    Copy the full SHA
    7d8b166 View commit details
    Browse the repository at this point in the history
  2. Update requirements.txt; inflect==5.6 (#1809)

    New inflect version (6.0) depends on pydantic which has some issues irrelevant to 🐸 TTS. #1808 
    Force inflect==5.6 (pydantic free) install to solve dependency issue.
    p0p4k authored Aug 1, 2022
    Configuration menu
    Copy the full SHA
    d9bad91 View commit details
    Browse the repository at this point in the history
  3. Update README.md; download progress bar in CLI. (#1797)

    * Update README.md
    
    - minor PR
    - added model_info usage guide based on #1623 in README.md .
    
    * "added tqdm bar for model download"
    
    * Update manage.py
    
    * fixed style
    
    * fixed style
    
    * sort imports
    p0p4k authored Aug 1, 2022
    Configuration menu
    Copy the full SHA
    4fe5080 View commit details
    Browse the repository at this point in the history
  4. Update wavenet.py (#1796)

    * Update wavenet.py
    
    Current version does not use "in_channels" argument. 
    In glowTTS, we use normalizing flows and so "input dim" == "ouput dim" (channels and length). So, the existing code just uses hidden_channel sized tensor as input to first layer as well as outputs hidden_channel sized tensor. 
    However, since it is a generic implementation, I believe it is better to update it for a more general use.
    
    * "in_channels -> hidden_channels"
    p0p4k authored Aug 1, 2022
    Configuration menu
    Copy the full SHA
    903a77c View commit details
    Browse the repository at this point in the history

Commits on Aug 7, 2022

  1. Adjust default to be able to process longer sentences (#1835)

    Running `tts --text "$text" --out_path …` with a somewhat longer
    sentences in the text will lead to warnings like “Decoder stopped with
    max_decoder_steps 500” and the sentences just being cut off in the
    resulting WAV file.
    
    This happens quite frequently when feeding longer texts (e.g. a blog
    post) to `tts`. It's particular frustrating since the error is not
    always obvious in the output. You have to notice that there are missing
    parts. This is something other users seem to have run into as well [1].
    
    This patch simply increases the maximum number of steps allowed for the
    tacotron decoder to fix this issue, resulting in a smoother default
    behavior.
    
    [1] mozilla/TTS#734
    lkiesow authored Aug 7, 2022
    Configuration menu
    Copy the full SHA
    8c64508 View commit details
    Browse the repository at this point in the history
  2. Fix language flags generated by espeak-ng phonemizer (#1801)

    * fix language flags generated by espeak-ng phonemizer
    
    * Style
    
    * Updated language flag regex to consider all language codes alike
    Lokhozt authored Aug 7, 2022
    Configuration menu
    Copy the full SHA
    75ac9e3 View commit details
    Browse the repository at this point in the history
  3. fix get_random_embeddings --> get_random_embedding (#1726)

    * fix get_random_embeddings --> get_random_embedding
    
    function typo leads to training crash, no such function
    
    * fix typo
    
    get_random_embedding
    manmay-nakhashi authored Aug 7, 2022
    Configuration menu
    Copy the full SHA
    7fd9b89 View commit details
    Browse the repository at this point in the history

Commits on Aug 8, 2022

  1. Introduce numpy and torch transforms (#1705)

    * Refactor audio processing functions
    
    * Add tests for numpy transforms
    
    * Fix imports
    
    * Fix imports2
    erogol authored Aug 8, 2022
    Configuration menu
    Copy the full SHA
    d46fbc2 View commit details
    Browse the repository at this point in the history

Commits on Aug 15, 2022

  1. Configuration menu
    Copy the full SHA
    bfc6382 View commit details
    Browse the repository at this point in the history
  2. Update capacitron_layers.py (#1664)

    crashing because of dimension miss match   at line no. 57
    [batch, 256] vs [batch , 1, 512]
    enc_out = torch.cat([enc_out, speaker_embedding], dim=-1)
    manmay-nakhashi authored Aug 15, 2022
    Configuration menu
    Copy the full SHA
    e4db7c5 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    c30b648 View commit details
    Browse the repository at this point in the history
  4. Fix BCE loss issue (#1872)

    * Fix BCE loss issue
    
    * Remove import
    erogol authored Aug 15, 2022
    Configuration menu
    Copy the full SHA
    4333492 View commit details
    Browse the repository at this point in the history
  5. Remove deprecated files (#1873)

    - samplers.py is moved
    - distribute.py is replaces by the 👟Trainer
    erogol authored Aug 15, 2022
    Configuration menu
    Copy the full SHA
    7442bce View commit details
    Browse the repository at this point in the history

Commits on Aug 18, 2022

  1. Configuration menu
    Copy the full SHA
    fcb0bb5 View commit details
    Browse the repository at this point in the history

Commits on Aug 22, 2022

  1. Fix tune wavegrad (#1844)

    * fix imports in tune_wavegrad
    
    * load_config returns Coqpit object instead None
    
    * set action (store true) for flag "--use_cuda"; start to tune if module is running as the main program
    
    * fix var order in the result of batch collating
    
    * make style
    
    * make style with black and isort
    geth-network authored Aug 22, 2022
    Configuration menu
    Copy the full SHA
    2c9f00a View commit details
    Browse the repository at this point in the history
  2. Bump up to v0.8.0

    erogol committed Aug 22, 2022
    Configuration menu
    Copy the full SHA
    8845f06 View commit details
    Browse the repository at this point in the history
  3. Add new DE Thorsten models (#1898)

    - Tacotron2-DDC
    - HifiGAN vocoder
    erogol authored Aug 22, 2022
    Configuration menu
    Copy the full SHA
    e5430a6 View commit details
    Browse the repository at this point in the history