V0.24.2 #94

eginhard · 2024-10-04T10:14:22Z

This feature branch contains all PRs since v0.24.1 except #77 which requires increasing the minimum Pytorch version from 2.1 to 2.4. We can leave that until later.

Previously, e.g. `--use_cuda false` would actually set use_cuda=True: coqui-ai#3762

[ci skip]

Improve CLI handling of boolean arguments

This doesn't convert numbers into English words.

…45) Fixes coqui-ai#3787

Identified necessary code changes with the NPY201 ruff rule. Gruut is the only dependency that doesn't support numpy2 yet. NB: At build time numpy>=2.0.0 should be required to be able to build wheels compatible with both numpy1+2: https://numpy.org/devdocs/dev/depending_on_numpy.html#numpy-2-abi-handling

Add multilingual phoneme cleaner

build: add numpy2 support

….41.1 Fixes #31. The handling of special tokens in `transformers` was changed in huggingface/transformers#30624 and huggingface/transformers#30746. This updates the XTTS streaming code accordingly.

In line with https://github.com/huggingface/transformers/blob/eed9ed679878ada2f6d2eefccdbda368cabc88b1/src/transformers/generation/utils.py

Fix XTTS streaming for transformers update

Already exist as: TTS.tts.layers.vits.stochastic_duration_predictor.DilatedDepthSeparableConv TTS.tts.layers.vits.stochastic_duration_predictor.ElementwiseAffine

torch.range(a, b) == torch.arange(a, b+1) meshgrid indexing: pytorch/pytorch#50276 checkpoint use_reentrant: https://dev-discuss.pytorch.org/t/bc-breaking-update-to-torch-utils-checkpoint-not-passing-in-use-reentrant-flag-will-raise-an-error/1745 optimizer.step() before scheduler.step(): https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate

VC-related refactors and fixes

Except for notebooks, it's only used to show embedding plots during speaker encoder training, in which case a warning is now shown to install it.

Update to coqui-tts-trainer 0.1.4

docs(xtts): fix typo in example

build: move umap-learn into optional notebook dependencies

fix(xtts): load tokenizer file based on config as last resort

* Fix Stream Generator on MacOS * Make it work on mps * Implement custom tensor.isin * Fix for latest TF * Comment out hack for now * Remove unused code * build: increase minimum transformers version * style: fix --------- Co-authored-by: Enno Hermann <Eginhard@users.noreply.github.com>

Avoids hard failures when the audio can't be decoded.

Skip audio files that can't be decoded

4.43.* broke XTTS streaming again

Preparations for Numpy 2 support (gruut, soxr, spacy)

Added proper tokenizer support for Hindi Language which would prevent crash while fine tuning Hindi language. Co-authored-by: Akshat Bhardwaj <157223825+akshatrocky@users.noreply.github.com>

Bark was previously adapted to download Hubert from HuggingFace, so the manual download is superfluous.

Due to breaking change in upload-artifact action: actions/upload-artifact#602

* Add normalizer type C to text cleaners * Linter recommendations * Add unicode normalize to every cleaner * Format test_text_cleaners.py

* Update pyproject.toml * Update pyproject.toml * Update pyproject.toml * Update pyproject.toml * build: simplify requirement restrictions --------- Co-authored-by: Enno Hermann <enno.hermann@idiap.ch>

Use previous release which didn't make the torch.load(..., weights_only=True) change yet.

eginhard and others added 30 commits May 31, 2024 08:39

fix(bin.synthesize): correctly handle boolean arguments

77722cb

Previously, e.g. `--use_cuda false` would actually set use_cuda=True: coqui-ai#3762

fix(utils.generic_utils): correctly call now()

29e91f2

docs: update readme

bdd44cf

docs: fix readthedocs links

03de4b8

[ci skip]

Merge pull request #38 from idiap/cli

063e9e9

Improve CLI handling of boolean arguments

feat(cleaners): add multilingual phoneme cleaner

e5c208d

This doesn't convert numbers into English words.

fix(recipes): use multilingual phoneme cleaner in non-english recipes

a1495d4

chore(cleaners): add type hints

9cfcc0a

fix(freevc): use the specified device for pretrained speaker encoder (#…

3a20f47

…45) Fixes coqui-ai#3787

Merge pull request #44 from idiap/phoneme-cleaners

bd9b21d

Add multilingual phoneme cleaner

Merge pull request #47 from idiap/numpy2

81ac7ab

build: add numpy2 support

refactor(stream_generator): update special tokens for transformers>=4…

4b6da4e

….41.1 Fixes #31. The handling of special tokens in `transformers` was changed in huggingface/transformers#30624 and huggingface/transformers#30746. This updates the XTTS streaming code accordingly.

refactor(stream_generator): update code for transformers>=4.41.1

2a28123

In line with https://github.com/huggingface/transformers/blob/eed9ed679878ada2f6d2eefccdbda368cabc88b1/src/transformers/generation/utils.py

chore(stream_generator): address lint issues

4d9e18e

Merge pull request #46 from idiap/fix-xtts-streaming

98c0f86

Fix XTTS streaming for transformers update

test(helpers): add test_ prefix so tests actually run

c9f7197

test(helpers): fix test_rand_segment, test_generate_path

857cd55

refactor(freevc): use existing layernorm

9f80e04

chore(freevc): remove duplicate DDSConv and ElementwiseAffine

d65bcf6

Already exist as: TTS.tts.layers.vits.stochastic_duration_predictor.DilatedDepthSeparableConv TTS.tts.layers.vits.stochastic_duration_predictor.ElementwiseAffine

fix: clarify types, fix missing functions

cd7b6da

refactor: remove duplicate convert_pad_shape

f8df19a

refactor(freevc): remove duplicate sequence_mask

a755328

chore: remove duplicate init_weights

c30fb0f

refactor: remove duplicate get_padding

4bd3df2

Merge pull request #49 from idiap/vc-refactors

ff2cd5c

VC-related refactors and fixes

build: move umap-learn into optional notebook dependencies

59ef28d

Except for notebooks, it's only used to show embedding plots during speaker encoder training, in which case a warning is now shown to install it.

build: update trainer to 0.1.4

c693b08

refactor: use get_git_branch from trainer

28296c6

eginhard and others added 22 commits June 29, 2024 17:33

ci: test lowest and highest compatible versions of dependencies

8cab2e3

Merge pull request #51 from idiap/update-trainer

c1a929b

Update to coqui-tts-trainer 0.1.4

Update xtts.py (#53)

6ea3b75

docs(xtts): fix typo in example

fix(xtts): load tokenizer file based on config as last resort

9192ef1

Merge pull request #50 from idiap/umap

de35920

build: move umap-learn into optional notebook dependencies

Merge pull request #57 from idiap/xtts-vocab

20583a4

fix(xtts): load tokenizer file based on config as last resort

fix(dataset): skip files where audio length can't be computed

8c460d0

Avoids hard failures when the audio can't be decoded.

chore(dataset): address lint issues

9c604c1

Merge pull request #66 from idiap/skip-broken-audio

19fce2c

Skip audio files that can't be decoded

build: update gruut version for numpy2 support

d304ab2

build: require numpy<2 because spacy/thinc lack support

b1558b0

build: add upper bound for transformers

7014782

4.43.* broke XTTS streaming again

Merge pull request #56 from idiap/update-gruut

204588f

Preparations for Numpy 2 support (gruut, soxr, spacy)

docs(tacotron): fix wrong paper links (#74)

233dfb5

feat(xtts): support hindi in tokenizer (#64)

1920328

Added proper tokenizer support for Hindi Language which would prevent crash while fine tuning Hindi language. Co-authored-by: Akshat Bhardwaj <157223825+akshatrocky@users.noreply.github.com>

chore(bark): remove manual download of hubert model

865bc39

Bark was previously adapted to download Hubert from HuggingFace, so the manual download is superfluous.

ci: explicitly upload hidden files for coverage

242e278

Due to breaking change in upload-artifact action: actions/upload-artifact#602

build: allow numpy2, which should be supported in spacy 3.8 now (#81)

6f8f15e

feat: normalize unicode characters in text cleaners (#85)

1d39246

* Add normalizer type C to text cleaners * Linter recommendations * Add unicode normalize to every cleaner * Format test_text_cleaners.py

fix(build): restrict spacy version to unbreak installation (#92)

4887a2e

* Update pyproject.toml * Update pyproject.toml * Update pyproject.toml * Update pyproject.toml * build: simplify requirement restrictions --------- Co-authored-by: Enno Hermann <enno.hermann@idiap.ch>

build: restrict coqui trainer version

de22d24

Use previous release which didn't make the torch.load(..., weights_only=True) change yet.

eginhard force-pushed the v0.24.2 branch from 5a867e3 to 867cfe5 Compare October 4, 2024 10:21

eginhard added 2 commits October 4, 2024 12:26

ci: switch to cibuildwheel

f667ee4

chore: bump version to 0.24.2

282b2da

eginhard force-pushed the v0.24.2 branch from 867cfe5 to 282b2da Compare October 4, 2024 10:27

colombine-idiap approved these changes Oct 4, 2024

View reviewed changes

Colombine-cyber approved these changes Oct 4, 2024

View reviewed changes

eginhard merged commit 3e1e2b8 into main Oct 4, 2024
49 checks passed

eginhard deleted the v0.24.2 branch October 4, 2024 11:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

V0.24.2 #94

V0.24.2 #94

eginhard commented Oct 4, 2024

V0.24.2 #94

V0.24.2 #94

Conversation

eginhard commented Oct 4, 2024