XTTS v2.0 #3137

Edresson · 2023-11-03T17:06:43Z

Updates with V2

HU and KO added
Updated voice cloning
Trained more so likely to be better across the board.

erogol · 2023-11-04T12:42:46Z

@Edresson Currently cloning does not work correctly. The voice is different than it's supposed to be.

Also, this model can take multiple references, we need a way to allow that.

erogol · 2023-11-04T23:47:05Z

I figured one issue. I was forgetting to pass mel stats.

akx

Would probably be a good idea to run the changed/new files here through make style to avoid those changes later on.

TTS/tts/layers/xtts/gpt.py

TTS/tts/models/xtts.py

akx · 2023-11-06T06:46:21Z

TTS/tts/models/xtts.py

+        # print(" > Input text: ", text)
+        # print(" > Input text preprocessed: ",self.tokenizer.preprocess_text(text, language))
+        # print(" > Input tokens: ", text_tokens)
+        # print(" > Decoded text: ", self.tokenizer.decode(text_tokens[0].cpu().numpy()))


recipes/ljspeech/xtts_v2/train_gpt_xtts.py

akx · 2023-11-06T06:46:59Z

recipes/ljspeech/xtts_v2/train_gpt_xtts.py

+        run_description="""
+            GPT XTTS training
+            """,


Suggested change

run_description="""

GPT XTTS training

""",

run_description="GPT XTTS training",

tests/xtts_tests/test_xtts_v2-0_gpt_train.py

akx · 2023-11-06T06:51:03Z

tests/zoo_tests/test_models.py

+    wav_chuncks = []
+    for i, chunk in enumerate(chunks):
+        if i == 0:
+            assert chunk.shape[-1] > 5000
+        wav_chuncks.append(chunk)
+    assert len(wav_chuncks) > 1


Typo chuncks, but this simplifies to

Suggested change

wav_chuncks = []

for i, chunk in enumerate(chunks):

if i == 0:

assert chunk.shape[-1] > 5000

wav_chuncks.append(chunk)

assert len(wav_chuncks) > 1

wav_chunks = list(chunks) # consume generator

assert wav_chunks[0].shape[-1] > 5000

assert len(wav_chunks) > 1

akx · 2023-11-06T10:14:21Z

@erogol please consider #3127 instead of make styleing everything in this PR. (Running make style made this a +1900 line PR.)

…tts_trainer

Edresson force-pushed the xtts_trainer branch from 95a9c32 to e786f25 Compare November 3, 2023 17:21

Edresson requested review from erogol and WeberJulian November 3, 2023 18:04

Edresson marked this pull request as draft November 4, 2023 14:23

akx reviewed Nov 6, 2023

View reviewed changes

Edresson and others added 22 commits November 6, 2023 11:37

Implement most similar ref training approach

077a849

Use non-enhanced hifigan for test samples

1fb6c20

Add Perceiver

dff3902

Update GPT Trainer for perceiver support

8479a37

Update XTTS docs

5df8f76

Bug fix masking with XTTS perceiver

a032d98

Bug fix on gpt forward

32796fd

Bug Fix on XTTS v2.0 training

cff8542

Add XTTS v2.0 unit tests

8133b10

Add XTTS v2.0 inference unit tests

b621ab1

Bug Fix on diffusion inference

0664c84

Add XTTS v2.0 training recipe

08e0432

Placeholder model entry

d2a2b7a

Add cloning params to config

aa16da9

Make prompt embedding configurable

c182535

Make cloning configurable

b1b6876

Cheap fix for a cheaper fix

2d65f00

Prevent resampling

9b5c295

Update model entry

b47afc5

Update docs

3a8432d

Update requirements

d045bfc

Make style

b094979

erogol added 2 commits November 6, 2023 11:41

Code linting

5580104

Add xtts v2 to sep tests

3e59050

erogol force-pushed the xtts_trainer branch from 0c3dbad to 3e59050 Compare November 6, 2023 10:42

Edresson and others added 5 commits November 6, 2023 09:15

Bug fix on XTTS get_gpt_cond_latents

662ee2b

Bug fix on rebase

00b24ee

Make style

1a9ca35

Bug fix in Japenese tokenizer

a1c441f

Add num2words to deps

2120212

erogol marked this pull request as ready for review November 6, 2023 13:52

Edresson added 2 commits November 6, 2023 10:53

Remove unused kwarg and added num_beams=1 as default

9e92adc

Merge branch 'xtts_trainer' of https://github.com/coqui-ai/TTS into x…

80a3fbc

…tts_trainer

erogol merged commit e45227d into dev Nov 6, 2023
53 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

XTTS v2.0 #3137

XTTS v2.0 #3137

Edresson commented Nov 3, 2023 •

edited by erogol

Loading

erogol commented Nov 4, 2023 •

edited

Loading

erogol commented Nov 4, 2023

akx left a comment

akx Nov 6, 2023

akx Nov 6, 2023

akx Nov 6, 2023 •

edited

Loading

akx commented Nov 6, 2023 •

edited

Loading

XTTS v2.0 #3137

XTTS v2.0 #3137

Conversation

Edresson commented Nov 3, 2023 • edited by erogol Loading

Updates with V2

erogol commented Nov 4, 2023 • edited Loading

erogol commented Nov 4, 2023

akx left a comment

Choose a reason for hiding this comment

akx Nov 6, 2023

Choose a reason for hiding this comment

akx Nov 6, 2023

Choose a reason for hiding this comment

akx Nov 6, 2023 • edited Loading

Choose a reason for hiding this comment

akx commented Nov 6, 2023 • edited Loading

Edresson commented Nov 3, 2023 •

edited by erogol

Loading

erogol commented Nov 4, 2023 •

edited

Loading

akx Nov 6, 2023 •

edited

Loading

akx commented Nov 6, 2023 •

edited

Loading