-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
XTTS v2.0 #3137
XTTS v2.0 #3137
Conversation
95a9c32
to
e786f25
Compare
@Edresson Currently cloning does not work correctly. The voice is different than it's supposed to be. Also, this model can take multiple references, we need a way to allow that. |
I figured one issue. I was forgetting to pass mel stats. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would probably be a good idea to run the changed/new files here through make style
to avoid those changes later on.
# print(" > Input text: ", text) | ||
# print(" > Input text preprocessed: ",self.tokenizer.preprocess_text(text, language)) | ||
# print(" > Input tokens: ", text_tokens) | ||
# print(" > Decoded text: ", self.tokenizer.decode(text_tokens[0].cpu().numpy())) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Dead code?
run_description=""" | ||
GPT XTTS training | ||
""", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
run_description=""" | |
GPT XTTS training | |
""", | |
run_description="GPT XTTS training", |
wav_chuncks = [] | ||
for i, chunk in enumerate(chunks): | ||
if i == 0: | ||
assert chunk.shape[-1] > 5000 | ||
wav_chuncks.append(chunk) | ||
assert len(wav_chuncks) > 1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Typo chuncks
, but this simplifies to
wav_chuncks = [] | |
for i, chunk in enumerate(chunks): | |
if i == 0: | |
assert chunk.shape[-1] > 5000 | |
wav_chuncks.append(chunk) | |
assert len(wav_chuncks) > 1 | |
wav_chunks = list(chunks) # consume generator | |
assert wav_chunks[0].shape[-1] > 5000 | |
assert len(wav_chunks) > 1 |
Updates with V2