tortoise-tts-fast #18

Closed
HobisPL opened this issue May 29, 2023 · 8 comments

HobisPL commented May 29, 2023

Have you considered adding Tortoise-TTS-Fast instead of the original Tortoise-TTS? It runs much faster, and you can add your own models to it. It is also better for cloning voices because you can extract the latents from whole audio samples, making them more sophisticated.
https://github.com/152334H/tortoise-tts-fast

Owner

rsxdalv commented May 29, 2023 via email

Owner

rsxdalv commented Jun 4, 2023

That repo has a very strong copyleft license (AGPL v3).
However, this repo is under the same Apache 2.0 license: https://git.ecker.tech/mrq/tortoise-tts
Do you know if the mrq repo is worthwhile?
Also, tortoise-tts-fast offers a web UI that could be useful.

If that repo is very important, you could fork this webui and swap tortoise-tts for tortoise-tts-fast.

Owner

rsxdalv commented Jun 5, 2023

OK, so as far as I have tested it:
The API seems compatible, so if you replace the tortoise/ folder with this fork, it should work.
It does seem to install partially, but there are some incompatibilities with torch versions, and I'm not sure they will be fixable any time soon.

This is the error I got:
ImportError: cannot import name 'fail_with_message' from 'torchaudio._internal.module_utils'

For installation I used:

# Installing TortoiseTTS Fast - AGPLv3
git clone https://github.com/152334H/tortoise-tts-fast tortoise_tts_fast
cd tortoise_tts_fast
pip install --ignore-installed llvmlite -e .
pip install --ignore-installed llvmlite git+https://github.com/152334H/BigVGAN.git
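
For anyone hitting the same ImportError, here is a minimal sketch for narrowing it down. It only reports which torch and torchaudio versions are installed and whether the symbol from the traceback can be found. The helper names `has_symbol` and `installed_version` are made up for illustration, and this diagnoses the mismatch rather than fixing it.

```python
import importlib
from importlib import metadata


def has_symbol(module_name: str, attr: str) -> bool:
    """Return True if `module_name` imports cleanly and exposes `attr`."""
    try:
        module = importlib.import_module(module_name)
    except Exception:
        # A missing module raises ImportError; a broken binary install can
        # raise other errors, so treat any failure as "not available".
        return False
    return hasattr(module, attr)


def installed_version(dist_name: str):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Print the versions that pip actually resolved in this environment.
    for dist in ("torch", "torchaudio"):
        print(dist, installed_version(dist))
    # The module path and symbol name are taken from the error message above.
    print(
        "fail_with_message available:",
        has_symbol("torchaudio._internal.module_utils", "fail_with_message"),
    )
```

If the last line prints False while torchaudio is installed, the installed torchaudio probably does not match what the fork expects.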

Owner

rsxdalv commented Jun 19, 2023

Improved performance and UI, although tortoise-tts-fast should still be faster.
#45

Author

HobisPL commented Jun 19, 2023

I found a repository that already uses Gradio, and it includes training and many cool options.
https://git.ecker.tech/mrq/ai-voice-clonin

@manmay-nakhashi

@rsxdalv you can update this repo with the new API call:

tts = api.TextToSpeech(use_deepspeed=True, kv_cache=True, half=True)
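
A rough sketch of how the call could be guarded when DeepSpeed is not installed. The three keyword arguments are the ones from the line above. The `from tortoise import api` import path and the helper names `build_tts_kwargs` and `make_tts` are assumptions for illustration, not something confirmed in this thread.

```python
import importlib.util


def build_tts_kwargs(prefer_deepspeed: bool = True) -> dict:
    """Build keyword arguments for TextToSpeech.

    DeepSpeed is only enabled when the package can be found, so the same
    code still runs on machines where it is not installed.
    """
    has_deepspeed = importlib.util.find_spec("deepspeed") is not None
    return {
        "use_deepspeed": prefer_deepspeed and has_deepspeed,
        "kv_cache": True,
        "half": True,
    }


def make_tts():
    """Create the TextToSpeech object (requires tortoise to be installed)."""
    # Assumed import path; imported lazily so the helper above stays usable
    # without tortoise present.
    from tortoise import api

    return api.TextToSpeech(**build_tts_kwargs())
```

Calling `make_tts()` then falls back to `use_deepspeed=False` instead of failing when DeepSpeed is missing.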

rsxdalv added a commit that referenced this issue Jul 19, 2023
* add latest tortoise improvements
* update readme

Owner

rsxdalv commented Jul 19, 2023

Implemented in the newest update. I didn't add automatic DeepSpeed installation since it often fails on Windows.

Owner

rsxdalv commented Jul 20, 2023

I'm considering this resolved for now. Feel free to reopen this or another issue for more tortoise optimizations.

rsxdalv closed this as completed Jul 20, 2023