This fork contains a completely reworked Streamlit GUI. The 152334H fork of Tortoise-TTS has the best likeness to imported voices at the moment, in my humble opinion. The mrq version has much better nuances and control but adds an American accent to most of my imported voices that are not fine tuned which was driving me nuts. I started working on this because I just wanted to have the extra settings mrq had on their Gradio UI. So I taught myself some Streamlit and python and got the GUI working. I've also fixed a few annoying issues along the way and added some extra features. I'm not sure how far I can develop this or keep it updated with my knowledge but for the time being it works.
- Fully reworked GUI
- Fully control all generation parameters.
- Generate and dump conditioning latent for faster generations or for use with other ML models.
- Download multiple generated candidates.
- Will this work on an older 8gb and under card? Yes it will. Just check Low VRAM in the gui.
- Is the install easy? It relatively is. There is a proper (AHEM!) step by step guide below.
- Can I save conditional latents? Yes you can!
- CudaToolkit : Download from here and install. Choose custom install and uncheck everything except the cudatoolkit option. Your graphic driver will have more updated drivers for the other components you DON'T WANT TO CHANGE THAT.
- Visual C + : Download from here and install. Make sure you select Use in Python
To install on Windows please follow these steps (It was very unclear in the original repo and a lot of users had issues) : Open a miniconda/anaconda terminal as an administrator
conda create --name ttsgui python==3.8
conda activate ttsgui
If you have Tortoise installed before next run :
pip uninstall tortoise
Install Cuda 11 to your environment
conda install cudatoolkit=11.7
Next navigate to the drive/folder where you want the tortoise-fast-tts folder to be extracted to. Then run :
git clone https://github.com/Acephalia/tortoise-tts-fast-GUI
cd tortoise-tts-fast-GUI
conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=11.7 -c pytorch -c nvidia
pip install -e .
pip install git+https://github.com/152334H/BigVGAN.git
To Run open anaconda/miniconda terminal :
conda activate ttsgui
s:
cd tortoise-tts-fast-GUI
streamlit run app.py
Set your drive letter and path in my case s:/tortoise-tts-fast-GUI
Install Poetry into your environment
conda activate ttsgui
s:
cd tortoise-tts-fast-GUI
pip install poetry
poetry install
poetry shell
Set your drive letter and path in my case s:/tortoise-tts-fast-GUI
If you encounter any messages stating xyz module is missing or the likes simply go back into your env and run
pip install xyz
Replace xyz and repeat till all missing modules are installed.
Add remaining settings- Fix files being overwritten in results folder
Fix Download Audio fileFix multiple candidate downloadAdd voice refreshAdd GUI to create and download conditional latentsAutomatically create conditional latents in voice folder
Coming Soon
Visit the original repo for more info.