Utilizes https://github.com/SWivid/F5-TTS,
This is a simple grpc tts server that can run inference for a single voice and prompt at a time. Each generation is returned back as bytes of a wav pcm file.
This is designed to be used in a pipeline with multiple instances ran with different configurations of voices and ports.
The server probably can handle a few client connections at a time. Voice generation is serialized between client connections so the more clients the more lag between generations may be experienced.
- potentially a docker container with multiple instances
- example integration
- a plexer that wraps all of the python processes and exposes a hgher level abstraction via grpc that will support multiple voices. (go implementation)
git clone https://github.com/beeblebrox/ff5-ttsgrpc
git submodule update --recursive
pip install -r requirements.txt
Edit setup-example.py and move to setup.py (or whatever name you want). Create one for each different instance you need.
ex:
python serve.py setup
python serve.py setup2
From the base directory after venv is setup and sourced:
python generateproto.py
Make sure you add the directory /F5-TTS to your system path or marked as source in project.