Streaming inference for XTTS 🚀 #3035

WeberJulian · 2023-10-06T09:53:26Z

Replace the slow diffusion model with our own HiFi-GAN variant model
Add DeepSpeed as an option for a large GPT speedup
Add support for streaming output audio
Add the corresponding documentation
Add a toy streaming server example
Add additional XTTS tests
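
To illustrate what streaming output means for a client, here is a minimal sketch of consuming audio chunk by chunk instead of waiting for the full waveform. It does not use the XTTS API: `fake_stream`, `consume_stream`, and `SAMPLE_RATE` are hypothetical names introduced only for this example, and the generator yields silence rather than synthesized speech.

```python
import time
from typing import Iterable, Iterator, List, Tuple

SAMPLE_RATE = 24_000  # assumed output rate, for illustration only


def fake_stream(total_samples: int, chunk_size: int) -> Iterator[List[float]]:
    """Hypothetical stand-in for a streaming TTS generator.

    Yields the audio in chunks of at most `chunk_size` samples.
    """
    for start in range(0, total_samples, chunk_size):
        n = min(chunk_size, total_samples - start)
        yield [0.0] * n  # silence instead of real synthesized audio


def consume_stream(chunks: Iterable[List[float]]) -> Tuple[List[float], float, int]:
    """Collect chunks as they arrive.

    Returns (full audio, seconds until the first chunk arrived, chunk count).
    """
    start = time.perf_counter()
    first_chunk_latency = None
    audio: List[float] = []
    count = 0
    for chunk in chunks:
        if first_chunk_latency is None:
            first_chunk_latency = time.perf_counter() - start
        # A real client would hand the chunk to an audio device here.
        audio.extend(chunk)
        count += 1
    if first_chunk_latency is None:
        first_chunk_latency = 0.0  # empty stream
    return audio, first_chunk_latency, count
```

The point of the pattern is that playback can begin after the first chunk, so the latency a listener perceives is the time to the first chunk rather than the time to synthesize the whole utterance.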

* add cli options for play and speed
  --play argument uses simpleaudio to play the tts wav
  --speed <float 0.0-2.0> passes speed argument to Coqui Studio models
* remove simpleaudio not referenced in file
* fix simpleaudio dependency version
* add ALSA headers for simpleaudio compilation
* Dockerfile ALSA headers for simpleaudio
* base changes to use stdout instead of play audio
  Considering conversion to pipe wav data for audio playback with another program like aplay. This is incomplete code. Using to get feedback before proceeding with implementation.
* remove play for pipe_out arg that suppresses stdout
  removed play and simpleaudio dependency in place of pipe functionality to allow passing wav file data to a program dedicated to playing audio.
* scipy.io.wavfile.write fails with /dev/null target
* Streaming inference for XTTS 🚀 (#3035)
* v0.17.7
* Redownload XTTS if the local and remote config do not match
* Remove unused method
* Print a message when it is already downloaded
* Try-except to present error when the user doesn't have a connection
* Fix style
* 0.17.8
* v0.17.8

---------

Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
Co-authored-by: Edresson Casanova <edresson1@gmail.com>
Co-authored-by: ggoknar <ggoknar@coqui.ai>
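
The commit message above mentions piping WAV data to an external player such as aplay instead of playing it in-process. The following is a minimal sketch of that idea, not the code from those commits: it swaps in the standard-library `wave` module rather than `scipy.io.wavfile.write`, and `write_wav` is a hypothetical helper name.

```python
import io
import struct
import wave
from typing import BinaryIO, Sequence


def write_wav(stream: BinaryIO, samples: Sequence[float], sample_rate: int = 22050) -> None:
    """Encode float samples in [-1, 1] as 16-bit mono PCM WAV and write them to `stream`.

    Hypothetical helper for illustration; out-of-range samples are clipped.
    """
    pcm = b"".join(
        struct.pack("<h", int(max(-1.0, min(1.0, s)) * 32767)) for s in samples
    )
    # Build the file in memory first, so the target stream never has to
    # support seeking (a pipe does not).
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    stream.write(buffer.getvalue())
```

Calling `write_wav(sys.stdout.buffer, samples)` would send the encoded file to standard output, where a dedicated player can read it from the pipe. Buffering the whole file in memory is a design choice for the sketch, and it trades memory for independence from the target's seek support.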

WeberJulian added 8 commits October 6, 2023 11:41

Add support for hifigan and streaming

e7a91be

Update the checkpoints

abfc738

Use load_audio in get_speaker_embedding

a0f657c

Add inference with precomputed latents

a45cf83

Add credit for stream_generator

2ecf84a

Loading only one decoder and removing lazy loading

0d36dcf

Add inference_mode

1d752b7

Add documentation

be51205

erogol changed the title ~~Faster inference for XTTS 🚀~~ Streaming inference for XTTS 🚀 Oct 6, 2023

WeberJulian added 5 commits October 6, 2023 14:40

small fixes

a097541

Add streaming test

a357f81

Remove deterministic inference

3063846

Fixing english tokenization

1ec3418

2nd version of the tokenizer fix

2fdf51e

gorkemgoknar marked this pull request as ready for review October 6, 2023 16:09

erogol merged commit e5e0cbf into dev Oct 6, 2023
49 checks passed

David-bfg pushed a commit to David-bfg/TTS that referenced this pull request Oct 9, 2023

Streaming inference for XTTS 🚀 (coqui-ai#3035)

098fa07
