Implement Mimic 3 engine (TTS) #30

PeterBowman · 2023-03-08T23:07:53Z

@rsantos88 has found a "fast, privacy-focused, open-source, neural Text to Speech (TTS) engine" that looks great:

https://mycroft-ai.gitbook.io/docs/mycroft-technologies/mimic-tts/mimic-3
https://github.com/MycroftAI/mimic3
https://github.com/MycroftAI/mimic3-voices

It is lightweight, offline, and features human-like voice (as opposed to the more robotic one we currently use via eSpeak). It is written in Python and can be installed through pip (mycroft-mimic3-tts package). There are four available Spanish voices (3 male, 1 female) compiled in two datasets; the "tux" voice from the "m-ailabs" dataset sounds quite appealing.

Sample invocation:

mimic3 --voice es_ES/m-ailabs#tux "hola, me llamo teo y tengo 10 años"

(or simply --voice es_ES/m-ailabs since tux is the default voice)

Pro tip: add --cuda to enable GPU acceleration (requires the "onnxruntime-gpu" pip package).

I'm thinking of a Python client implementation of our TextToSpeech IDL service similar to speechRecognition.py.

The text was updated successfully, but these errors were encountered:

PeterBowman · 2023-03-08T23:12:47Z

As a side note regarding installation: although the voice models should be automatically downloaded by the CLI app and stored in ${HOME}/.local/share/mycroft/mimic3/voices on first use, in my case the process got stuck every time and I had to complete it manually. I provided the link to the mimic3-voices in the previous comment. Note that there is a generator.onnx inside each voice directory that is handled by Git LFS (which weighs around 60-70 MB). It needs to be downloaded and pasted in the correct location separately.

See also https://mycroft-ai.gitbook.io/docs/mycroft-technologies/mimic-tts/mimic-3#downloading-voices and the mimic3-download command.

PeterBowman · 2023-03-09T01:29:36Z

Already working (not fully implemented) at 7fadf45.

@rsantos88 in case you want to use this in the upcoming demos, assuming you have installed the Spanish voices, launch it with:

speechSynthesis --voice es_ES/m-ailabs --speaker tux --port /teo/tts

On the teo-self-presentation side, pass --language es_ES/m-ailabs#tux to dialogueManager ~~and change the output port in the yarpmanager's connections tab from "/teo/tts/rpc:s" to "/speechSynthesis/rpc:s"~~ (edit: added --port).

PeterBowman · 2023-03-09T13:18:22Z

Done at 618cb83, see speechSynthesis.py. All IDL commands have been implemented except the pitch accessors, pause and resume. I'd consider expanding the API with volume commands and renaming "language" to "voice", which might or might not include speaker information.

There are two caveats to this implementation/engine:

On certain voices, including the Spanish ones, the last letters/vowels are trimmed from the synthesised result: TTS. Last letter of the text won't be spoken in spanish MycroftAI/mimic3#30 and Last character with polish voice is always cutten MycroftAI/mimic3-voices#4. It can be worked around for now by repeating the last letter, e.g. "holaa" instead of "hola".
MycroftAI has undergone staff reduction recently and seems to be on radio silence since the last blog post by their CEO back in January. For this reason, I'm forking their relevant repos into our org.

synesthesiam · 2023-03-20T17:09:24Z

Author of Mimic 3 here. I'm continuing my TTS work elsewhere: https://github.com/rhasspy/larynx2/

PeterBowman · 2023-12-12T21:00:50Z

Author of Mimic 3 here. I'm continuing my TTS work elsewhere: https://github.com/rhasspy/larynx2/

Thank you for the heads-up and your great work! We have migrated from Mimic 3 to Piper (current name) at #33.

PeterBowman self-assigned this Mar 8, 2023

PeterBowman closed this as completed Mar 9, 2023

PeterBowman mentioned this issue Jul 24, 2023

Replace ad-hoc subprocess with python-sounddevice stream callbacks in TTS #33

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Mimic 3 engine (TTS) #30

Implement Mimic 3 engine (TTS) #30

PeterBowman commented Mar 8, 2023 •

edited

Loading

PeterBowman commented Mar 8, 2023

PeterBowman commented Mar 9, 2023 •

edited

Loading

PeterBowman commented Mar 9, 2023 •

edited

Loading

synesthesiam commented Mar 20, 2023

PeterBowman commented Dec 12, 2023

Implement Mimic 3 engine (TTS) #30

Implement Mimic 3 engine (TTS) #30

Comments

PeterBowman commented Mar 8, 2023 • edited Loading

PeterBowman commented Mar 8, 2023

PeterBowman commented Mar 9, 2023 • edited Loading

PeterBowman commented Mar 9, 2023 • edited Loading

synesthesiam commented Mar 20, 2023

PeterBowman commented Dec 12, 2023

PeterBowman commented Mar 8, 2023 •

edited

Loading

PeterBowman commented Mar 9, 2023 •

edited

Loading

PeterBowman commented Mar 9, 2023 •

edited

Loading