You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What should we do for text that contains multiple languages? Since inference sends everything to eSpeak with a fixed language setting, eSpeak does not handle it well!
To Reproduce
Since inference sends everything to eSpeak with a fixed language setting, eSpeak does not handle it well!
Expected behavior
well phonemizer working
Logs
No response
Environment
What should we dofor text that contains multiple languages? Since inference sends everything to eSpeak with a fixed language setting, eSpeak does not handle it well!
Additional context
No response
The text was updated successfully, but these errors were encountered:
Yes, there is currently no way to do this directly in Coqui. But you can do mixed-language TTS with Vits/YourTTS models if you add some custom code (including calling Espeak separately per language if you don't use grapheme-based models), see #104 for details.
Integrating this would first need SSML support (see previous discussions in coqui-ai#752), which is very complex, so closing as not planned for now, but I'm open to contributions regarding SSML.
Describe the bug
What should we do for text that contains multiple languages? Since inference sends everything to eSpeak with a fixed language setting, eSpeak does not handle it well!
To Reproduce
Since inference sends everything to eSpeak with a fixed language setting, eSpeak does not handle it well!
Expected behavior
well phonemizer working
Logs
No response
Environment
Additional context
No response
The text was updated successfully, but these errors were encountered: