[Bug] Mix language inference text #115

Closed
cod3r0k opened this issue Oct 28, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@cod3r0k

cod3r0k commented Oct 28, 2024

Describe the bug

What should we do for text that contains multiple languages? Since inference sends everything to eSpeak with a fixed language setting, eSpeak does not handle it well!

To Reproduce

Since inference sends everything to eSpeak with a fixed language setting, eSpeak does not handle it well!

Expected behavior

The phonemizer should handle mixed-language text correctly.

Logs

No response

Environment

What should we do for text that contains multiple languages? Since inference sends everything to eSpeak with a fixed language setting, eSpeak does not handle it well!

Additional context

No response

@cod3r0k added the bug label Oct 28, 2024
@eginhard
Member

Yes, there is currently no way to do this directly in Coqui. But you can do mixed-language TTS with VITS/YourTTS models if you add some custom code (including calling eSpeak separately per language if you don't use grapheme-based models); see #104 for details.

Integrating this would first need SSML support (see previous discussions in coqui-ai#752), which is very complex, so closing as not planned for now, but I'm open to contributions regarding SSML.
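
The "call eSpeak separately per language" idea can be sketched roughly as follows. This is only a minimal illustration, not Coqui's API or the code from #104: the function names `script_lang`, `split_by_language` and `phonemize_runs` are hypothetical, and detecting the language from the Unicode character name is an assumed, crude heuristic. It only separates Arabic-script text from everything else, so it cannot tell Persian from Arabic, or English from German. It also assumes the `espeak-ng` binary is installed and that the `fa` and `en` voices are available.

```python
import shutil
import subprocess
import unicodedata


def script_lang(ch, default="en"):
    """Guess a language code for one character (hypothetical heuristic)."""
    if not ch.isalpha():
        return None  # neutral: spaces, digits, punctuation
    # Assumption: every Arabic-script letter is treated as Persian.
    if unicodedata.name(ch, "").startswith("ARABIC"):
        return "fa"
    return default


def split_by_language(text):
    """Split text into (lang, chunk) runs of consecutive same-script letters."""
    runs = []     # list of [lang, chunk]
    pending = ""  # neutral characters seen before the first letter
    for ch in text:
        lang = script_lang(ch)
        if lang is None:
            # Neutral characters stay attached to the current run.
            if runs:
                runs[-1][1] += ch
            else:
                pending += ch
        elif runs and runs[-1][0] == lang:
            runs[-1][1] += ch
        else:
            runs.append([lang, pending + ch])
            pending = ""
    return [(lang, chunk.strip()) for lang, chunk in runs if chunk.strip()]


def phonemize_runs(runs):
    """Call espeak-ng once per run, each with its own voice, and join the IPA."""
    if shutil.which("espeak-ng") is None:
        raise RuntimeError("espeak-ng not found on PATH")
    phonemes = []
    for lang, chunk in runs:
        result = subprocess.run(
            ["espeak-ng", "-q", "--ipa", "-v", lang, chunk],
            capture_output=True, text=True, check=True,
        )
        phonemes.append(result.stdout.strip())
    return " ".join(phonemes)
```

With this sketch, `phonemize_runs(split_by_language(text))` would phonemize each run with its own voice. The phoneme string would still have to be mapped onto the model's own symbol set, which this sketch does not attempt.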

@eginhard closed this as not planned Oct 28, 2024
@cod3r0k
Author

cod3r0k commented Nov 3, 2024

Hi, I checked eSpeak with the Persian (fa) language and found that it also handles English. Isn't that enough? @eginhard

@eginhard
Member

eginhard commented Nov 4, 2024

If you just need to mix Persian and English, maybe? If you want to also mix other languages, then maybe not? I don't know your use case.

@cod3r0k
Author

cod3r0k commented Nov 4, 2024

For the first version, yes, just Persian and English.
Maybe Arabic in the next version.
In the last one, German (it is different from all of them).

But step by step. For the first step (Persian and English), do you have any concerns?
