ability to use models other than xtts? #99
Comments
Hi @0xYc0d0ne Not currently, no. It's something I'm considering; however, there will be a chunk of code to rewrite to make it integrate with other models. There is currently no way to drop in another model in place of XTTS. Thanks |
@erew123 I have an experimental fork which is designed to allow use of the English Out of the box AllTalk only supports single speaker models, but my fork allows the use of models with multiple speakers like I use this when testing and demonstrating portable offline TTS from my M1 MacBook which doesn't have GPU inference via DeepSpeed for While the To make AllTalk use the
If you're on macOS you can install the
You'll need to see what the equivalent is for Windows or Linux if you are using those OS to run AllTalk. When making the request via AllTalk's REST API you need to add a |
@UXVirtual That's interesting! I'll need to have a play at some point and continue my thoughts on how this might be integrated. I've had a few-week-long debate in my head about how to maybe separate the model loaders out from the rest of AllTalk, allowing the potential to load/use theoretically any model. What you've done though is a nice little addition that isn't too heavy on a re-code. I'm going to make a note of this in the Feature requests on the discussion forum... and let my head roll over it a bit more. Give me a bit of time and I'll get back to you at some point! (if that's ok!) Thanks |
Hey @erew123 no problem! The separation of model loaders sounds like a good approach - I look forward to seeing what integrations can be done there :-) |
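For anyone following along, the "separate the model loaders out" idea could be sketched roughly as below. This is only an illustration of a registry pattern, not AllTalk's code: `TTSLoader`, `register_loader`, `get_loader` and `DummyLoader` are all hypothetical names, and the dummy backend just returns bytes so the example runs without any TTS library installed.

```python
from abc import ABC, abstractmethod
from typing import Callable, Dict


class TTSLoader(ABC):
    """Hypothetical interface every backend would implement (not part of AllTalk)."""

    @abstractmethod
    def load(self) -> None:
        """Load model weights or any other resources the backend needs."""

    @abstractmethod
    def synthesize(self, text: str, voice: str) -> bytes:
        """Return audio bytes for the given text and voice."""


# Maps a backend name to a zero-argument factory that builds its loader.
_LOADERS: Dict[str, Callable[[], TTSLoader]] = {}


def register_loader(name: str):
    """Class decorator that records a loader class under a backend name."""
    def decorator(cls):
        _LOADERS[name] = cls
        return cls
    return decorator


def get_loader(name: str) -> TTSLoader:
    """Instantiate and load the backend registered under `name`."""
    try:
        factory = _LOADERS[name]
    except KeyError:
        raise ValueError(
            f"unknown TTS backend {name!r}; available: {sorted(_LOADERS)}"
        ) from None
    loader = factory()
    loader.load()
    return loader


@register_loader("dummy")
class DummyLoader(TTSLoader):
    """Stand-in backend: returns placeholder bytes instead of real audio."""

    def __init__(self) -> None:
        self.loaded = False

    def load(self) -> None:
        self.loaded = True

    def synthesize(self, text: str, voice: str) -> bytes:
        if not self.loaded:
            raise RuntimeError("call load() before synthesize()")
        return f"{voice}:{text}".encode("utf-8")
```

With something like this, the rest of the application would only call `get_loader("<backend name>")` and `synthesize(...)`, so adding a new model would mean adding one new loader class rather than touching the request handling code.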
I was wondering if it's possible to use another model like StyleTTS with AllTalk instead of the default Coqui XTTS model, since there are probably better models out there for voice cloning...