Add more VITS voices via piper #215
Comments
epub2tts is already extremely stable in features and functionality, at least for me, but +1 for this. |
Yes, on my 24-thread 3900X, it takes around 10 hours to encode a 5-hour book with xtts (I wish it could run on OpenCL or oneAPI on my A770). VITS is decent (epub2tts already has 335 & 307 VITS models) and quick. |
This seems like it would be worth adding, I will take a look at https://github.com/rhasspy/piper/?tab=readme-ov-file#running-in-python and see. It will probably be a while (a few weeks at least) before I've got time again for this, but it looks like it might not be very difficult. |
This is a problem that needs to be resolved before incorporating piper: rhasspy/piper#395. I'm able to install on Linux without trouble, but my primary dev environment is macOS, and I would not want to introduce a dependency that prevents epub2tts from being installed on a Mac. I'll keep an eye on this while I poke at a test integration branch. |
There's a first pass at this, still needs a lot of work around model name and speaker, and I have not tested anything with other languages, etc. BUT the branch https://github.com/aedocw/epub2tts/tree/add-piper has a very simple implementation that seems to work in a minimal sense. Adds |
Awesome, this is great! Just FYI, I got the following error using the branch with Ubuntu/PopOS:
In order to fix, I manually pulled in the correct piper model via:
and then added the folder:
Obviously this was a quick fix and should be done differently, but it worked as a quick-and-dirty solution. Piper sounds great, and I'll probably use this branch as a starting point with a couple of Creative Commons books (more of a proof of concept than anything, comparing them all). Piper is very quick compared to the others but sounds a bit more robotic, which is fine with me. Here is the sample: https://github.com/michaelachrisco/epub2tts/blob/add-piper/sample-piper.m4b |
Tested on a Debian testing VM with pipx. Same as Michael: it doesn't download the model, but it works.
|
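For anyone else hitting the missing-model problem described above, here is a minimal sketch of a pre-flight check, not what the add-piper branch actually does. It assumes a Piper voice is stored as an `.onnx` file with a matching `.onnx.json` config beside it, and that the `piper` executable is on PATH and reads text from stdin (as in the piper README). The function names and the `en_US-lessac-medium` voice name are only illustrative.

```python
import subprocess
from pathlib import Path


def missing_voice_files(model_dir, voice_name):
    """Return the Piper voice files that are not present in model_dir.

    Assumes a voice is an ONNX model plus a JSON config stored beside it.
    """
    model_dir = Path(model_dir)
    expected = [f"{voice_name}.onnx", f"{voice_name}.onnx.json"]
    return [name for name in expected if not (model_dir / name).is_file()]


def build_piper_command(model_path, output_path):
    """Build the argument list for the piper CLI (text is read from stdin)."""
    return ["piper", "--model", str(model_path), "--output_file", str(output_path)]


def synthesize(text, model_dir, voice_name, output_path):
    """Fail early with a clear message if the voice is missing, else run piper."""
    missing = missing_voice_files(model_dir, voice_name)
    if missing:
        raise FileNotFoundError(
            f"Piper voice files missing from {model_dir}: {', '.join(missing)}"
        )
    model_path = Path(model_dir) / f"{voice_name}.onnx"
    # Requires the piper executable to be installed and on PATH.
    subprocess.run(
        build_piper_command(model_path, output_path),
        input=text.encode("utf-8"),
        check=True,
    )
```

Calling `synthesize("Hello", "models", "en_US-lessac-medium", "hello.wav")` would then either name the files that still need to be downloaded or hand the text to piper.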
Thanks to both of you for sharing this. Once the issues with installing on Apple silicon are resolved, I'll do some more work to clean this up and make it usable. In the meantime I suggest you check out the |
Hi again, |
It looks like there are still open issues around installing on mac (rhasspy/piper#395). Feel free though to try it out and see if the issues are resolved now on mac. I don't think I'll have much time to play with this over the next few weeks (getting busy these days!) but update here if it is actually working OK on mac, then I could try to clean up that piper branch and add it to main. |
Hi, |
I know I previously mentioned edge-tts, which is cloud-based, fast and free, but under the GPL. I have recently been trying out https://github.com/rhasspy/piper/, which uses the VITS model and is under the MIT license.