-
By doing this you would be able to fit twice as much data in VRAM. In fact, I wonder whether the result might be even better, since sampling frequency regions that don't contain much useful information might be counterproductive. Also, setting mel_fmax to 11025 Hz would increase the spectral resolution, because the same number of mel channels would cover a narrower band. You might even be able to decrease n_mel_channels (if it helps training).
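Here is a rough sketch of the arithmetic behind this, not a definitive implementation. It assumes the change is from 44100 Hz to 22050 Hz audio (the original rate is not stated here), uses the HTK-style mel formula, and picks 80 mel channels purely for illustration. The helper names are hypothetical and not taken from the repo.

```python
import math

# Assumed values for illustration only; the thread does not state them.
HIGH_SR = 44100       # assumed original sample rate (Hz)
LOW_SR = 22050        # assumed downsampled rate (Hz)
N_MEL_CHANNELS = 80   # hypothetical n_mel_channels value


def hz_to_mel(f_hz: float) -> float:
    """HTK-style Hz-to-mel conversion (some libraries use a different variant)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def samples_per_clip(sample_rate: int, seconds: float) -> int:
    """Number of raw audio samples needed to store a clip of the given length."""
    return int(sample_rate * seconds)


def mel_width_per_channel(fmax_hz: float, n_channels: int, fmin_hz: float = 0.0) -> float:
    """Average width in mels covered by one channel; smaller means finer resolution."""
    return (hz_to_mel(fmax_hz) - hz_to_mel(fmin_hz)) / n_channels


# Halving the sample rate halves the samples per clip, so roughly twice as
# many clips of raw audio fit in the same amount of memory.
clip_ratio = samples_per_clip(HIGH_SR, 10.0) / samples_per_clip(LOW_SR, 10.0)

# The highest frequency representable at the lower rate (Nyquist limit),
# which is where the 11025 Hz value for mel_fmax comes from.
nyquist_low = LOW_SR / 2

# Same channel count over a narrower band gives narrower (finer) channels.
width_full = mel_width_per_channel(HIGH_SR / 2, N_MEL_CHANNELS)
width_half = mel_width_per_channel(nyquist_low, N_MEL_CHANNELS)

if __name__ == "__main__":
    print("clips that fit, relative to before:", clip_ratio)
    print("Nyquist at the lower rate (Hz):", nyquist_low)
    print("mels per channel, full band:", width_full)
    print("mels per channel, half band:", width_half)
```

Only the raw-audio footprint is modelled here. Actual VRAM use also depends on the model, batch size, and hop length, so the factor of two is an upper-level estimate.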
Like in: https://huggingface.co/blog/collaborative-training As I understand it, it is based on hivemind. If you can put such a solution in place, I would be happy to give GPU time. Anyway, I hope your call for help will be heard. Best regards
-
How long would it take to train initial models on an RTX 3090?
-
What is needed for QuickVC training? Provided we have a dataset available, I could go ahead and do initial training on my 4090 (or rent multiple 4090s for some hours). I'd have to look deeper into this before trying to maintain the repo, but lending a hand in my (albeit limited) free time is something I could do. Certainly not alone, but I'm up for discussion, and it looks like I'm not alone here either.
-
I would like to help, but I don't know much programming. I am still in school and won't finish for another 6 years :(