Changing f0_method to improve results? #710
Replies: 3 comments 9 replies
-
I am surprised to see that you are able to change F0 prediction method prior to the training session with online Colab. I can only change it on the GUI when doing inference. I have tried all F0 methods for my inference. And the best F0 method in my case is "parselmouth"! It sounds more natural than other methods. And the worst F0 method in my case was "crepe" or "crepe-tiny", which causes the tone to go up and down in an unnatural way. To improve missing words, the Silence threshold on the GUI should be set to -60.0, which makes the threshold the lowest. The default value for the Silence threshold is so high that missing words are inevitable. |
Beta Was this translation helpful? Give feedback.
-
I had the same issues.
I am also newbie here, but managed to solve that.. now more interested to see how to remove back vocals from UVR-processed input. |
Beta Was this translation helpful? Give feedback.
-
Hi all,
New to this, but this has helped me learn so much!
I've been training a model from scratch, and so far its going pretty well. The timbre is really great, and the model is starting to capture the singers voice quite well. But there are two issues I'm struggling with, and hoping someone here can help:
To improve pitch:
I'm currently on colab and using the f0_method "dio". Can I switch to using Crepe to improve the pitch? Will that add to the model or will I have to start again? I'm about 4.5k epochs through and its kinda slow so dont really want to go back to the start.
Or does anyone have any other ideas that could help?
To improve the missed words - is there a way to improve this tracking? Not sure what I'm meant to be trying to improve for this.
I'm training on vocal stems from a studio album. So the training source is great quality. And I have a lot of it.
I've attached some of the tensor graphs. As far as I can see some look like they're converging nicely and others I'm not sure if they're right or not.
Thanks in advance!
Beta Was this translation helpful? Give feedback.
All reactions