Some intermediate samples of training vits-2 (non-native EN | customer service phone calls dataset) #18
-
You can add more samples as you progress, @athenasaurav. A comparison between another model's samples and the vits-2 samples would be great later on as well! Thanks.
-
Sure, @p0p4k.
-
Thanks for the great repo, awesome work. Figured I should at least do you the courtesy of sharing some samples (roughly 110k iterations). https://drive.google.com/drive/folders/13H5qeZ6U-jJVas76ln4TnukT3q1CK6eb?usp=sharing (If the content is a bit strange, it's because it was all generated via LLaMa 2 in a little demo project I put together.) Chinese will follow soon. It's still a little unstable in parts (and it does struggle with short phrases); I'm not sure whether that's an artifact of the model, the dataset, or the parameters I was training with.
-
Hi @nmfisher, the English sample voices sound very good. Do you have any sample audio for Chinese?
-
Thanks for sharing, @athenasaurav! (#17)