Replies: 4 comments 21 replies
-
Hmm, why do you say that? The stack_factor we use is 8, not 10. |
Beta Was this translation helpful? Give feedback.
-
What is the approximate final loss ? thanks @farzadab @zqhuang211 |
Beta Was this translation helpful? Give feedback.
-
Depending on the training data, choice of whisper encoder, and number of epochs, the training loss can be anywhere 0.1 and 0.05. We haven’t seen cases where training loss drops below 0.1 and then goes up significantly. |
Beta Was this translation helpful? Give feedback.
-
@farzadab I'm excited that you've release the latest Ultravox v0.5 weights. I was wondering if you could share the |
Beta Was this translation helpful? Give feedback.
-
the weights you share on huggingface need to change the stack_factor to 10 https://huggingface.co/fixie-ai/ultravox-v0_4_1-llama-3_1-8b/blob/main/ultravox_config.py#L103
Beta Was this translation helpful? Give feedback.
All reactions