-
Notifications
You must be signed in to change notification settings - Fork 444
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请问如何计算qwen2-vl的loss? #2403
Comments
please use ms-swift==2.5.2 or use transformers<4.46 |
您好!我用了transformers=4.45.2和ms-swift==2.5.2,依然出现loss特别大的问题 |
用swift sft跑呗, swift只会对response, eos, history中的response部分计算损失 |
谢谢!我想只统计loss,纯推理,不对模型做sft,请问该怎么操作呢 |
我把代码改了下,已解决问题。感谢回复! |
请问能分享一下是怎么修改的吗 谢谢! |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
我设计这个函数计算loss,发现loss值特别大(大概在16-18左右),请问应该怎么修正成正确的计算方式呢?
The text was updated successfully, but these errors were encountered: