In the Arena-Hard evaluation on the Qwen2.5 release leaderboard, which judge model and which baseline model were used? #1086

Open
4 tasks done
13416157913 opened this issue Nov 18, 2024 · 1 comment

Comments

@13416157913

Model Series

Qwen2.5

What are the models used?

Qwen2.5-72B-Instruct, Qwen2.5-32B-Instruct

What is the scenario where the problem happened?

vllm

Is this a known issue?

  • I have followed the GitHub README.
  • I have checked the Qwen documentation and cannot find an answer there.
  • I have checked the documentation of the related framework and cannot find useful information.
  • I have searched the issues and there is not a similar one.

Information about environment

Hello, for the Arena-Hard evaluation in the Qwen2.5 release leaderboard, which judge model was used? Which baseline model was used?
image

Log output

Hello, for the Arena-Hard evaluation in the Qwen2.5 release leaderboard, which judge model was used? Which baseline model was used?

Description

Hello, for the Arena-Hard evaluation in the Qwen2.5 release leaderboard, which judge model was used? Which baseline model was used?
image


This issue has been automatically marked as inactive due to lack of recent activity. Should you believe it remains unresolved and warrants attention, kindly leave a comment on this thread.
