Hello, for the Arena-Hard evaluation in the Qwen2.5 release leaderboard, which judge model was used, and which baseline model was used? #1086

13416157913 · 2024-11-18T02:41:52Z

Model Series

Qwen2.5

What are the models used?

Qwen2.5-72B-Instruct, Qwen2.5-32B-Instruct

What is the scenario where the problem happened?

vllm

Is this a known issue?

I have followed the GitHub README.
I have checked the Qwen documentation and cannot find an answer there.
I have checked the documentation of the related framework and cannot find useful information.
I have searched the issues and there is not a similar one.

Information about environment

Hello, for the Arena-Hard evaluation in the Qwen2.5 release leaderboard, which judge model was used, and which baseline model was used?

Log output

Hello, for the Arena-Hard evaluation in the Qwen2.5 release leaderboard, which judge model was used, and which baseline model was used?

Description

Hello, for the Arena-Hard evaluation in the Qwen2.5 release leaderboard, which judge model was used, and which baseline model was used?

github-actions · 2024-12-20T08:00:35Z

This issue has been automatically marked as inactive due to lack of recent activity. Should you believe it remains unresolved and warrants attention, kindly leave a comment on this thread.

jklj077 assigned hzhwcmhf Nov 19, 2024

github-actions bot added the inactive label Dec 20, 2024
