Request the subject-wise breakdown MMLU-Pro scores #18

Wyyyb · 2024-12-01T04:37:50Z

Hi Hunyuan Model Team,

I am one of the authors of MMLU-Pro, and I've noticed with great interest that the Hunyuan model has achieved impressive results on our benchmark. To represent Hunyuan's capabilities in more detail on our leaderboard, I would like to request the subject-wise breakdown of your model's MMLU-Pro scores. We plan to update our leaderboard (https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro) to include these detailed metrics.

Thank you for your time and consideration. I look forward to hearing from you.
