Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add GPQA scenario #3017

Closed
yifanmai opened this issue Sep 24, 2024 · 0 comments · Fixed by #3068
Closed

Add GPQA scenario #3017

yifanmai opened this issue Sep 24, 2024 · 0 comments · Fixed by #3068
Assignees
Labels
additions New models or scenarios good first issue Good for newcomers scenarios

Comments

@yifanmai
Copy link
Collaborator

yifanmai commented Sep 24, 2024

Paper: https://arxiv.org/abs/2311.12022
It is easiest to use the Hugging Face version: https://huggingface.co/datasets/Idavidrein/gpqa

Should be similar to original MMLU: see mmlu_scenario.py for the original MMLU and air_bench_scenario.py for how to use load_dataset() with Hugging Face datasets.

Edit: Also look at simple_scenarios.py and test_simple_scenarios.py for an example of MCQA.

@yifanmai yifanmai added good first issue Good for newcomers scenarios additions New models or scenarios labels Sep 24, 2024
@liamjxu liamjxu self-assigned this Oct 3, 2024
@liamjxu liamjxu linked a pull request Oct 17, 2024 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
additions New models or scenarios good first issue Good for newcomers scenarios
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants