Jeopardy Example Script #1168
All that's left is the readme.
I don't think this is correct:
The question should be in the answer format, like so:
I haven't changed the prompt question from the original repository. The point is to see whether the model can recall facts, not whether it can play Jeopardy the way it is played on the show.
Then the answer should be "John Williams", not "Who is John Williams?"
See https://github.com/aigoopy/llm-jeopardy/blob/main/qasheet.ods and #1163. I would assume we're trying to use the same data as everyone else, so I'm not sure whether this issue is supposed to be an implementation of aigoopy's jeopardy or just something with a similar style. @ggerganov, which would you prefer?
The columns "Original Answer" and "Original Correct Question" in the spreadsheet are the data they used (what is the source? maybe https://j-archive.com/). They then created "Model Prompt", where each clue has been turned into a question, and all the models also respond in an answer format, as explained on Reddit. But anyway, I think this test should be either question-answer or Jeopardy-style answer-question, but not a mix. If we don't change the data from the original, we could possibly evaluate a much larger dataset without having to manually edit questions.
I agree with @SlyEcho - we should stick to one format and not mix the two as they've done in the reference repo.
For now, we can merge it like this so we have the evaluation framework available, and later we can update the questions / answers.
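Until the questions and answers are updated, one way to score mixed-format data is to normalize both sides before comparing. The following is only a rough sketch, not part of this PR: `normalize_answer` and `answers_match` are hypothetical helper names, and the list of Jeopardy-style prefixes is an assumption about how the expected answers are phrased.

```python
import re

# Assumption: Jeopardy-style answers start with an interrogative plus a
# form of "to be", e.g. "Who is ...?" or "What are ...?".
_JEOPARDY_PREFIX = re.compile(
    r"^\s*(who|what|where|when)\s+(is|are|was|were)\s+",
    re.IGNORECASE,
)


def normalize_answer(text: str) -> str:
    """Strip Jeopardy phrasing and trailing punctuation, then lowercase."""
    stripped = _JEOPARDY_PREFIX.sub("", text.strip())
    return stripped.rstrip("?.! ").strip().lower()


def answers_match(model_output: str, expected: str) -> bool:
    """Compare two answers regardless of which format each one uses."""
    return normalize_answer(model_output) == normalize_answer(expected)


if __name__ == "__main__":
    # A plain answer and a Jeopardy-style answer compare as equal.
    print(answers_match("John Williams", "Who is John Williams?"))
```

Exact string comparison after normalization is deliberately strict; a real evaluation would likely need fuzzier matching for model output that contains extra words.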
Closes #1163
This is pretty much just a straight port of aigoopy/llm-jeopardy.
Leaving as a draft since it's still missing a lot of features, and I will continue to work on it to make it more usable.