Jeopardy Example Script #1168

CRD716 · 2023-04-25T03:00:02Z

This is pretty much just a straight port of aigoopy/llm-jeopardy/
Leaving as a draft since it's still missing a lot of features, and I will continue to work on it to make it more usable.

CRD716 · 2023-04-27T05:26:55Z

All that's left is the readme.

SlyEcho · 2023-04-27T07:25:28Z

I don't think this is correct:

1,The Oscars,Who is John Williams?,Which actor Born in 1932 was the son of a percussionist in the CBS radio orchestra has been nominated for 53 Oscars?

The question should be in the answer format, like so:

1,The Oscars,Who is John Williams?,Born in 1932 & the son of a percussionist in the CBS radio orchestra, he's been nominated for 53 Oscars

CRD716 · 2023-04-27T14:07:21Z

I don't think this is correct:


1,The Oscars,Who is John Williams?,Which actor Born in 1932 was the son of a percussionist in the CBS radio orchestra has been nominated for 53 Oscars?

The question should be in the answer format, like so:


1,The Oscars,Who is John Williams?,Born in 1932 & the son of a percussionist in the CBS radio orchestra, he's been nominated for 53 Oscars

I haven't changed the prompt question from the original repository. The point is to see if it can bring up facts, not if it can play Jeopardy as it is on the show.

SlyEcho · 2023-04-27T14:13:48Z

I haven't changed the prompt question from the original repository. The point is to see if it can bring up facts, not if it can play Jeopardy as it is on the show.

Then the answer should be "John Williams" not "Who is John Williams?"

CRD716 · 2023-04-27T14:20:01Z

I haven't changed the prompt question from the original repository. The point is to see if it can bring up facts, not if it can play Jeopardy as it is on the show.

Then the answer should be "John Williams" not "Who is John Williams?"

See https://github.com/aigoopy/llm-jeopardy/blob/main/qasheet.ods and #1163, I would assume we're trying to use the same data as everyone else, so I'm not sure if this issue is supposed to be an implementation of aigoopy's jeopardy or just something with a similar style. @ggerganov which would you prefer?

SlyEcho · 2023-04-27T14:31:55Z

The columns "Original Answer" and "Original Correct Question" in the spreadsheet is the data they used (what is the source? maybe https://j-archive.com/). Then they created "Model Prompt" where it has been turned into a question, and for all the models, they are also answering in an answer format, explained in Reddit.

But anyway, I think this test should be either question-answer or jeopardy style answer-question, but not a mix.

If we don't change the data from the original, we could possibly evaluate a much larger dataset without having to manually edit questions.

ggerganov

I agree with @SlyEcho - we should keep either format and not mix it like they've done in the reference repo.

For now, we can merge it like this so we have the evaluation framework available, and later we can update the questions / answers.

CRD716 added 8 commits April 24, 2023 19:12

Basic Setup

ccf9002

Prevent Results.txt from coming up

7fd88f4

Prefixes, Line separators, etc

e82439a

editorcheck

e3159c0

introduction to give more consistent results

9143cce

Merge branch 'ggerganov:master' into master

b73c192

Basic graph thing

4277270

Grading, ready for testing!

024e31a

Y'all ready to get funky?

d083567

CRD716 marked this pull request as ready for review April 27, 2023 05:52

CRD716 mentioned this pull request Apr 27, 2023

llama.cpp + Final Jeopardy #1163

Closed

CRD716 added 2 commits April 27, 2023 01:16

fix column removal stuff

c74ceed

missed a few

da58bab

ggerganov approved these changes Apr 28, 2023

View reviewed changes

ggerganov merged commit 5fba3c0 into ggml-org:master Apr 28, 2023

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jeopardy Example Script #1168

Jeopardy Example Script #1168

CRD716 commented Apr 25, 2023

CRD716 commented Apr 27, 2023

SlyEcho commented Apr 27, 2023

CRD716 commented Apr 27, 2023

SlyEcho commented Apr 27, 2023

CRD716 commented Apr 27, 2023

SlyEcho commented Apr 27, 2023

ggerganov left a comment

Jeopardy Example Script #1168

Jeopardy Example Script #1168

Conversation

CRD716 commented Apr 25, 2023

CRD716 commented Apr 27, 2023

SlyEcho commented Apr 27, 2023

CRD716 commented Apr 27, 2023

SlyEcho commented Apr 27, 2023

CRD716 commented Apr 27, 2023

SlyEcho commented Apr 27, 2023

ggerganov left a comment

Choose a reason for hiding this comment