how can i reproduce the results on truthfulqa? #7

SuperChanS · 2024-07-28T15:05:08Z

I notice that operating truthfulqa.sh requires "gpt_true_model_name" and "gpt_info_model_name". But it seems the original model is unavailable now.

alisawuffles · 2024-07-29T22:55:11Z

Yes, this part of the evaluation is unfortunately no longer reproducable due to OpenAI's deprecation of GPT-3-based models. The allenai/open-instruct evaluation frameowrk has switched to finetuned judge models based on Llama2 instead! Please see their evaluation script here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how can i reproduce the results on truthfulqa? #7

how can i reproduce the results on truthfulqa? #7

SuperChanS commented Jul 28, 2024

alisawuffles commented Jul 29, 2024

how can i reproduce the results on truthfulqa? #7

how can i reproduce the results on truthfulqa? #7

Comments

SuperChanS commented Jul 28, 2024

alisawuffles commented Jul 29, 2024