You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Yes, this part of the evaluation is unfortunately no longer reproducable due to OpenAI's deprecation of GPT-3-based models. The allenai/open-instruct evaluation frameowrk has switched to finetuned judge models based on Llama2 instead! Please see their evaluation script here.
I notice that operating truthfulqa.sh requires "gpt_true_model_name" and "gpt_info_model_name". But it seems the original model is unavailable now.
The text was updated successfully, but these errors were encountered: