Hey, thank you for your great work. I wanted to know how I can run evaluation on the open-source AgentInstruct data using the AgentBench repo. I would be really grateful if you could share the files defined here for running AgentInstruct on AgentBench. I am asking because I want to evaluate some models on the AgentInstruct data.
AgentInstruct is a curated dataset used for fine-tuning models rather than for evaluation. If you need to evaluate a model, you can use AgentLM, which is fine-tuned on this data, or a model you have fine-tuned yourself.
For the latest evaluation code, please visit AgentBench. You can configure the fine-tuned model for evaluation in the ./configs directory.
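If it helps to see what is available before editing anything, here is a minimal sketch that lists the config files under a directory. It is not part of AgentBench: the helper name `list_config_files` is hypothetical, and it assumes the configs are YAML files (`.yaml` or `.yml`) somewhere under `./configs`, so adjust it to whatever layout your checkout has.

```python
from pathlib import Path


def list_config_files(config_dir="./configs"):
    """Return sorted relative paths of the YAML files found under config_dir.

    Hypothetical helper for illustration only. It assumes config files
    use a .yaml or .yml extension; it does not read or validate them.
    """
    root = Path(config_dir)
    if not root.is_dir():
        # Nothing to list if the directory does not exist.
        return []
    files = [
        p for p in root.rglob("*")
        if p.is_file() and p.suffix in {".yaml", ".yml"}
    ]
    return sorted(str(p.relative_to(root)) for p in files)


if __name__ == "__main__":
    # Run from the repository root so that ./configs resolves correctly.
    for name in list_config_files():
        print(name)
```

Running this from the repository root prints one relative path per line, which can be a quick way to find the file where a fine-tuned model would be registered.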
If you have any more questions or need additional help, please don't hesitate to ask.