Skip to content

Latest commit

 

History

History

miniwob++

MiniWoB++

We take the framework of RCI Agent to evaluate the performance of our model on MiniWoB++.

Installation

pip install -r requirements.txt
pip install -e computergym

Evaluation

GPT

bash eval-gpt3.5.sh

or

bash eval-gpt4.sh

You need to provide your OpenAI API key as an environment variable OPENAI_API_KEY.

TGI (Text Generation Inference)

bash eval-tgi.sh

You need to modify eval-tgi.sh to provide your TGI controller addresses in an comma-separated array.