We take the framework of RCI Agent to evaluate the performance of our model on MiniWoB++.
pip install -r requirements.txt
pip install -e computergym
bash eval-gpt3.5.sh
or
bash eval-gpt4.sh
You need to provide your OpenAI API key as an environment variable OPENAI_API_KEY
.
bash eval-tgi.sh
You need to modify eval-tgi.sh
to provide your TGI controller addresses in an comma-separated array.