Add vLLM backend for open weight model evaluation #34

lewtun · 2024-09-04T12:31:25Z

This PR adds support for a vLLM backend so that Hugging Face models like Salesforce/xLAM-v0.1-r can be evaluated. I've shown how this works for WebShop and am happy to extend it to the other benchmarks if the API looks good to you.

salesforce-cla · 2024-09-04T12:31:30Z

Thanks for the contribution! Before we can merge this, we need @lewtun to sign the Salesforce Inc. Contributor License Agreement.

lewtun · 2024-09-04T12:32:44Z

agentlite/llm/agent_llms.py

+class VllmChatModel(BaseLLM):
+    def __init__(self, llm_config: LLMConfig):
+        super().__init__(llm_config)
+        self.client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")


This is the default endpoint in vLLM, but I could read it from an environment variable if preferred.
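A minimal sketch of what the environment-variable option could look like. The variable names `VLLM_BASE_URL` and `VLLM_API_KEY` and the helper `resolve_vllm_endpoint` are assumptions made for illustration; they are not part of this PR or of vLLM itself. The defaults fall back to the values hard-coded in the diff above.

```python
import os

# Address used in this PR for vLLM's OpenAI-compatible server.
DEFAULT_VLLM_BASE_URL = "http://localhost:8000/v1"


def resolve_vllm_endpoint(env=None):
    """Return (base_url, api_key) for the vLLM server.

    VLLM_BASE_URL and VLLM_API_KEY are hypothetical names chosen for this
    sketch; neither vLLM nor this PR defines them.
    """
    env = os.environ if env is None else env
    base_url = env.get("VLLM_BASE_URL", DEFAULT_VLLM_BASE_URL)
    api_key = env.get("VLLM_API_KEY", "EMPTY")
    return base_url, api_key


base_url, api_key = resolve_vllm_endpoint()
# Inside VllmChatModel.__init__ the client line would then become:
#   self.client = OpenAI(base_url=base_url, api_key=api_key)
```

With this shape, pointing the evaluation at a remote server would only require exporting the variable before running the benchmark script, and existing local setups would keep working unchanged.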

lewtun · 2024-09-04T12:33:38Z

benchmark/webshop/evaluate_webshop.py

@@ -88,7 +88,7 @@ def get_runned_ids(file_path):
    args = parser.parse_args()
    rewards = []
    all_task_ids = list(range(0, 251))
-    REWARD_LOG_FILE = f"{args.llm}_{args.agent_arch}_results_webshop.csv"
+    REWARD_LOG_FILE = f"{args.llm.replace('/', '_')}_{args.agent_arch}_results_webshop.csv"


This is needed because Hugging Face model repos are named in the form {org}/{repo_name}, and the `/` is treated as a path separator when the results file is written to disk.
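To illustrate the problem, here is a small sketch comparing the two file names. The helper `reward_log_path` and the agent architecture value `"react"` are placeholders for this example, not names taken from the benchmark script.

```python
from pathlib import Path


def reward_log_path(llm: str, agent_arch: str) -> Path:
    """Build the results CSV name, flattening any '/' in the model ID."""
    safe_llm = llm.replace("/", "_")
    return Path(f"{safe_llm}_{agent_arch}_results_webshop.csv")


# Without the replace, the org prefix becomes a directory component, so
# opening the file for writing fails unless that directory already exists.
unsafe = Path("Salesforce/xLAM-v0.1-r_react_results_webshop.csv")
safe = reward_log_path("Salesforce/xLAM-v0.1-r", "react")

print(unsafe.parts)  # two components: a directory and a file name
print(safe.parts)    # a single file name in the current directory
```

Names without a slash (for example an OpenAI model name) pass through `replace` unchanged, so existing result files keep their names.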

Add vLLM backend for open weight model evaluation

70efd32

salesforce-cla bot added the cla:missing label Sep 4, 2024

salesforce-cla bot added cla:signed and removed cla:missing labels Sep 4, 2024
