forked from ShishirPatil/gorilla
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
BFCL May 14th Release (GPT-4o and Gemini) (ShishirPatil#426)
This PR makes 3 models(4 entries) available for inference on BFCL: - gpt-4o-2024-05-13 (Function Calling Mode and Prompting Mode) - gemini-1.5-pro-preview-0514 (Function Calling Mode) - gemini-1.5-flash-preview-0514(Function Calling Mode) You can start the evaluation by running `python openfunctions_evaluation.py --model MODEL_NAME` and get score by running `python ./eval_runner.py --model MODEL_NAME`. For more detail, refer to Readme under the BFCL page. Score changes are reflected in ShishirPatil#428 . This PR also updated different models' pricing: - For Gemini, when prompts are less than 128K tokens, the new Gemini series' prices are lowered by around half(https://ai.google.dev/pricing). All the BFCL test cases are less than 128K tokens. - For Anthropic Models, the prices have decreased for claude-2.1 and Claude-instant-1.2 which have updated accordingly - For Mistral Models, the prices have been halved for Mistral-large and Mistral-Small - For OpenAI Models, we have corrected GPT-3.5-turbo-0125 to the price it should have --------- Co-authored-by: Huanzhi Mao <huanzhimao@gmail.com>
- Loading branch information
1 parent
a3de1d9
commit 4a36ac7
Showing
5 changed files
with
92 additions
and
43 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters