Skip to content

BFCL April 25th Release (New Models) #386

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 14 commits into from
Apr 26, 2024

Conversation

HuanzhiMao
Copy link
Collaborator

@HuanzhiMao HuanzhiMao commented Apr 25, 2024

In this PR, 5 new models are added to the leaderboard:

  • meta-llama/Meta-Llama-3-8B-Instruct
  • meta-llama/Meta-Llama-3-70B-Instruct
  • gemini-1.5-pro-preview-0409
  • command-r-plus
  • command-r-plus-FC

The leaderboard website will be updated shortly to reflect these new entries, in a different PR.


Co-authored-by: Charlie Cheng-Jie Ji charliechengjieji@berkeley.edu
Co-authored-by: Fanjia Yan fanjiayan@berkeley.edu

@HuanzhiMao HuanzhiMao changed the title BFCL April 24th Release (New Models) BFCL April 25th Release (New Models) Apr 25, 2024
@HuanzhiMao HuanzhiMao marked this pull request as ready for review April 25, 2024 18:41
Copy link
Collaborator

@CharlieJCJ CharlieJCJ left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Tested, LGTM

@ShishirPatil ShishirPatil merged commit e6cd512 into ShishirPatil:main Apr 26, 2024
ShishirPatil pushed a commit that referenced this pull request Apr 26, 2024
…se (#387)

- As mentioned in #377, this PR updates the leaderboard to reflect the
score changes resulting from the updates in the executable test category
evaluation pipeline.
- As mentioned in #386, this PR also adds five new models to the
leaderboard.
- It also adds a `last_updated` field to the leaderboard. 

This PR **DOES** change the leaderboard score.

---------

Co-authored-by: Charlie Cheng-Jie Ji <charliechengjieji@berkeley.edu>
@HuanzhiMao HuanzhiMao deleted the April24 branch April 27, 2024 01:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants