Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update URL and citation for previous submission #15

Merged
merged 3 commits into from
Mar 5, 2025

Conversation

snova-nidhih
Copy link
Contributor

Only updating empty fields of metadata file

@snova-nidhih snova-nidhih requested a deployment to leaderboard-eval-run March 4, 2025 21:01 — with GitHub Actions Waiting
@snova-nidhih snova-nidhih deployed to leaderboard-eval-run March 4, 2025 21:02 — with GitHub Actions Active
Copy link

github-actions bot commented Mar 5, 2025

Eval run succeeded! Link to run: link

Here are the results of the submission(s):

Llama3.3 70b distilled DS - 32k ss

SambaNova Systems, 2025

Closed Book

  • Loose: 0.515
  • Strict: 0.11
  • ROUGE-1: 0.444
  • ROUGE-2: 0.244
  • ROUGE-L: 0.369
  • BLEURT: 0.495
  • GPT Judge: 0.2

Open Book

  • Loose: 0.559
  • Strict: 0.126
  • ROUGE-1: 0.503
  • ROUGE-2: 0.289
  • ROUGE-L: 0.429
  • BLEURT: 0.53
  • GPT Judge: 0.272

Evidence Provided

  • Loose: 0.659
  • Strict: 0.18
  • ROUGE-1: 0.585
  • ROUGE-2: 0.353
  • ROUGE-L: 0.495
  • BLEURT: 0.582
  • GPT Judge: 0.41

If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to FanOutQA!

@zhudotexe zhudotexe merged commit e475243 into zhudotexe:main Mar 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants