Skip to content

Actions: stanford-crfm/helm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
4,235 workflow runs
4,235 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

GPQA Few shot CoT
Test #7501: Pull request #3096 synchronize by liamjxu
November 1, 2024 04:56 12m 12s jialiang/gpqa_cot_scenario
November 1, 2024 04:56 12m 12s
Adding the IFEval scenario
Scenario tests #61: Pull request #3122 synchronize by liamjxu
November 1, 2024 04:54 7m 47s jialiang/ifeval
November 1, 2024 04:54 7m 47s
Adding the IFEval scenario
Test #7500: Pull request #3122 synchronize by liamjxu
November 1, 2024 04:54 7m 44s jialiang/ifeval
November 1, 2024 04:54 7m 44s
Build frontend (#3115)
Test #7499: Commit 4b01a15 pushed by yifanmai
November 1, 2024 04:20 13m 21s main
November 1, 2024 04:20 13m 21s
Show "preview" badge if current version is not released yet (#3126)
Build Frontend #141: Commit 2ae1381 pushed by yifanmai
November 1, 2024 04:18 53s main
November 1, 2024 04:18 53s
Show "preview" badge if current version is not released yet (#3126)
Frontend #644: Commit 2ae1381 pushed by yifanmai
November 1, 2024 04:18 59s main
November 1, 2024 04:18 59s
Skip models column when summarizing mean
Test #7498: Pull request #3127 synchronize by yifanmai
November 1, 2024 04:13 13m 1s yifanmai/fix-summarize-mean
November 1, 2024 04:13 13m 1s
Skip models column when summarizing mean
Test #7497: Pull request #3127 opened by yifanmai
November 1, 2024 04:07 13m 24s yifanmai/fix-summarize-mean
November 1, 2024 04:07 13m 24s
Add functionality for linking directly to instances in Predictions pa…
Build Frontend #140: Commit c1750e7 pushed by yifanmai
November 1, 2024 03:49 48s main
November 1, 2024 03:49 48s
Add functionality for linking directly to instances in Predictions pa…
Frontend #642: Commit c1750e7 pushed by yifanmai
November 1, 2024 03:49 1m 2s main
November 1, 2024 03:49 1m 2s
Comments addressed for MMLU-PRO Non COT
Scenario tests #60: Pull request #3125 reopened by siyagoel
October 31, 2024 23:59 17m 18s siyagoel/mmluprofinal
October 31, 2024 23:59 17m 18s
Comments addressed for MMLU-PRO Non COT
Test #7496: Pull request #3125 reopened by siyagoel
October 31, 2024 23:59 13m 4s siyagoel/mmluprofinal
October 31, 2024 23:59 13m 4s
Comments addressed for MMLU-PRO Non COT
Test #7495: Pull request #3125 synchronize by siyagoel
October 31, 2024 23:57 13m 27s siyagoel/mmluprofinal
October 31, 2024 23:57 13m 27s
Comments addressed for MMLU-PRO Non COT
Scenario tests #59: Pull request #3125 synchronize by siyagoel
October 31, 2024 23:57 11m 23s siyagoel/mmluprofinal
October 31, 2024 23:57 11m 23s
Comments addressed for MMLU-PRO Non COT
Scenario tests #58: Pull request #3125 synchronize by siyagoel
October 31, 2024 23:46 8m 25s siyagoel/mmluprofinal
October 31, 2024 23:46 8m 25s
Comments addressed for MMLU-PRO Non COT
Test #7494: Pull request #3125 synchronize by siyagoel
October 31, 2024 23:46 12m 37s siyagoel/mmluprofinal
October 31, 2024 23:46 12m 37s
GPQA Few-shot CoT, spec part
Test #7493: Pull request #3097 synchronize by yifanmai
October 31, 2024 23:22 13m 12s jialiang/gpqa_cot_run_spec
October 31, 2024 23:22 13m 12s
Changed MMLU Pro for Non-COT Version (#3108)
Test #7492: Commit b92b93f pushed by siyagoel
October 31, 2024 21:35 12m 42s main
October 31, 2024 21:35 12m 42s
Changed MMLU Pro for Non-COT Version (#3108)
Scenario tests #57: Commit b92b93f pushed by siyagoel
October 31, 2024 21:35 8m 16s main
October 31, 2024 21:35 8m 16s
Add stop sequence support to MistralClient (#3120)
Test #7491: Commit 712ac23 pushed by yifanmai
October 31, 2024 20:58 12m 29s main
October 31, 2024 20:58 12m 29s
Treat missing AI21 message content as empty string (#3123)
Test #7490: Commit 7c3c6fb pushed by yifanmai
October 31, 2024 20:58 12m 38s main
October 31, 2024 20:58 12m 38s
Pin revision in many invocations of Hugging Face load_datasets() (#3124)
Test #7489: Commit 5e56027 pushed by yifanmai
October 31, 2024 20:58 13m 34s main
October 31, 2024 20:58 13m 34s