Skip to content

Actions: stanford-crfm/helm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
3,831 workflow runs
3,831 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add PARADE audio scenario (#3424)
Test #8157: Commit 5a7ca34 pushed by ImKeTT
March 12, 2025 05:58 9m 7s main
March 12, 2025 05:58 9m 7s
Fix multilingual librispeech audio scenario (#3423)
Test #8155: Commit 174cabd pushed by teetone
March 12, 2025 02:08 10m 0s main
March 12, 2025 02:08 10m 0s
Make Azure OpenAI deployment name configurable (#3421)
Test #8153: Commit 1b05df3 pushed by yifanmai
March 11, 2025 18:21 12m 16s main
March 11, 2025 18:21 12m 16s
3323 adaptive evaluation
Test #8152: Pull request #3397 synchronize by yuhengtu
March 11, 2025 15:53 Action required sangttruong:3323-adaptive_evaluation
March 11, 2025 15:53 Action required
Scenario tests
Scenario tests #306: Scheduled
March 11, 2025 15:36 9m 19s main
March 11, 2025 15:36 9m 19s
Watson x
Test #8151: Pull request #3422 opened by vz-ibm
March 11, 2025 13:29 6m 48s watsonX
March 11, 2025 13:29 6m 48s
Add LibriSpeech and FLEURS gender fairness audio scenarios (#3418)
Test #8150: Commit 36930e8 pushed by teetone
March 11, 2025 01:30 9m 55s main
March 11, 2025 01:30 9m 55s
Add LibriSpeech and FLEURS gender fairness audio scenarios
Test #8149: Pull request #3418 synchronize by ImKeTT
March 11, 2025 00:59 9m 57s ImKeTT:audio_judge
March 11, 2025 00:59 9m 57s
Make GPQA CoT metric pattern matching less strict (#3420)
Test #8147: Commit b1bc37e pushed by yifanmai
March 10, 2025 23:05 10m 6s main
March 10, 2025 23:05 10m 6s
Make GPQA CoT metric pattern matching less strict
Test #8146: Pull request #3420 opened by yifanmai
March 10, 2025 22:43 9m 38s yifanmai/fix-mcqa-cot
March 10, 2025 22:43 9m 38s
Add request response format JSON schema support (#3415)
Test #8145: Commit 84d37d7 pushed by yifanmai
March 10, 2025 22:15 9m 28s main
March 10, 2025 22:15 9m 28s
Add LibriSpeech and FLEURS gender fairness audio scenarios
Test #8144: Pull request #3418 opened by ImKeTT
March 10, 2025 16:47 9m 37s ImKeTT:audio_judge
March 10, 2025 16:47 9m 37s
Scenario tests
Scenario tests #305: Scheduled
March 10, 2025 15:35 8m 57s main
March 10, 2025 15:35 8m 57s
Add GPT4 evaluator for open-ended audio scenarios (#3417)
Test #8143: Commit 035c2a4 pushed by teetone
March 10, 2025 04:26 8m 57s main
March 10, 2025 04:26 8m 57s
Scenario tests
Scenario tests #304: Scheduled
March 9, 2025 15:33 9m 43s main
March 9, 2025 15:33 9m 43s
Add GPT4 evaluator for open-ended audio scenarios
Test #8142: Pull request #3417 opened by ImKeTT
March 9, 2025 09:18 8m 57s ImKeTT:audio_judge
March 9, 2025 09:18 8m 57s
Added OpenAITranscriptionThenCompletionClient (#3416)
Test #8141: Commit 37d3a91 pushed by teetone
March 9, 2025 05:41 9m 30s main
March 9, 2025 05:41 9m 30s
anthropic thinking
Frontend #787: Commit 5c704cc pushed by teetone
March 8, 2025 20:09 1m 1s vhelm36
March 8, 2025 20:09 1m 1s
Added OpenAITranscriptionThenCompletionClient
Test #8140: Pull request #3416 synchronize by teetone
March 8, 2025 19:29 9m 14s whisper
March 8, 2025 19:29 9m 14s
Added OpenAITranscriptionThenCompletionClient
Test #8139: Pull request #3416 opened by teetone
March 8, 2025 19:24 6m 26s whisper
March 8, 2025 19:24 6m 26s