Skip to content

Actions: EleutherAI/lm-evaluation-harness

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
6,413 workflow runs
6,413 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add GPTQModel support for evaluating GPTQ models (#2217)
Unit Tests #3625: Commit 4f8e479 pushed by baberabb
October 31, 2024 16:15 6m 42s main
October 31, 2024 16:15 6m 42s
Add GPTQModel support for evaluating GPTQ models (#2217)
Tasks Modified #3653: Commit 4f8e479 pushed by baberabb
October 31, 2024 16:15 14s main
October 31, 2024 16:15 14s
OpenAI ChatCompletions: switch max_tokens
Tasks Modified #3651: Pull request #2443 synchronize by baberabb
October 31, 2024 09:14 17s openaichat
October 31, 2024 09:14 17s
OpenAI ChatCompletions: switch max_tokens
Unit Tests #3623: Pull request #2443 synchronize by baberabb
October 31, 2024 09:14 6m 10s openaichat
October 31, 2024 09:14 6m 10s
Add Aggregation for Kobest Benchmark
Tasks Modified #3650: Pull request #2446 synchronize by tryumanshow
October 31, 2024 04:46 1m 29s tryumanshow:kobest-agg
October 31, 2024 04:46 1m 29s
Add Aggregation for Kobest Benchmark
Unit Tests #3622: Pull request #2446 synchronize by tryumanshow
October 31, 2024 04:46 6m 16s tryumanshow:kobest-agg
October 31, 2024 04:46 6m 16s
Add Aggregation for Kobest Benchmark
Unit Tests #3621: Pull request #2446 synchronize by tryumanshow
October 31, 2024 04:43 6m 6s tryumanshow:kobest-agg
October 31, 2024 04:43 6m 6s
Add Aggregation for Kobest Benchmark
Tasks Modified #3649: Pull request #2446 synchronize by tryumanshow
October 31, 2024 04:43 2m 6s tryumanshow:kobest-agg
October 31, 2024 04:43 2m 6s
Add Aggregation for Kobest Benchmark
Unit Tests #3620: Pull request #2446 synchronize by tryumanshow
October 31, 2024 04:40 6m 19s tryumanshow:kobest-agg
October 31, 2024 04:40 6m 19s
Add Aggregation for Kobest Benchmark
Tasks Modified #3648: Pull request #2446 synchronize by tryumanshow
October 31, 2024 04:40 2m 11s tryumanshow:kobest-agg
October 31, 2024 04:40 2m 11s
Add Aggregation for Kobest Benchmark
Tasks Modified #3647: Pull request #2446 synchronize by tryumanshow
October 31, 2024 04:39 1m 48s tryumanshow:kobest-agg
October 31, 2024 04:39 1m 48s
Add Aggregation for Kobest Benchmark
Unit Tests #3619: Pull request #2446 synchronize by tryumanshow
October 31, 2024 04:39 2m 32s tryumanshow:kobest-agg
October 31, 2024 04:39 2m 32s
Add Aggregation for Kobest Benchmark
Unit Tests #3618: Pull request #2446 synchronize by tryumanshow
October 31, 2024 04:35 6m 17s tryumanshow:kobest-agg
October 31, 2024 04:35 6m 17s
Add Aggregation for Kobest Benchmark
Tasks Modified #3646: Pull request #2446 synchronize by tryumanshow
October 31, 2024 04:35 1m 34s tryumanshow:kobest-agg
October 31, 2024 04:35 1m 34s
Add Aggregation for Kobest Benchmark
Tasks Modified #3645: Pull request #2446 opened by tryumanshow
October 31, 2024 04:29 1m 36s tryumanshow:kobest-agg
October 31, 2024 04:29 1m 36s
Add Aggregation for Kobest Benchmark
Unit Tests #3617: Pull request #2446 opened by tryumanshow
October 31, 2024 04:29 6m 27s tryumanshow:kobest-agg
October 31, 2024 04:29 6m 27s
mlx Model (loglikelihood & generate_until)
Unit Tests #3616: Pull request #1902 synchronize by chimezie
October 30, 2024 20:00 Action required chimezie:mlx
October 30, 2024 20:00 Action required
mlx Model (loglikelihood & generate_until)
Tasks Modified #3644: Pull request #1902 synchronize by chimezie
October 30, 2024 20:00 Action required chimezie:mlx
October 30, 2024 20:00 Action required
OpenAI ChatCompletions: switch max_tokens
Unit Tests #3615: Pull request #2443 synchronize by baberabb
October 30, 2024 15:11 5m 49s openaichat
October 30, 2024 15:11 5m 49s
OpenAI ChatCompletions: switch max_tokens
Tasks Modified #3643: Pull request #2443 synchronize by baberabb
October 30, 2024 15:11 13s openaichat
October 30, 2024 15:11 13s
Add verify_certificate argument to local-completion (#2440)
Tasks Modified #3642: Commit 57272b6 pushed by baberabb
October 30, 2024 14:42 15s main
October 30, 2024 14:42 15s
Add verify_certificate argument to local-completion (#2440)
Unit Tests #3614: Commit 57272b6 pushed by baberabb
October 30, 2024 14:42 6m 37s main
October 30, 2024 14:42 6m 37s
Add xquad task (#2435)
Tasks Modified #3641: Commit b40a20a pushed by baberabb
October 30, 2024 14:36 5m 12s main
October 30, 2024 14:36 5m 12s