Actions: open-compass/opencompass

deploy

850 workflow runs

[Demo] Internlm3 math500 thinking demo (#1846)
deploy #911: Commit 862bf78 pushed by tonysy
January 24, 2025 06:56 2s main
[Feature] Support OlympiadBench Benchmark (#1841)
deploy #910: Commit 412199f pushed by liushz
January 24, 2025 02:00 3s main
[Feature] Support Omni-Math (#1837)
deploy #909: Commit 70f2c96 pushed by liushz
January 23, 2025 10:36 2s main
[Bump] Bump version to 0.4.0 (#1838)
deploy #908: Commit 35ec307 pushed by MaiziXiao
January 22, 2025 06:41 33s 0.4.0
[Bump] Bump version to 0.4.0 (#1838)
deploy #907: Commit 35ec307 pushed by MaiziXiao
January 22, 2025 03:41 2s main
[Fix] Update max_out_len logic for OpenAI model (#1839)
deploy #906: Commit 03415b2 pushed by MaiziXiao
January 21, 2025 07:46 3s main
[Refactor] Code refactoarization (#1831)
deploy #905: Commit a6193b4 pushed by MaiziXiao
January 20, 2025 11:17 3s main
[Doc] Installation.md update (#1830)
deploy #904: Commit ffdc917 pushed by MaiziXiao
January 17, 2025 03:08 2s main
[Update] Update method to add dataset in docs (#1827)
deploy #903: Commit 70da9b7 pushed by MaiziXiao
January 17, 2025 03:07 2s main
[Feature] Add support for InternLM3 (#1829)
deploy #902: Commit 531643e pushed by MaiziXiao
January 16, 2025 06:28 2s main
January 10, 2025 10:20 3s
[CI] Fix path conflict (#1814)
deploy #900: Commit 121d482 pushed by MaiziXiao
January 9, 2025 12:16 2s main
[CI] Update daily test metrics threshold (#1812)
deploy #899: Commit abdcee6 pushed by MaiziXiao
January 9, 2025 10:16 2s main
[Feature] Support MMLU-CF Benchmark (#1775)
deploy #898: Commit e039f3e pushed by liushz
January 9, 2025 06:11 3s main
[Update] Update LiveMathBench (#1809)
deploy #897: Commit f1e50d4 pushed by MaiziXiao
January 7, 2025 11:16 2s main
[Update] Update o1 eval prompt (#1806)
deploy #896: Commit 8fdb72f pushed by tonysy
January 6, 2025 16:14 2s main
January 3, 2025 08:33 2s
[Feature] Add Longbenchv2 support (#1801)
deploy #894: Commit 117dc50 pushed by MaiziXiao
January 3, 2025 04:04 2s main
[BUMP] Bump version to 0.3.9 (#1790)
deploy #893: Commit f322043 pushed by MaiziXiao
December 31, 2024 09:28 37s 0.3.9
[BUMP] Bump version to 0.3.9 (#1790)
deploy #892: Commit f322043 pushed by MaiziXiao
December 31, 2024 08:52 2s main
[Feature] Add LiveStemBench Dataset (#1794)
deploy #891: Commit 9c980cb pushed by MaiziXiao
December 31, 2024 07:17 2s main
[Fix] Fix generic_llm_evaluator output_path (#1798)
deploy #890: Commit fc0556e pushed by MaiziXiao
December 31, 2024 05:05 2s main
[Feature] Added Bradley-Terry subjective evaluation
deploy #889: Commit dc6035c pushed by MaiziXiao
December 31, 2024 03:01 2s main
[Feature] Update o1 evaluation with JudgeLLM (#1795)
deploy #888: Commit 98435dd pushed by liushz
December 30, 2024 09:31 2s main
[Feature] Support G-Pass@k and LiveMathBench (#1772)
deploy #887: Commit 8e8d4f1 pushed by MaiziXiao
December 30, 2024 08:59 2s main