Skip to content

Actions: tatsu-lab/alpaca_eval

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
1,351 workflow runs
1,351 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

pages build and deployment
pages-build-deployment #547: by YannDubs
August 17, 2024 23:22 1m 52s main
August 17, 2024 23:22 1m 52s
[ENH] add mistral v0.3, Qwen2 70b, gtp4 mini
test format leaderboard #229: Pull request #393 synchronize by YannDubs
August 17, 2024 22:48 2m 6s yann/models_rubriceval
August 17, 2024 22:48 2m 6s
[ENH] add mistral v0.3, Qwen2 70b, gtp4 mini
alpaca_eval unit tests #757: Pull request #393 synchronize by YannDubs
August 17, 2024 22:48 4m 5s yann/models_rubriceval
August 17, 2024 22:48 4m 5s
[ENH] add mistral v0.3, Qwen2 70b, gtp4 mini
test format leaderboard #228: Pull request #393 opened by YannDubs
August 17, 2024 22:39 2m 8s yann/models_rubriceval
August 17, 2024 22:39 2m 8s
[ENH] add mistral v0.3, Qwen2 70b, gtp4 mini
alpaca_eval unit tests #756: Pull request #393 opened by YannDubs
August 17, 2024 22:39 4m 7s yann/models_rubriceval
August 17, 2024 22:39 4m 7s
[ENH] enable base_dir to be a list (#392)
alpaca_eval unit tests #755: Commit c4def44 pushed by YannDubs
August 17, 2024 08:13 4m 13s main
August 17, 2024 08:13 4m 13s
pages build and deployment
pages-build-deployment #546: by YannDubs
August 17, 2024 08:13 1m 59s main
August 17, 2024 08:13 1m 59s
[ENH] enable base_dir to be a list
alpaca_eval unit tests #754: Pull request #392 synchronize by YannDubs
August 17, 2024 08:13 4m 17s yann/multi_base_dir
August 17, 2024 08:13 4m 17s
[ENH] enable base_dir to be a list
alpaca_eval unit tests #753: Pull request #392 synchronize by YannDubs
August 17, 2024 00:29 4m 9s yann/multi_base_dir
August 17, 2024 00:29 4m 9s
pages build and deployment
pages-build-deployment #545: by github-pages bot
August 17, 2024 00:21 1m 48s main
August 17, 2024 00:21 1m 48s
Format leaderboard
Format leaderboard #147: Manually run by YannDubs
August 17, 2024 00:19 2m 8s main
August 17, 2024 00:19 2m 8s
[ENH] enable base_dir to be a list
alpaca_eval unit tests #752: Pull request #392 opened by YannDubs
August 17, 2024 00:18 4m 45s yann/multi_base_dir
August 17, 2024 00:18 4m 45s
[ENH] OpenAI use tools instead of functions (#391)
alpaca_eval unit tests #751: Commit 1deab1b pushed by YannDubs
August 16, 2024 23:46 3m 54s main
August 16, 2024 23:46 3m 54s
pages build and deployment
pages-build-deployment #544: by YannDubs
August 16, 2024 23:46 1m 57s main
August 16, 2024 23:46 1m 57s
Add blendaxai-gm-l3-v35 to AlpacaEval (#389)
alpaca_eval unit tests #750: Commit 9c46d20 pushed by YannDubs
August 16, 2024 23:45 4m 7s main
August 16, 2024 23:45 4m 7s
Add blendaxai-gm-l3-v35 to AlpacaEval (#389)
Format leaderboard #146: Commit 9c46d20 pushed by YannDubs
August 16, 2024 23:45 2m 16s main
August 16, 2024 23:45 2m 16s
pages build and deployment
pages-build-deployment #543: by YannDubs
August 16, 2024 23:45 1m 37s main
August 16, 2024 23:45 1m 37s
alpaca_eval unit tests
alpaca_eval unit tests #749: Manually run by YannDubs
August 16, 2024 23:43 4m 3s main
August 16, 2024 23:43 4m 3s
[ENH] OpenAI use tools instead of functions
alpaca_eval unit tests #748: Pull request #391 opened by YannDubs
August 16, 2024 23:38 4m 12s yann/chatml
August 16, 2024 23:38 4m 12s
[README] add LC AE to analysis
alpaca_eval unit tests #747: Commit 6bbb762 pushed by YannDubs
August 15, 2024 07:46 5m 25s main
August 15, 2024 07:46 5m 25s
pages build and deployment
pages-build-deployment #542: by YannDubs
August 15, 2024 07:46 1m 39s main
August 15, 2024 07:46 1m 39s
[README] move caution in analysis only for AE1
alpaca_eval unit tests #746: Commit 12e3b1d pushed by YannDubs
August 15, 2024 07:44 4m 38s main
August 15, 2024 07:44 4m 38s
pages build and deployment
pages-build-deployment #541: by YannDubs
August 15, 2024 07:44 1m 47s main
August 15, 2024 07:44 1m 47s
Add blendaxai-gm-l3-v35 to AlpacaEval
alpaca_eval unit tests #745: Pull request #389 synchronize by ym-blendax-ai
August 14, 2024 17:57 3m 51s main
August 14, 2024 17:57 3m 51s
Add blendaxai-gm-l3-v35 to AlpacaEval
test format leaderboard #227: Pull request #389 synchronize by ym-blendax-ai
August 14, 2024 17:57 2m 8s main
August 14, 2024 17:57 2m 8s