Actions: huggingface/open-r1

Showing runs from all workflows
578 workflow runs

[rewards] use dense rep penalty
Tests #59: Pull request #296 opened by kashif
February 12, 2025 19:02 2m 26s fix-rep-test
[Rewards] add kimi len_reward
Tests #58: Pull request #292 synchronize by kashif
February 12, 2025 12:21 2m 33s len-reward
[Rewards] add kimi len_reward
Tests #57: Pull request #292 synchronize by kashif
February 12, 2025 12:11 3m 8s len-reward
Enable Weights & Biases defaults to be overridden in training (#294)
Tests #56: Commit 96a6b0f pushed by lewtun
February 12, 2025 12:01 2m 19s main
Enable Weights & Biases defaults to be overridden in training
Tests #55: Pull request #294 synchronize by lewtun
February 12, 2025 11:58 2m 31s add-wandb
Enable Weights & Biases defaults to be overridden in training
Tests #54: Pull request #294 opened by lewtun
February 12, 2025 11:57 2m 31s add-wandb
[Rewards] add kimi len_reward
Tests #53: Pull request #292 opened by kashif
February 12, 2025 11:44 2m 16s len-reward
[GRPO] generate with prompt containing the first <think> tag
Tests #52: Pull request #283 synchronize by kashif
February 12, 2025 11:10 2m 27s test-format
Update README.md
Tests #51: Pull request #291 opened by tpoisonooo
February 12, 2025 09:34 Action required tpoisonooo:patch-1
Performance improvements of reward calculation
Tests #50: Pull request #286 opened by saidineshpola
February 11, 2025 19:17 5m 13s saidineshpola:main
[GRPO] generate with prompt containing the first <think> tag
Tests #49: Pull request #283 synchronize by kashif
February 11, 2025 15:49 2m 24s test-format
[GRPO] generate with prompt containing the first <think> tag
Tests #48: Pull request #283 synchronize by kashif
February 11, 2025 15:47 2m 25s test-format
Weighted reward functions
Tests #47: Pull request #213 synchronize by zeenolife
February 11, 2025 14:21 6m 46s zeenolife:almaz/reward-weights
Weighted reward functions
Tests #46: Pull request #213 synchronize by qgallouedec
February 11, 2025 13:51 2m 26s zeenolife:almaz/reward-weights
Fix uuid in the data generator (#284)
Tests #43: Commit fa9b621 pushed by anton-l
February 11, 2025 13:08 2m 23s main
Fix uuid in the data generator
Tests #42: Pull request #284 opened by anton-l
February 11, 2025 13:04 2m 20s server-scripts
[GRPO] generate with prompt containing the first <think> tag
Tests #41: Pull request #283 synchronize by kashif
February 11, 2025 12:47 2m 32s test-format
[GRPO] generate with prompt containing the first <think> tag
Tests #40: Pull request #283 synchronize by kashif
February 11, 2025 12:17 2m 31s test-format
[GRPO] generate with prompt containing the first <think> tag
Tests #39: Pull request #283 synchronize by kashif
February 11, 2025 12:07 2m 33s test-format
Weighted reward functions
Tests #38: Pull request #213 synchronize by qgallouedec
February 11, 2025 10:44 2m 35s zeenolife:almaz/reward-weights
Weighted reward functions
Tests #37: Pull request #213 synchronize by qgallouedec
February 11, 2025 10:41 2m 31s zeenolife:almaz/reward-weights
new grpo logic (#274)
Tests #36: Commit 52aa875 pushed by lewtun
February 11, 2025 08:35 2m 20s main
fix(sft recipes): remove duplicate packing option from config (#280)
Tests #35: Commit 82b2a65 pushed by lewtun
February 11, 2025 08:34 2m 18s main
Weighted reward functions
Tests #33: Pull request #213 synchronize by zeenolife
February 10, 2025 19:31 2m 16s zeenolife:almaz/reward-weights
Weighted reward functions
Tests #32: Pull request #213 synchronize by zeenolife
February 10, 2025 17:45 8s zeenolife:almaz/reward-weights