Revert "[Fix] Fix pass_at_k missing for SkRLGymGenerator.agent_loop flow due to token-level rewards" #300

Merged
erictang000 merged 1 commit into main from revert-271-fix-0908-fix-reward
Sep 16, 2025

Conversation

@CharlieFRuan
Collaborator

Reverts #271 as it causes errors

For more, see #299 (comment)

Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request reverts a recent change that caused errors. The reverted code was intended to handle per-trajectory rewards as floats to enable pass_at_k calculation, but it introduced regressions. This PR restores the previous behavior where rewards are consistently handled as token-level lists. The changes are applied correctly in skyrl_train/skyrl_train/generators/skyrl_gym_generator.py and the corresponding tests are updated to reflect the reverted logic. The revert is clean and a good measure to ensure stability.
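For illustration, the trade-off described in the review can be sketched as follows. This is a minimal, hypothetical example and not the SkyRL implementation: the names `trajectory_score` and `pass_at_k`, the choice to sum token-level rewards into one scalar, and the rule that a prompt "passes" when any of its samples has a positive score are all assumptions made for this sketch.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Union

# A reward is either one float per trajectory or a token-level list of floats.
Reward = Union[float, Sequence[float]]


def trajectory_score(reward: Reward) -> float:
    """Collapse a reward into one scalar per trajectory.

    Assumption: a token-level reward list is reduced by summing its entries.
    """
    if isinstance(reward, (int, float)):
        return float(reward)
    return float(sum(reward))


def pass_at_k(rewards: List[Reward], prompt_ids: List[str]) -> float:
    """Fraction of prompts with at least one sample scoring above zero.

    Hypothetical helper: samples are grouped by prompt id, and a prompt
    passes if its best trajectory score is positive.
    """
    best: Dict[str, float] = defaultdict(lambda: float("-inf"))
    for reward, pid in zip(rewards, prompt_ids):
        best[pid] = max(best[pid], trajectory_score(reward))
    if not best:
        return 0.0
    return sum(1 for score in best.values() if score > 0) / len(best)
```

Reducing each reward to a scalar at metric time would let the generator keep returning token-level lists, which is the behavior this revert restores, while still allowing a pass_at_k style metric to be computed.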

@erictang000 erictang000 merged commit 7406b96 into main Sep 16, 2025
3 checks passed
@tyler-griggs tyler-griggs deleted the revert-271-fix-0908-fix-reward branch September 22, 2025 18:10
dzorlu pushed a commit to fleet-ai/SkyRL that referenced this pull request Feb 4, 2026
…low due to token-level rewards" (NovaSky-AI#300)

Reverts NovaSky-AI#271 as it causes errors

For more, see NovaSky-AI#299 (comment)