[tx] Right align prompts for decoding #932

pcmoritz · 2026-01-24T06:50:57Z

In #927, we left aligned the prompts for sampling prefill in preparation for #879. Here we shift them after the prefill so they will again be right aligned for the decoding. This will allow us to use the cudnn attention #927 for both prefill and decoding.

gemini-code-assist

Code Review

This pull request aims to right-align prompts for the decoding phase after a left-aligned prefill. This is a good change that should allow for using the same efficient attention mechanism for both prefill and decoding. The approach of removing the compute_positions function and introducing a batch_roll function to shift the KV cache and attention mask is sound. However, I've identified a critical issue in the implementation of the new batch_roll function that would lead to incorrect behavior for multi-dimensional tensors like the KV cache. My review includes a suggested fix for this issue.

skyrl-tx/tx/utils/generator.py

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

pcmoritz · 2026-01-24T06:57:43Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a change to right-align prompts for the decoding phase, following a prefill phase with left-aligned prompts. This is a clever approach to enable the use of optimized cuDNN attention kernels for both stages. The changes are well-contained and logical. The new batch_roll utility is correctly implemented for shifting the KV cache and attention mask. The modifications in the model files to simplify position ID generation for the prefill stage are also appropriate given the new alignment strategy. Overall, the implementation appears correct and robust.

…o tx-right-align-decode

In NovaSky-AI#927, we left aligned the prompts for sampling prefill in preparation for NovaSky-AI#879. Here we shift them after the prefill so they will again be right aligned for the decoding. This will allow us to use the cudnn attention NovaSky-AI#927 for both prefill and decoding. --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

pcmoritz added 2 commits January 23, 2026 22:46

[tx] Right align prompts for decoding

9496754

update

013ca4a

pcmoritz added the tx label Jan 24, 2026

gemini-code-assist bot reviewed Jan 24, 2026

View reviewed changes

skyrl-tx/tx/utils/generator.py Outdated Show resolved Hide resolved

Update skyrl-tx/tx/utils/generator.py

b05f70a

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

gemini-code-assist bot reviewed Jan 24, 2026

View reviewed changes

pcmoritz added 2 commits January 23, 2026 23:45

fix OOM in CI

bb06675

Merge branch 'tx-right-align-decode' of github.com:pcmoritz/SkyRL int…

93f2f1e

…o tx-right-align-decode

pcmoritz merged commit b9a6307 into NovaSky-AI:main Jan 24, 2026
4 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[tx] Right align prompts for decoding #932

[tx] Right align prompts for decoding #932

pcmoritz commented Jan 24, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

pcmoritz commented Jan 24, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[tx] Right align prompts for decoding #932

[tx] Right align prompts for decoding #932

Conversation

pcmoritz commented Jan 24, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

pcmoritz commented Jan 24, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant