Skip to content

Conversation

@WoosukKwon
Copy link
Collaborator

@WoosukKwon WoosukKwon commented Sep 13, 2025

IIUC, batched RoPE is only used when different LoRA adapters have different rope scaling factor.
Now that this feature is dropped in #21169, we can remove this op for simplication.

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request effectively removes the unused batched_rotary_embedding operation and its associated kernel. The changes are comprehensive, covering the C++ implementation, Python bindings, and corresponding tests, leading to a cleaner and more maintainable codebase. I have a few minor suggestions to improve code formatting for better readability and consistency with standard Python style guides.

@WoosukKwon WoosukKwon added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 13, 2025
@vllm-bot vllm-bot merged commit 5febdc8 into main Sep 13, 2025
94 of 96 checks passed
@vllm-bot vllm-bot deleted the woosuk/remove-batched-rope branch September 13, 2025 07:08
dsxsteven pushed a commit to dsxsteven/vllm_splitPR that referenced this pull request Sep 15, 2025
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
bbartels pushed a commit to bbartels/vllm that referenced this pull request Sep 15, 2025
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Signed-off-by: bbartels <benjamin@bartels.dev>
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants