
ZJY0516 commented Nov 7, 2025

Purpose

As discussed in #28182, we should not use torch.empty to allocate the attention output buffer in Qwen3NextGatedDeltaNet. This PR adds a comment documenting why torch.zeros is used there instead.
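
A minimal sketch of the concern, not the vLLM code itself: torch.empty returns uninitialized memory, so if a kernel fills only part of the buffer (assumed here to be the rows for real tokens, with padded rows left untouched), the remaining rows hold arbitrary values. torch.zeros gives those rows a defined value. The helper names, shapes, and the partial-write scenario below are hypothetical illustrations; see #28182 for the actual discussion.

```python
import torch


def allocate_output(num_tokens: int, num_heads: int, head_dim: int,
                    dtype: torch.dtype = torch.float32,
                    device: str = "cpu") -> torch.Tensor:
    """Allocate an output buffer with every element defined (hypothetical helper)."""
    # torch.zeros, not torch.empty: rows that are never written stay at 0
    # instead of holding whatever was previously in that memory.
    return torch.zeros((num_tokens, num_heads, head_dim),
                       dtype=dtype, device=device)


def write_valid_rows(out: torch.Tensor, values: torch.Tensor) -> torch.Tensor:
    """Hypothetical stand-in for a kernel that fills only the first rows of `out`."""
    num_actual = values.shape[0]
    out[:num_actual].copy_(values)
    return out


# Assumed scenario: 8 buffer rows (e.g. a padded batch), only 5 real tokens.
num_tokens, num_actual = 8, 5
out = allocate_output(num_tokens, num_heads=2, head_dim=4)
write_valid_rows(out, torch.ones(num_actual, 2, 4))

# The rows that were never written are still well defined (all zeros).
padded_rows = out[num_actual:]
```

Swapping torch.zeros for torch.empty in allocate_output would leave padded_rows with undefined contents, which any later computation that reads the whole buffer would then pick up.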



Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
ZJY0516 requested a review from sighingnow as a code owner November 7, 2025 05:09

ZJY0516 commented Nov 7, 2025

cc @vadiklyutiy

mergify bot added the qwen (Related to Qwen models) label Nov 7, 2025

gemini-code-assist bot left a comment


Code Review

This pull request adds a helpful comment to explain why torch.zeros is used instead of torch.empty for allocating the attention buffer in Qwen3NextGatedDeltaNet. This is a good practice for documenting non-obvious design choices. The change is correct and improves code clarity. I have no further comments.

vadiklyutiy self-requested a review November 7, 2025 23:40
vadiklyutiy added the ready-for-merge (Indicate this PR is ready to be merged by the maintainers, used by reviewers without merge access.) and ready (ONLY add when PR is ready to merge/full CI is needed) labels Nov 7, 2025
ywang96 merged commit 7ae5a5f into vllm-project:main Nov 9, 2025
59 checks passed
ZJY0516 deleted the q3n-comment branch November 13, 2025 09:38
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Nov 13, 2025
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
