[Bugfix] Fix interns1-vit qk norm code path #27480
Conversation
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Code Review
This pull request correctly fixes a critical bug in InternSdpaAttention's forward pass for interns1-vit models with QK normalization enabled. The previous implementation would crash because it attempted to unpack a 3D tensor into four variables and also misused flatten. The fix applies QK normalization directly to the query and key tensors, which is the correct approach. Removing the unused B, N, C variables is also a good cleanup.
I've added one comment pointing out a related latent bug when num_dummy_heads > 0, which will also cause a crash; it appears to stem from incorrectly adapted tensor-parallelism dummy-head logic. While this PR fixes the most obvious issue, addressing that related bug would make the implementation more robust. A sketch of the fixed code path is shown below.
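To make the fix concrete, here is a minimal, self-contained sketch of the code path the review describes: QK normalization applied directly to the query and key tensors before the heads are split for scaled dot-product attention. This is not the actual vLLM InternSdpaAttention implementation; the module name, the torch.nn.RMSNorm stand-in (available in recent PyTorch releases), and the (batch, seq_len, embed_dim) shapes are assumptions made only for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SdpaAttentionSketch(nn.Module):
    """Hypothetical ViT attention block illustrating the fixed QK-norm path."""

    def __init__(self, embed_dim: int, num_heads: int, qk_normalization: bool = True):
        super().__init__()
        self.num_heads = num_heads
        self.head_dim = embed_dim // num_heads
        self.qkv = nn.Linear(embed_dim, 3 * embed_dim)
        self.proj = nn.Linear(embed_dim, embed_dim)
        self.qk_normalization = qk_normalization
        # torch.nn.RMSNorm is used here as a stand-in for vLLM's RMSNorm layer.
        self.q_norm = nn.RMSNorm(embed_dim)
        self.k_norm = nn.RMSNorm(embed_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        bsz, seq_len, embed_dim = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        if self.qk_normalization:
            # The fix described in the review: normalize q and k directly,
            # instead of unpacking a stacked qkv tensor into four variables.
            q = self.q_norm(q)
            k = self.k_norm(k)
        # Split heads only after normalization, then run SDPA.
        q = q.view(bsz, seq_len, self.num_heads, self.head_dim).transpose(1, 2)
        k = k.view(bsz, seq_len, self.num_heads, self.head_dim).transpose(1, 2)
        v = v.view(bsz, seq_len, self.num_heads, self.head_dim).transpose(1, 2)
        out = F.scaled_dot_product_attention(q, k, v)
        out = out.transpose(1, 2).reshape(bsz, seq_len, embed_dim)
        return self.proj(out)
```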
q = self.q_norm(q)
k = self.k_norm(k)
While this change correctly fixes the immediate crash, there's a latent critical bug when num_dummy_heads > 0 which will cause crashes in this block and later on.
- q_norm and k_norm initialization: In __init__, self.q_norm is initialized as RMSNorm(hidden_size=self.dummy_dim, ..., var_hidden_size=self.embed_dim). If num_dummy_heads > 0, then self.dummy_dim > self.embed_dim. The input q has a last dimension of self.embed_dim, and RMSNorm expects the input's last dimension to match its hidden_size (self.dummy_dim), so self.q_norm(q) will raise a ValueError. The same applies to k_norm.
- projection_layer input shape: The output of self.attn(q, k, v) will have a shape of (..., self.embed_dim). However, self.projection_layer is initialized as nn.Linear(self.dummy_dim, self.embed_dim). If num_dummy_heads > 0, the call to self.projection_layer on a subsequent line will fail due to a shape mismatch.
The MultiHeadAttention implementation used here doesn't seem to account for dummy heads. The entire dummy_dim logic within InternSdpaAttention might need to be re-evaluated to either be correctly implemented or removed if it's not applicable for this ViT attention layer. A potential fix for the normalization part would be to initialize RMSNorm with self.embed_dim.
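As a hedged illustration of the dummy-head issue (not the actual vLLM code), the snippet below reproduces the shape mismatch using torch.nn.RMSNorm as a stand-in for vLLM's RMSNorm. The dimension values and the dummy_dim arithmetic are assumptions chosen only to show the failure mode; the last lines show the suggested fix of normalizing over embed_dim instead.

```python
import torch
import torch.nn as nn

# Assumed example dimensions; dummy_dim > embed_dim whenever num_dummy_heads > 0.
embed_dim = 1024
head_dim = 64
num_dummy_heads = 2
dummy_dim = embed_dim + num_dummy_heads * head_dim

q = torch.randn(1, 16, embed_dim)  # query activations with last dim embed_dim

# Building the norm over dummy_dim (as described above) rejects an
# embed_dim-sized input. torch.nn.RMSNorm raises RuntimeError here; vLLM's
# RMSNorm is reported to raise ValueError, hence catching both.
q_norm_dummy = nn.RMSNorm(dummy_dim)
try:
    q_norm_dummy(q)
except (RuntimeError, ValueError) as err:
    print(f"shape mismatch: {err}")

# Suggested fix for the normalization part: build the norm over embed_dim.
q_norm = nn.RMSNorm(embed_dim)
print(q_norm(q).shape)  # torch.Size([1, 16, 1024])
```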
Purpose
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.