[Bugfix] Fallback ViT attn backend to SDPA for blackwell #25851
Conversation
Code Review
This pull request refactors the ViT attention backend fallback for Blackwell GPUs by moving the logic from a model-specific file (qwen3_vl.py) to the general CUDA platform file (cuda.py). While this is a good architectural improvement, the new implementation in cuda.py has a logical flaw that prevents the fallback from working as intended. I've provided a critical comment with a suggested fix to correct the device capability check order.
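For context, here is a minimal sketch of the fallback order the review is asking for. This is not the actual vllm/platforms/cuda.py code: the `Backend` enum is a local stand-in for vLLM's `_Backend`, and the assumption that Blackwell reports major compute capability 10 is mine.

```python
import enum

import torch


class Backend(enum.Enum):  # stand-in for vLLM's _Backend enum
    FLASH_ATTN = enum.auto()
    XFORMERS = enum.auto()
    TORCH_SDPA = enum.auto()


def get_vit_attn_backend(head_size: int, dtype: torch.dtype) -> Backend:
    # The Blackwell check must come first (assumption: Blackwell reports
    # major compute capability 10); if the generic selection below ran
    # first and returned a backend, the SDPA fallback would never fire.
    major, _minor = torch.cuda.get_device_capability()
    if major == 10:
        return Backend.TORCH_SDPA
    # Generic selection: prefer flash-attn when installed, else SDPA.
    try:
        import flash_attn  # noqa: F401
        return Backend.FLASH_ATTN
    except ImportError:
        return Backend.TORCH_SDPA
```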
Perhaps we should link a tracking issue in the code?

Good point!
```python
self.attn_backend = get_vit_attn_backend(
    head_size=head_dim, dtype=torch.get_default_dtype())
use_upstream_fa = False
if self.attn_backend != _Backend.FLASH_ATTN and \
```
QQ: Does FA have a similar problem on Blackwell? This logic may still select upstream FA if it is available.
Upstream FA seems okay on Blackwell (a user reported that installing upstream FA also fixes the issue, which is why I didn't delete this code).
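As a rough illustration of that path (illustrative names, not the exact vLLM code), the selection can probe for an upstream flash-attn install before dropping to SDPA:

```python
import importlib.util

# Illustrative sketch: prefer upstream flash-attn when it is installed,
# since it reportedly works on Blackwell; otherwise keep the SDPA fallback.
# `Backend` is the stand-in enum from the sketch above.
use_upstream_fa = False
if importlib.util.find_spec("flash_attn") is not None:
    attn_backend = Backend.FLASH_ATTN
    use_upstream_fa = True
else:
    attn_backend = Backend.TORCH_SDPA
```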
Purpose
#25788 fixed the issue for Qwen3-VL. While we don't know whether xformers will work with other head sizes, Blackwell is not yet officially supported by xformers according to facebookresearch/xformers#1317 (comment). It's therefore probably safer to force the ViT attention backend to SDPA for all models on Blackwell for now.
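To check whether a given GPU would hit this fallback, one can query the compute capability directly (assumption: the forced-SDPA path targets Blackwell parts reporting major compute capability 10):

```python
import torch

major, minor = torch.cuda.get_device_capability()
print(f"Compute capability: {major}.{minor}")
# Assumption: the forced-SDPA fallback applies when major == 10 (Blackwell).
print("ViT SDPA fallback would apply:", major == 10)
```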
Test Plan
Test Result