Conversation

@Lucaskabela (Contributor) commented Oct 9, 2025

Purpose

#25696 converted GroupShape to a list where possible for the cutlass custom ops; however, the same change was not made for the aiter or triton implementations, which causes those code paths to fail during dynamo tracing.
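
For readers outside this code path, here is a minimal, self-contained sketch of the pattern the PR applies. It is an illustration under assumptions, not the actual vLLM code: GroupShape is mimicked as a NamedTuple, and hypothetical_fp8_quant stands in for the aiter/triton quant custom op (requires a recent PyTorch with torch.library.custom_op). The point is that the custom-op schema declares a plain list of ints, so the call site converts the group shape with list() before crossing the op boundary.

    # Sketch only: `GroupShape` and `hypothetical_fp8_quant` are stand-ins, not
    # the real vLLM symbols changed by this PR.
    from typing import NamedTuple

    import torch


    class GroupShape(NamedTuple):
        rows: int
        cols: int


    @torch.library.custom_op("demo::hypothetical_fp8_quant", mutates_args=())
    def hypothetical_fp8_quant(x: torch.Tensor, group_shape: list[int]) -> torch.Tensor:
        # Custom-op schemas accept plain int lists, not arbitrary Python objects.
        return x.clone()


    @hypothetical_fp8_quant.register_fake
    def _(x: torch.Tensor, group_shape: list[int]) -> torch.Tensor:
        return torch.empty_like(x)


    def forward(x: torch.Tensor, weight_group_shape: GroupShape) -> torch.Tensor:
        # The fix pattern: hand the op a plain list, mirroring what #25696
        # already did for the cutlass path.
        return hypothetical_fp8_quant(x, list(weight_group_shape))


    out = torch.compile(forward)(torch.randn(4, 4), GroupShape(128, 128))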

Test Plan

Run DeepSeek-R1-0528 on AMD hardware with FP8 kernels:

FLASH_ATTENTION_TRITON_AMD_ENABLE=TRUE VLLM_USE_V1=1 VLLM_MLA_DISABLE=0 VLLM_FP8_PADDING=1 VLLM_USE_TRITON_FLASH_ATTN=1 VLLM_USE_ROCM_FP8_FLASH_ATTN=0 HSA_NO_SCRATCH_RECLAIM=1 VLLM_USE_STANDALONE_COMPILE=1 with-proxy python examples/offline_inference/basic/generate.py --model=deepseek-ai/DeepSeek-R1-0528 --max-model-len=1024 -tp=8


@Lucaskabela marked this pull request as ready for review October 9, 2025 21:14
@bradleyhd (Contributor) commented:
Can confirm this unblocks our internal AMD pipeline, thanks @Lucaskabela!

Diff hunk under review:

    input_scale,
    weight_scale,
-   self.weight_group_shape,
+   list(self.weight_group_shape),
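
One hedged aside on the one-line change above, assuming GroupShape behaves like a two-field NamedTuple (an assumption for illustration, not something this diff shows): list() only changes the container type handed to the op, not the group-shape values.

    # Illustrative stand-in for GroupShape; not the vLLM definition.
    from typing import NamedTuple


    class GroupShape(NamedTuple):
        rows: int
        cols: int


    weight_group_shape = GroupShape(128, 128)
    # Same group-shape values either way; only the container type differs.
    assert list(weight_group_shape) == [128, 128]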
Collaborator (review comment on the diff hunk above):
Is it possible to add a test somehow? I don't know how vLLM CI runs AMD tests.

@zou3519 requested a review from ProExpertProg October 9, 2025 21:49
@zou3519 added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) Oct 9, 2025
@zou3519 (Collaborator) commented Oct 9, 2025:
If someone has an idea for how to write a test for this, please shout (I'm not very good with how AMD works in vLLM CI); otherwise, this is pretty self-contained, and we verified that it fixes a compile regression.

@yewentao256 (Member) reviewed:
LGTM, thanks for the work!

@ProExpertProg (Collaborator) reviewed:
Sorry we missed this initially, thanks for fixing! AMD testing is in a pretty poor state. cc @Alexei-V-Ivanov-AMD @gshtras

@zou3519 enabled auto-merge (squash) October 10, 2025 11:14
@zou3519 force-pushed the lucaskabela/quant_dynamo_amd_fix branch from c9a82a6 to 7206439 October 10, 2025 11:21
@zou3519 merged commit 213b644 into vllm-project:main Oct 10, 2025
52 checks passed
Dhruvilbhatt pushed a commit to Dhruvilbhatt/vllm that referenced this pull request Oct 14, 2025
bbartels pushed a commit to bbartels/vllm that referenced this pull request Oct 16, 2025
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025

Labels: ready (ONLY add when PR is ready to merge/full CI is needed), rocm (Related to AMD ROCm)

5 participants