
Conversation

@angelayi (Contributor) commented on Oct 11, 2025:

When trying to run the following command, I ran into an assertion error because `self.backend` is equal to `""`.

vllm bench latency \
    --model=RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8 \
    --output-len 1 --input-len 8192 --batch-size 1 \
    --tensor-parallel-size 8 --load-format dummy \
    --num_iters_warmup 5 --num_iters 15 \
    -O '{"level": 3, "pass_config": {"enable_async_tp": true, "enable_sequence_parallelism": true}, "use_inductor_graph_partition": true, "custom_ops":["+quant_fp8"], "cudagraph_mode":"FULL_AND_PIECEWISE"}' \
    --no-enable-prefix-caching
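
For context, a minimal sketch of why the check trips (a simplified stand-in for the relevant config fields, not vLLM's actual class): when no backend is set explicitly, `self.backend` defaults to the empty string, so an equality test against `"inductor"` is false even though inductor is the backend actually in use.

```python
# Simplified stand-in for the relevant CompilationConfig fields
# (names taken from this PR; the real class has many more options).
class CompilationConfig:
    def __init__(self, backend: str = "") -> None:
        self.backend = backend  # "" means no backend was set explicitly

    def is_attention_compiled_piecewise(self) -> bool:
        # Pre-fix check: False for the default "", even when inductor
        # is in use, so the downstream assertion fires.
        return self.backend == "inductor"


print(CompilationConfig().is_attention_compiled_piecewise())  # False
```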

cc @ProExpertProg @zou3519 @BoyuanFeng @baonudesifeizhai

Signed-off-by: angelayi <yiangela7@gmail.com>
@gemini-code-assist (bot) left a comment:


Code Review

This pull request correctly fixes a bug in the inductor partition configuration. The is_attention_compiled_piecewise method was previously checking self.backend == "inductor", which would incorrectly evaluate to false when the default backend (an empty string) is used with inductor. The change to check self.use_inductor is the correct approach, as this flag accurately indicates whether inductor compilation is enabled. This resolves the assertion error described in the pull request.
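
As a rough sketch of the corrected check (again a simplified stand-in; the real method in vLLM's compilation config carries more logic):

```python
class CompilationConfig:
    def __init__(self, use_inductor: bool = True) -> None:
        # Tracks whether inductor compilation is enabled, independent
        # of whether `backend` was set explicitly.
        self.use_inductor = use_inductor

    def is_attention_compiled_piecewise(self) -> bool:
        # Post-fix check: consult the flag that actually reflects
        # whether inductor is in use.
        return self.use_inductor


print(CompilationConfig().is_attention_compiled_piecewise())  # True
```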

@ProExpertProg (Collaborator) left a comment:


Yep, bad merge of #25845 after #26113 was reverted in #26472. @morrison-turnansky can you fix in #26502?

@ProExpertProg enabled auto-merge (squash) on October 11, 2025, 19:12
@github-actions (bot) added the ready label (ONLY add when PR is ready to merge/full CI is needed) on Oct 11, 2025
@ProExpertProg merged commit 01653a9 into vllm-project:main on Oct 11, 2025
48 checks passed
@morrison-turnansky (Contributor) commented:
@ProExpertProg yes, will do

1994 pushed a commit to 1994/vllm that referenced this pull request Oct 14, 2025
Signed-off-by: angelayi <yiangela7@gmail.com>
Signed-off-by: 1994 <1994@users.noreply.github.com>
Dhruvilbhatt pushed a commit to Dhruvilbhatt/vllm that referenced this pull request Oct 14, 2025
Signed-off-by: angelayi <yiangela7@gmail.com>
Signed-off-by: Dhruvil Bhatt <bhattdbh@amazon.com>
bbartels pushed a commit to bbartels/vllm that referenced this pull request Oct 16, 2025
Signed-off-by: angelayi <yiangela7@gmail.com>
Signed-off-by: bbartels <benjamin@bartels.dev>
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
Signed-off-by: angelayi <yiangela7@gmail.com>
alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025
Signed-off-by: angelayi <yiangela7@gmail.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
Signed-off-by: angelayi <yiangela7@gmail.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025
Signed-off-by: angelayi <yiangela7@gmail.com>
Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>
