Conversation

@yiz-liu yiz-liu commented Apr 23, 2025

What this PR does / why we need it?

Enforce eager mode in the V1 engine ahead of the upcoming CANN and torch_npu releases.

Does this PR introduce any user-facing change?

After this change, users will no longer need to manually set enforce_eager=True.

How was this patch tested?

Tested with the regular offline inference examples.

Signed-off-by: Yizhou Liu <liu_yizhou@outlook.com>
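The behavior described above can be sketched roughly as follows. This is a hypothetical, stand-alone illustration, not the actual vllm-ascend code: `apply_v1_defaults` and the config keys `use_v1_engine` / `enforce_eager` are names chosen here for the example only.

```python
# Hypothetical sketch of the fix: when the V1 engine is selected, the
# platform forces eager execution until the upcoming CANN and torch_npu
# releases support graph mode on NPU. Function and key names are
# illustrative, not taken from the real implementation.

def apply_v1_defaults(config: dict) -> dict:
    """Return a copy of the engine config with V1-safe defaults applied."""
    patched = dict(config)
    if patched.get("use_v1_engine", False):
        # Before this change, users had to pass enforce_eager=True
        # themselves; after it, the plugin sets the flag automatically.
        patched["enforce_eager"] = True
    return patched


cfg = apply_v1_defaults({"use_v1_engine": True})
print(cfg["enforce_eager"])  # True
```

With a sketch like this, a V1-engine config always ends up with `enforce_eager` set, while configs that do not opt into the V1 engine are left untouched.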
Collaborator

Looks good to me, but the changes in this file seem unrelated to the topic of this PR.

Collaborator Author

Actually, I spoke with Xiyuan and confirmed that this test case does not require setting the vllm config. As a result, I reverted it to its original state. The reason for this change is that the CI triggered a vllm config-related error while I was addressing the enforce_eager issue, which led me to this test case.

@ganyi1996ppo
Collaborator

@wangxiyuan can you please review this PR? We should merge it ASAP.

@MengqingCao
Collaborator

lgtm

@wangxiyuan wangxiyuan merged commit d785e78 into vllm-project:main Apr 24, 2025
15 checks passed
@wangxiyuan wangxiyuan changed the title Quick Fix: Enforce eager mode in the V1 engine ahead of the upcoming CANN and torch_npu releases [V1] Make V1 engine backward compatible Apr 24, 2025
ttanzhiqiang pushed a commit to ttanzhiqiang/vllm-ascend that referenced this pull request Apr 27, 2025
Angazenn pushed a commit to Angazenn/vllm-ascend that referenced this pull request Oct 21, 2025
