[v0.9.1][refactor] Refactoring AscendFusedMoE (#1229) #1264

ganyi1996ppo · 2025-06-17T12:27:13Z

What this PR does / why we need it?

This PR is the cherry-pick of the PR #1229 which have already merged into the main branch.

This PR is used for resolved issue 1147

Move fused_moe code into one file fused_moe.py.
Integrate branch conditions into function get_fused_moe_state.

Does this PR introduce any user-facing change?

This PR has removed the env VLLM_ENABLE_MC2, because I think this env is useless, we can make judgments based on the current scenario without this env, it will only increase complexity.
This PR has removed the env USING_LCCL_COM, because this env has already expired.
additional_config.expert_tensor_parallel_size has already expired, and now we also use parameter enable_expert_parallel, consistent with the vLLM.

How was this patch tested?

CI passed

This PR is used for resolved [issue 1147](vllm-project#1147) 1. Move fused_moe code into one file `fused_moe.py`. 2. Integrate branch conditions into function `get_fused_moe_state`.  1. This PR has removed the env `VLLM_ENABLE_MC2`, because I think this env is useless, we can make judgments based on the current scenario without this env, it will only increase complexity. 2. This PR has removed the env `USING_LCCL_COM`, because this env has already expired. 3. `additional_config.expert_tensor_parallel_size` has already expired, and now we also use parameter `enable_expert_parallel`, consistent with the vLLM.   Signed-off-by: zzzzwwjj <1183291235@qq.com> Signed-off-by: ganyi <pleaplusone.gy@gmail.com>

ganyi1996ppo · 2025-06-17T12:29:53Z

@zzzzwwjj there is a conflict in model_runner_v1.py, please review this code change.

ganyi1996ppo force-pushed the dev/refactor_moe branch from 05010a7 to 15731e4 Compare June 17, 2025 12:28

github-actions bot added module:ops module:core module:quantization labels Jun 17, 2025

wangxiyuan approved these changes Jun 17, 2025

View reviewed changes

Yikun changed the title ~~[cherry-pick][refactor] Refactoring AscendFusedMoE (#1229)~~ [v0.9.1][refactor] Refactoring AscendFusedMoE (#1229) Jun 17, 2025

Yikun merged commit 733b0a2 into vllm-project:v0.9.1-dev Jun 17, 2025
16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[v0.9.1][refactor] Refactoring AscendFusedMoE (#1229) #1264

[v0.9.1][refactor] Refactoring AscendFusedMoE (#1229) #1264

Uh oh!

ganyi1996ppo commented Jun 17, 2025 •

edited by Yikun

Loading

Uh oh!

ganyi1996ppo commented Jun 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[v0.9.1][refactor] Refactoring AscendFusedMoE (#1229) #1264

[v0.9.1][refactor] Refactoring AscendFusedMoE (#1229) #1264

Uh oh!

Conversation

ganyi1996ppo commented Jun 17, 2025 • edited by Yikun Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

ganyi1996ppo commented Jun 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ganyi1996ppo commented Jun 17, 2025 •

edited by Yikun

Loading