
Commit caaf6c9

[v0.9.1][bugfix] fix deepseek with mc2 (#1269)
### What this PR does / why we need it?

Fix DeepSeek with MC2: pass `shared_experts` into the fused MoE call only when torchair graph mode and multistream MoE are both enabled and the request is not in the prefill phase; otherwise pass `None` so the shared experts run on the regular path.

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

Signed-off-by: zzzzwwjj <1183291235@qq.com>
1 parent 733b0a2 commit caaf6c9

File tree

1 file changed: +2 −1 lines changed


vllm_ascend/ops/fused_moe.py

Lines changed: 2 additions & 1 deletion
```diff
@@ -1186,7 +1186,8 @@ def forward(self,
             enable_force_load_balance=enable_force_load_balance,
             log2phy=self.log2phy,
             global_redundant_expert_num=self.global_redundant_expert_num,
-            shared_experts=shared_experts,
+            shared_experts=shared_experts if self.torchair_graph_enabled
+            and self.enable_multistream_moe and not is_prefill else None,
         )

         if shared_experts:
```
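To make the gating explicit, here is a minimal sketch of the condition this fix applies. The helper name `select_shared_experts` is hypothetical and not part of vllm-ascend; it only illustrates the ternary in the diff above.

```python
# Hypothetical sketch (not the actual vllm-ascend code) of the gating
# pattern introduced by this fix: shared experts are handed to the fused
# MoE op only on the torchair-graph multistream decode path; in every
# other case the caller gets None and runs shared experts separately.
from typing import Optional


def select_shared_experts(shared_experts: Optional[object],
                          torchair_graph_enabled: bool,
                          enable_multistream_moe: bool,
                          is_prefill: bool) -> Optional[object]:
    """Return shared_experts only when the fused multistream path applies."""
    if torchair_graph_enabled and enable_multistream_moe and not is_prefill:
        # Decode step under the torchair graph with multistream MoE:
        # the fused op can overlap shared-expert compute with MC2.
        return shared_experts
    # Prefill, eager mode, or multistream disabled: fall back to running
    # the shared experts on the normal (non-fused) path.
    return None
```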
