Refactormc2 #2164

momo609 · 2025-08-01T04:35:16Z

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.10.0
vLLM main: vllm-project/vllm@a353bd0

github-actions · 2025-08-01T05:32:11Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

ApsarasX · 2025-08-02T01:36:51Z

vllm_ascend/ops/fused_moe.py

    hidden_states = torch.cat(hidden_states, dim=0)
    return hidden_states

+    w1 = w1.transpose(1, 2)


Duplicate code?

github-actions · 2025-08-12T13:16:16Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

realliujiaxu · 2025-08-26T07:15:55Z

vllm_ascend/ops/moe_dispatcher/token_dispatcher.py

+
+        return final_hidden_states
+
+class QuantizedTokenDispatcherWithAllGather(MoETokenDispatcher):


fused_experts_with_allgather is not transferred here. when expert_map is not None, this is really slow

github-actions bot added the module:ops label Aug 1, 2025

ApsarasX requested changes Aug 2, 2025

View reviewed changes

wangxiaoxin-sherie added 4 commits August 8, 2025 14:13

x

c2f34d4

add mc2 refactor.

8905fda

xx

e4bc036

xx

6b090a0

momo609 force-pushed the refactormc2 branch from f8f09df to 6b090a0 Compare August 8, 2025 06:14

wangxiaoxin-sherie added 2 commits August 8, 2025 15:05

xx

ba92040

xx

c315cb5

github-actions bot added module:core merge-conflicts labels Aug 12, 2025

shiyuan680 mentioned this pull request Aug 13, 2025

[RFC]: Refactoring fused_moe #2321

Open

zz

f62989f

github-actions bot added the module:tests label Aug 14, 2025

realliujiaxu reviewed Aug 26, 2025

View reviewed changes

momo609 closed this Sep 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactormc2 #2164

Refactormc2 #2164

Uh oh!

momo609 commented Aug 1, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Aug 1, 2025

Uh oh!

ApsarasX Aug 2, 2025

Uh oh!

github-actions bot commented Aug 12, 2025

Uh oh!

realliujiaxu Aug 26, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		return final_hidden_states

		class QuantizedTokenDispatcherWithAllGather(MoETokenDispatcher):

Refactormc2 #2164

Refactormc2 #2164

Uh oh!

Conversation

momo609 commented Aug 1, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

github-actions bot commented Aug 1, 2025

Uh oh!

ApsarasX Aug 2, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Aug 12, 2025

Uh oh!

realliujiaxu Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

momo609 commented Aug 1, 2025 •

edited by github-actions bot

Loading

realliujiaxu Aug 26, 2025 •

edited

Loading