[Feature] Add CustomQwen3MoeForCausalLM model #925

yiz-liu · 2025-05-22T06:35:01Z

Tweak packed_modules_mapping to support W8A8 weights.

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

…s_mapping Signed-off-by: Yizhou Liu <liu_yizhou@outlook.com>

yiz-liu · 2025-05-22T10:32:56Z

vllm_ascend/models/qwen3_moe.py

+        "experts":
+        ["experts.0.gate_proj", "experts.0.up_proj", "experts.0.down_proj"],


We have to add this, otherwise, weights exported by modelslim cannot be properly processed by vLLM.

wangxiyuan · 2025-05-22T10:36:01Z

I'm fine with this change for temporary quick fix. It's good to fix by modelslim for vllm upstream instead.
@22dimensions please check this issue with modelslim team. Thanks.
@MengqingCao for vllm upstream fix, we can take a try as well.

Tweak packed_modules_mapping to support W8A8 weights.  ### What this PR does / why we need it?  ### Does this PR introduce _any_ user-facing change?  ### How was this patch tested?  Signed-off-by: Yizhou Liu <liu_yizhou@outlook.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>

Tweak packed_modules_mapping to support W8A8 weights.  ### What this PR does / why we need it?  ### Does this PR introduce _any_ user-facing change?  ### How was this patch tested?  Signed-off-by: Yizhou Liu <liu_yizhou@outlook.com>

[Feature] Add CustomQwen3MoeForCausalLM model and tweak packed_module…

6fe32a7

…s_mapping Signed-off-by: Yizhou Liu <liu_yizhou@outlook.com>

yiz-liu force-pushed the feat-qwen3-quant branch from 457898c to 6fe32a7 Compare May 22, 2025 07:28

yiz-liu commented May 22, 2025

View reviewed changes

wangxiyuan added the ready read for review label May 22, 2025

wangxiyuan approved these changes May 22, 2025

View reviewed changes

ganyi1996ppo merged commit 17f05b1 into vllm-project:main May 23, 2025
16 checks passed

yiz-liu deleted the feat-qwen3-quant branch May 23, 2025 07:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature] Add CustomQwen3MoeForCausalLM model #925

[Feature] Add CustomQwen3MoeForCausalLM model #925

Uh oh!

yiz-liu commented May 22, 2025 •

edited

Loading

Uh oh!

yiz-liu May 22, 2025

Uh oh!

wangxiyuan commented May 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		"experts":
		["experts.0.gate_proj", "experts.0.up_proj", "experts.0.down_proj"],

[Feature] Add CustomQwen3MoeForCausalLM model #925

[Feature] Add CustomQwen3MoeForCausalLM model #925

Uh oh!

Conversation

yiz-liu commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

yiz-liu May 22, 2025

Choose a reason for hiding this comment

Uh oh!

wangxiyuan commented May 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yiz-liu commented May 22, 2025 •

edited

Loading