Skip to content

Conversation

@shikang-hangzhou
Copy link

@shikang-hangzhou shikang-hangzhou commented Jun 20, 2025

What this PR does / why we need it?

Fix accuracy problem after MOE refactor and make inference flow better.

Does this PR introduce any user-facing change?

None

How was this patch tested?

e2e test in tests/e2e/multicard/test_offline_inference_distributed.py

@shikang-hangzhou shikang-hangzhou changed the title [BugFix]fix accuracy in dbo after refact MOE [BugFix]fix accuracy in dbo after refactor MOE Jun 20, 2025
Signed-off-by: shikang-hangzhou <459956190@qq.com>
@shikang-hangzhou shikang-hangzhou changed the title [BugFix]fix accuracy in dbo after refactor MOE [0.9.1][BugFix]fix accuracy in dbo after refactor MOE Jun 21, 2025
@wangxiyuan wangxiyuan merged commit 822de15 into vllm-project:v0.9.1-dev Jun 21, 2025
16 checks passed
@Yikun Yikun added the no-main label Jul 7, 2025
22dimensions pushed a commit to 22dimensions/vllm-ascend that referenced this pull request Jul 22, 2025
…llm-project#1328)

Fix accuracy problem after MOE refactor and make inference flow better.
None
e2e test in `tests/e2e/multicard/test_offline_inference_distributed.py`

Signed-off-by: shikang-hangzhou <459956190@qq.com>
22dimensions pushed a commit to 22dimensions/vllm-ascend that referenced this pull request Jul 22, 2025
…m-project#1420 vllm-project#1328 from v0.9.1-dev to main

Signed-off-by: 22dimensions <waitingwind@foxmail.com>
22dimensions pushed a commit to 22dimensions/vllm-ascend that referenced this pull request Jul 23, 2025
…m-project#1420 vllm-project#1328 from v0.9.1-dev to main

Signed-off-by: 22dimensions <waitingwind@foxmail.com>
venus-taibai pushed a commit to venus-taibai/vllm-ascend that referenced this pull request Sep 17, 2025
Merge branch zxdu/dev-v0.9.1.0622-dbo-prefill of git@code.alipay.com:Theta/vllm-ascend.git into dev-v0.9.1.0622
https://code.alipay.com/Theta/vllm-ascend/pull_requests/187

Reviewed-by: 沧濯 <zhengshoujian.zsj@antgroup.com>


* [0.9.1][Bugfix] fix dp error in dbo (vllm-project#1291)
* [0.9.1][BugFix]fix accuracy in dbo after refactor MOE (vllm-project#1328)
* [feat]: sync with deepseekv2
* [feat]: support chunked prefill split for dbo
* [fix]: overlap shared experts with post layernorm
* support fused_moe_allgather_ep
* [feat]: dbo support allgather ep
* [fix]: revert changes to format
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants