[BugFix] Fix the bug that qwen3 moe doesn't work with aclgraph #2183
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.
Depends on #2181.
Codecov Report
❌ Patch coverage is
Additional details and impacted files

Coverage Diff
              main    #2183    +/-
Coverage    76.34%   76.34%
Files          110      110
Lines        12473    12474     +1
Hits          9522     9523     +1
Misses        2951     2951
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
…project#2183)
What this PR does:
1. Move AscendSparseMoeBlock to the qwen3 model, since only the qwen3 model uses it.
2. Disable AscendSparseMoeBlock when aclgraph is enabled, because AscendSparseMoeBlock does not currently work with aclgraph.
- vLLM version: v0.10.0
- vLLM main: vllm-project/vllm@cdfd687
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
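The second point of the commit message, falling back to a graph-compatible block when aclgraph is on, might look roughly like the sketch below. This is not the code from the PR: `RunConfig`, `use_aclgraph`, `DefaultSparseMoeBlock`, and `select_moe_block` are hypothetical names invented for illustration, and the `AscendSparseMoeBlock` class here is only a stand-in that shares the real class's name.

```python
from dataclasses import dataclass


@dataclass
class RunConfig:
    # Hypothetical flag: stands in for however the runtime
    # reports that aclgraph (graph mode) is enabled.
    use_aclgraph: bool


class DefaultSparseMoeBlock:
    """Stand-in for a generic MoE block, assumed here to be graph-compatible."""
    name = "default"


class AscendSparseMoeBlock:
    """Stand-in for the Ascend-specific block, which does not work with aclgraph."""
    name = "ascend"


def select_moe_block(config: RunConfig):
    # Use the Ascend-specific block only when graph mode is off;
    # otherwise fall back to the default block.
    if config.use_aclgraph:
        return DefaultSparseMoeBlock()
    return AscendSparseMoeBlock()


if __name__ == "__main__":
    print(select_moe_block(RunConfig(use_aclgraph=True)).name)
    print(select_moe_block(RunConfig(use_aclgraph=False)).name)
```

Keeping the choice in one small function means the rest of the model code does not need to know which block is in use; how the actual change wires this into the qwen3 model is not shown in this conversation.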
What this PR does: