-
Notifications
You must be signed in to change notification settings - Fork 528
[Bugfix] Fix deepseek percision issue and add acc ci for it #905
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
vllm_ascend/ops/fused_moe.py
Outdated
| num_experts = w1.shape[0] | ||
| dtype = hidden_states.dtype | ||
| device = hidden_states.device | ||
| topk_weights = topk_weights.to(dtype) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
cb2ee7d to
0067a94
Compare
a57bdd4 to
2c38964
Compare
9f976dd to
d1a117a
Compare
ba23af4 to
988ab44
Compare
|
This pull request has conflicts, please resolve those before we can evaluate the pull request. |
Signed-off-by: MengqingCao <cmq0113@163.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's merge this first, if e2e CI passed to make main CI recover.
And address comments and recover long term CI in sperate PR.
| matrix: | ||
| os: [linux-arm64-npu-1, linux-arm64-npu-4] | ||
| vllm_version: [main, v0.9.0] | ||
| concurrency: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
seems not work, pls try to change global concurrency
| pytest -sv tests/long_term/spec_decode --ignore=tests/long_term/spec_decode/e2e/test_mtp_correctness.py --ignore=tests/long_term/spec_decode/e2e/test_v1_spec_decode.py --ignore=tests/long_term/spec_decode/e2e/test_v1_mtp_correctness.py | ||
| pytest -sv tests/long_term/test_accuracy.py | ||
| else | ||
| VLLM_USE_MODELSCOPE=True pytest -sv tests/long_term/test_deepseek_v2_lite_tp2_accuracy.py |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this could move to multicard because gsm8k it's ok to run every PR.
| RTOL = 0.03 | ||
| # Baseline accuracy after VLLM optimization. | ||
| # FIXME: fix the accuracy issue | ||
| EXPECTED_VALUE = 0.000758150113722517 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This shoudl be fixed
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Got it, will do it in next pr
|
Merged to main to recover main CI |
…ject#905) Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
… for it (vllm-project#905) Merge branch wengang/cherry-pick-905 of git@code.alipay.com:Theta/vllm-ascend.git into dev-v0.9.0604 https://code.alipay.com/Theta/vllm-ascend/pull_requests/107 Reviewed-by: 子宏 <tanzhiqiang.tzq@antgroup.com> * [Bugfix] Fix deepseek percision issue and add acc ci for it (vllm-project#905)
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com>
…ject#905) ### What this PR does / why we need it? Fix deepseek percision issue on V0 and add acc ci for it Fixes vllm-project#1062 ### How was this patch tested? CI passed with new added test. Signed-off-by: MengqingCao <cmq0113@163.com>
What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes #1062
How was this patch tested?
CI passed with new added test.