Skip to content

Conversation

@MengqingCao
Copy link
Collaborator

@MengqingCao MengqingCao commented May 20, 2025

What this PR does / why we need it?

Fix deepseek percision issue on V0 and add acc ci for it
Fixes #1062

How was this patch tested?

CI passed with new added test.

@MengqingCao MengqingCao marked this pull request as ready for review May 20, 2025 06:29
num_experts = w1.shape[0]
dtype = hidden_states.dtype
device = hidden_states.device
topk_weights = topk_weights.to(dtype)
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actually this fix is done by @whx-sjtu as a part in #863, but I think it should be merge quickly instead of blocking in #863

@MengqingCao MengqingCao force-pushed the dsfix branch 3 times, most recently from 9f976dd to d1a117a Compare May 30, 2025 03:13
@MengqingCao MengqingCao added long-term-test enable long term test for PR ready-for-test start test by label for PR labels May 30, 2025
@MengqingCao MengqingCao added ready-for-test start test by label for PR and removed ready-for-test start test by label for PR labels Jun 4, 2025
@wangxiyuan wangxiyuan mentioned this pull request Jun 4, 2025
76 tasks
@MengqingCao MengqingCao changed the title [Bugfix] Fix deepseek V0 percision issue and add acc ci for it [Bugfix] Fix deepseek percision issue and add acc ci for it Jun 4, 2025
@MengqingCao MengqingCao force-pushed the dsfix branch 2 times, most recently from ba23af4 to 988ab44 Compare June 4, 2025 08:42
@github-actions
Copy link

github-actions bot commented Jun 4, 2025

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Signed-off-by: MengqingCao <cmq0113@163.com>
@Yikun Yikun added ready-for-test start test by label for PR and removed ready-for-test start test by label for PR labels Jun 4, 2025
@MengqingCao MengqingCao added ready-for-test start test by label for PR and removed ready-for-test start test by label for PR labels Jun 4, 2025
Copy link
Collaborator

@Yikun Yikun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's merge this first, if e2e CI passed to make main CI recover.

And address comments and recover long term CI in sperate PR.

matrix:
os: [linux-arm64-npu-1, linux-arm64-npu-4]
vllm_version: [main, v0.9.0]
concurrency:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

seems not work, pls try to change global concurrency

pytest -sv tests/long_term/spec_decode --ignore=tests/long_term/spec_decode/e2e/test_mtp_correctness.py --ignore=tests/long_term/spec_decode/e2e/test_v1_spec_decode.py --ignore=tests/long_term/spec_decode/e2e/test_v1_mtp_correctness.py
pytest -sv tests/long_term/test_accuracy.py
else
VLLM_USE_MODELSCOPE=True pytest -sv tests/long_term/test_deepseek_v2_lite_tp2_accuracy.py
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this could move to multicard because gsm8k it's ok to run every PR.

RTOL = 0.03
# Baseline accuracy after VLLM optimization.
# FIXME: fix the accuracy issue
EXPECTED_VALUE = 0.000758150113722517
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This shoudl be fixed

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Got it, will do it in next pr

@Yikun Yikun merged commit afc4c0c into vllm-project:main Jun 4, 2025
26 of 30 checks passed
@Yikun
Copy link
Collaborator

Yikun commented Jun 4, 2025

Merged to main to recover main CI

@MengqingCao MengqingCao deleted the dsfix branch June 4, 2025 12:33
wanghanqingLYT pushed a commit to raindaywhu/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
weijinqian0 pushed a commit to weijinqian0/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
weijinqian0 pushed a commit to weijinqian0/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
weijinqian0 pushed a commit to weijinqian0/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
weijinqian0 pushed a commit to weijinqian0/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
weijinqian0 pushed a commit to weijinqian0/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: weijinqian_v1 <weijinqian@huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 4, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 5, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 5, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 5, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 5, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request Jun 5, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>
venus-taibai pushed a commit to venus-taibai/vllm-ascend that referenced this pull request Jun 18, 2025
… for it (vllm-project#905)

Merge branch wengang/cherry-pick-905 of git@code.alipay.com:Theta/vllm-ascend.git into dev-v0.9.0604
https://code.alipay.com/Theta/vllm-ascend/pull_requests/107

Reviewed-by: 子宏 <tanzhiqiang.tzq@antgroup.com>


* [Bugfix] Fix deepseek percision issue and add acc ci for it (vllm-project#905)
chopper0126 pushed a commit to chopper0126/vllm-ascend that referenced this pull request Oct 16, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Angazenn pushed a commit to Angazenn/vllm-ascend that referenced this pull request Oct 21, 2025
…ject#905)

### What this PR does / why we need it?
Fix deepseek percision issue on V0 and add acc ci for it
Fixes vllm-project#1062
### How was this patch tested?
CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: deepseek-v2-lite offline failed to run

4 participants