[Bugfix] fix env variable in dbo #1284
Conversation
It would be good to add the e2e test as well: https://github.com/vllm-project/vllm-ascend/blob/main/tests/e2e/multicard/test_torchair_graph_mode.py, with the env variable enabled.
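As a point of reference, here is a minimal, self-contained sketch of how such an e2e test typically enables a feature flag through the environment for the duration of one test. The variable name VLLM_ASCEND_ENABLE_DBO is an assumption inferred from context, not confirmed in this thread, and a trivial stub stands in for the real model run:

```python
import os
from unittest.mock import patch

# Hypothetical flag name; the real vllm-ascend variable may differ.
DBO_FLAG = "VLLM_ASCEND_ENABLE_DBO"


def dbo_enabled() -> bool:
    # Stand-in for the real inference path, which would read the flag
    # when deciding whether to use dual-batch overlap.
    return os.environ.get(DBO_FLAG) == "1"


def test_dbo_env_enabled() -> None:
    # patch.dict restores the environment when the block exits, so the
    # flag does not leak into other tests in the same process.
    with patch.dict(os.environ, {DBO_FLAG: "1"}):
        assert dbo_enabled()
    assert not dbo_enabled()
```

The real test would launch the model inside the `patch.dict` block (or set the variable in the CI workflow) instead of calling a stub.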
| "DeepseekV2ForCausalLM", | ||
| "vllm_ascend.models.deepseek_dbo:CustomDeepseekDBOForCausalLM") | ||
|
|
||
| ModelRegistry.register_model( |
Please add a UT here to check that DeepseekV3ForCausalLM is registered.
Thanks for your review!
Is it appropriate to add a check for the module_name and class_name of the registered dbo model here?
Thanks for your advice! We have added an e2e test for using dbo with the DeepSeek-V3 model here.
Update https://github.com/vllm-project/vllm-ascend/blob/main/.github/workflows/vllm_ascend_test.yaml#L334-L338 so that the test actually runs in CI.
    dtype = "half"
    sampling_params = SamplingParams(max_tokens=100, temperature=0.0)
    with VllmRunner(
            "deepseek-ai/DeepSeek-V3-Lite-base-latest-w8a8-dynamic",
What's this model? I can't find any public weight with this name. I think you can use vllm-ascend/DeepSeek-V3-Pruning instead.
It seems that the DeepSeek-V3-Pruning model has the following config: first_k_dense_replace = 3, num_hidden_layers = 2.
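If first_k_dense_replace has its usual DeepSeek meaning (the first k layers are dense and MoE starts at layer index k — an assumption here, not stated in the thread), that config would leave the pruned model with no MoE layers at all, so a dbo path that overlaps MoE work would never be exercised. A quick sanity check:

```python
# Config values quoted in the comment above (DeepSeek-V3-Pruning).
first_k_dense_replace = 3
num_hidden_layers = 2

# Assumed semantics: layer i uses MoE iff i >= first_k_dense_replace.
moe_layers = [i for i in range(num_hidden_layers)
              if i >= first_k_dense_replace]
print(moe_layers)  # empty list: no MoE layers, nothing for dbo to overlap
```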
vllm-project/vllm@799397e#diff-52422ec3f789d0cddb8ac608c8ab33c74839612d5b05fbc64fb76ebffe0889d2 (pooling_params)
Please rebase asap to make CI happy.
Signed-off-by: zhuohuan <zxdu1997@gmail.com>
This PR has been merged to 0.9.1-dev already. Let's merge it to main as well.
What this PR does / why we need it?
Fix the env variable in dbo to enable dbo in the DeepSeek-V3 model. Besides, we have fixed a known issue in deepseek-dbo.
Does this PR introduce any user-facing change?
No.
How was this patch tested?
This patch can be tested with the newly added e2e tests: tests/multicard/test_offline_inference_distributed.py.
It can be verified with pytest.