Skip to content

Conversation

@zxdukki
Copy link
Contributor

@zxdukki zxdukki commented Jun 18, 2025

What this PR does / why we need it?

Fix env variable in dbo to enable dbo in DeepSeek-V3 model. Besides, we have fixed an known issue in deepseek-dbo.

Does this PR introduce any user-facing change?

How was this patch tested?

This patch can be tested with newly added e2e tests: tests/multicard/test_offline_inference_distributed.py.
It can be verified with pytest.

@zxdukki zxdukki force-pushed the dev_dbo_fix_init branch from 583d280 to 0cc6f23 Compare June 18, 2025 11:58
@wangxiyuan
Copy link
Collaborator

"DeepseekV2ForCausalLM",
"vllm_ascend.models.deepseek_dbo:CustomDeepseekDBOForCausalLM")

ModelRegistry.register_model(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add a ut here to check DeepseekV3ForCausalLM is registered

Copy link
Contributor Author

@zxdukki zxdukki Jun 18, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add a ut here to check DeepseekV3ForCausalLM is registered

Thanks for your review!
Is it appropriate to add a check for the module_name and class_name of the registered dbo model here

@zxdukki
Copy link
Contributor Author

zxdukki commented Jun 18, 2025

it's good to add the e2e as well. https://github.com/vllm-project/vllm-ascend/blob/main/tests/e2e/multicard/test_torchair_graph_mode.py with the env enabled

Thanks for your advice! We have added an e2e test for using dbo in deepseekV3 model here.

wangxiyuan
wangxiyuan previously approved these changes Jun 19, 2025
@wangxiyuan wangxiyuan dismissed their stale review June 19, 2025 01:58

WRONG Test

@wangxiyuan
Copy link
Collaborator

dtype = "half"
sampling_params = SamplingParams(max_tokens=100, temperature=0.0)
with VllmRunner(
"deepseek-ai/DeepSeek-V3-Lite-base-latest-w8a8-dynamic",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what's this model? I can't found any weight named this in public. I think you can use vllm-ascend/DeepSeek-V3-Pruning instead

@zxdukki
Copy link
Contributor Author

zxdukki commented Jun 19, 2025

it seems that the DeepSeek-V3-Pruning models has the following config: first_k_dense_replace = 3, num_hidden_layers = 2;
(first_k_dense_replace > num_hidden_layers)
so the ut failed. We commit a fix to compatible with this kind of models.

@zxdukki
Copy link
Contributor Author

zxdukki commented Jun 19, 2025

it seems that the DeepSeek-V3-Pruning models has the following config: first_k_dense_replace = 3, num_hidden_layers = 2; (first_k_dense_replace > num_hidden_layers) so the ut failed. We commit a fix to compatible with this kind of models.

vllm-project/vllm@799397e#diff-52422ec3f789d0cddb8ac608c8ab33c74839612d5b05fbc64fb76ebffe0889d2

pooling_params

@wangxiyuan
Copy link
Collaborator

@zxdukki fixed here #1293

@wangxiyuan
Copy link
Collaborator

please rebase asap to make ci happy

@Yikun Yikun mentioned this pull request Jun 20, 2025
29 tasks
zxdukki added 4 commits June 20, 2025 21:12
Signed-off-by: zhuohuan <zxdu1997@gmail.com>
Signed-off-by: zhuohuan <zxdu1997@gmail.com>
Signed-off-by: zhuohuan <zxdu1997@gmail.com>
Signed-off-by: zhuohuan <zxdu1997@gmail.com>
@zxdukki zxdukki force-pushed the dev_dbo_fix_init branch from 4401b17 to 91bd15c Compare June 20, 2025 13:13
@wangxiyuan
Copy link
Collaborator

wangxiyuan commented Jun 23, 2025

this PR has been merged to 0.9.1-dev already. Let's merge this to main as well.

@wangxiyuan wangxiyuan merged commit f04c676 into vllm-project:main Jun 23, 2025
20 checks passed
chopper0126 pushed a commit to chopper0126/vllm-ascend that referenced this pull request Oct 16, 2025
### What this PR does / why we need it?
Fix env variable in dbo to enable dbo in DeepSeek-V3 model. Besides, we
have fixed an known issue in deepseek-dbo.


### Does this PR introduce _any_ user-facing change?
no

### How was this patch tested?
This patch can be tested with newly added e2e tests:
[tests/multicard/test_offline_inference_distributed.py](https://github.com/vllm-project/vllm-ascend/pull/1285/files#diff-7cd2e6b1bda6b8ad1bedb3276971fe7064aeae4dc0efd41c301c4ede2158c57e).
It can be verified with pytest.

---------

Signed-off-by: zhuohuan <zxdu1997@gmail.com>
Angazenn pushed a commit to Angazenn/vllm-ascend that referenced this pull request Oct 21, 2025
### What this PR does / why we need it?
Fix env variable in dbo to enable dbo in DeepSeek-V3 model. Besides, we
have fixed an known issue in deepseek-dbo.


### Does this PR introduce _any_ user-facing change?
no

### How was this patch tested?
This patch can be tested with newly added e2e tests:
[tests/multicard/test_offline_inference_distributed.py](https://github.com/vllm-project/vllm-ascend/pull/1285/files#diff-7cd2e6b1bda6b8ad1bedb3276971fe7064aeae4dc0efd41c301c4ede2158c57e).
It can be verified with pytest.

---------

Signed-off-by: zhuohuan <zxdu1997@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants