add Deepseek-R1 tutorial. #4566
base: main
Conversation
Signed-off-by: Gongdayao <gongdayao@foxmail.com>
Code Review
This pull request adds a new tutorial for deploying the DeepSeek-R1 model. The tutorial is comprehensive, covering environment setup, deployment on A2 and A3 series hardware, functional verification, and performance/accuracy evaluation. However, I've found a few critical issues in the documentation that could prevent users from successfully following the steps. These include a typo in a command-line argument, incomplete installation instructions, incorrect markdown syntax, and inconsistent model naming. Addressing these issues will significantly improve the quality and usability of the tutorial.
docs/source/tutorials/DeepSeek-R1.md
Outdated
```
--host 0.0.0.0 \
--port 8000 \
--data-parallel-size 4 \
--data-parallel-size_local 2 \
```
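For orientation, the data-parallel options above would sit in a full serve invocation roughly like the sketch below. Upstream vLLM spells such flags with hyphens (`--data-parallel-size-local`, not `--data-parallel-size_local`), and the model path here is a placeholder, not a path from the tutorial:

```shell
# Hypothetical full command; flag spellings follow upstream vLLM conventions,
# and /path/to/DeepSeek-R1-W8A8 is a placeholder for the local model directory.
vllm serve /path/to/DeepSeek-R1-W8A8 \
  --host 0.0.0.0 \
  --port 8000 \
  --data-parallel-size 4 \
  --data-parallel-size-local 2
```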
```
## Introduction

DeepSeek-R1 is a high-performance Mixture-of-Experts (MoE) large language model developed by DeepSeek Company. It excels in complex logical reasoning, mathematical problem-solving, and code generation. By dynamically activating its expert networks, it delivers exceptional performance while maintaining computational efficiency. Building upon R1, DeepSeek-R1-W8A8 is a fully quantized version of the model. It employs 8-bit integer (INT8) quantization for both weights and activations, which significantly reduces the model's memory footprint and computational requirements, enabling more efficient deployment and application in resource-constrained environments.

This article takes the deepseek- R1-w8a8 version as an example to introduce the deployment of the R1 series models.
```
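To make the W8A8 idea in the quoted introduction concrete, here is a minimal NumPy sketch of symmetric per-tensor INT8 quantization. This is only an illustration of the storage saving; the real DeepSeek-R1-W8A8 pipeline uses calibrated, per-channel scales and fused INT8 kernels:

```python
import numpy as np

def quantize_int8(x: np.ndarray):
    """Symmetric per-tensor INT8 quantization: map floats into [-127, 127]."""
    scale = float(np.max(np.abs(x))) / 127.0  # one scale for the whole tensor
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate floats from the INT8 codes."""
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.0, 0.25, 0.0], dtype=np.float32)
q, s = quantize_int8(w)
w_hat = dequantize(q, s)
# INT8 stores 1 byte per value vs 4 for FP32 (4x smaller), and the
# round-trip error is bounded by half a quantization step (scale / 2).
```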
The model name deepseek- R1-w8a8 is used here (with a space and lowercase w). However, the vllm serve commands (e.g., line 88) and the official model download link use DeepSeek-R1-W8A8 with an uppercase W. This inconsistency is present throughout the document and can lead to 'file not found' errors on case-sensitive filesystems. Please use a consistent naming convention, preferably DeepSeek-R1-W8A8.
```
- Install `vllm-ascend` from source, refer to [installation](../installation.md).

- Install extra operator for supporting `DeepSeek-R1-w8a8`, refer to the above tab.
```
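The from-source install step being reviewed typically looks like the sketch below; the linked installation guide is authoritative for the supported procedure, and the repository URL is the project's public repo:

```shell
# Sketch only -- follow docs/source/installation.md for the supported steps
# (CANN toolkit and other Ascend prerequisites are not shown here).
git clone https://github.com/vllm-project/vllm-ascend.git
cd vllm-ascend
pip install -e .
```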
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.
What this PR does / why we need it?
Does this PR introduce any user-facing change?
How was this patch tested?