[Doc] Add vllm-ascend usage doc & fix doc format #53

shen-shanshan · 2025-02-12T09:03:52Z

What this PR does / why we need it?

Add vllm-ascend tutorial doc for Qwen/Qwen2.5-7B-Instruct model serving doc
fix format of files in docs dir, e.g. format tables, add underline for links, add line feed...

Does this PR introduce any user-facing change?

no.

How was this patch tested?

no.

shen-shanshan · 2025-02-12T09:06:35Z

cc:

@Yikun @wangxiyuan @MengqingCao

wangxiyuan · 2025-02-12T09:15:32Z

No need to update installation and quick start doc. They will be updated in new PR.

shen-shanshan · 2025-02-12T09:18:51Z

No need to update installation and quick start doc. They will be updated in new PR.

ok.

docs/source/index.md

docs/source/installation.md

docs/source/quick_start.md

docs/source/running_vllm_with_ascend.md

Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com>

Yikun

Overall, it has been greatly improved compared to the previous version, thank you!

Yikun · 2025-02-14T12:45:36Z

docs/source/tutorials.md

+```bash
+# Use Modelscope mirror to speed up model download
+export VLLM_USE_MODELSCOPE=True
+export MODELSCOPE_CACHE=/root/models/


Suggested change

export MODELSCOPE_CACHE=/root/models/

you can use default cache -v /root/.cache:/root/.cache

Yikun · 2025-02-14T12:52:23Z

docs/source/tutorials.md

+-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
+-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
+-v /etc/ascend_install.info:/etc/ascend_install.info \
+-v /root/models:/root/models \


Suggested change

-v /root/models:/root/models \

-v /root/.cache:/root/.cache \

Yikun · 2025-02-14T12:52:46Z

docs/source/tutorials.md

+-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
+-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
+-v /etc/ascend_install.info:/etc/ascend_install.info \
+-v /root/models:/root/models \


Suggested change

-v /root/models:/root/models \

-v /root/.cache:/root/.cache \

Yikun · 2025-02-14T12:53:05Z

docs/source/tutorials.md

+-v /root/models:/root/models \
+-p 8000:8000 \
+-e VLLM_USE_MODELSCOPE=True \
+-e MODELSCOPE_CACHE=/root/models/ \


Suggested change

-e MODELSCOPE_CACHE=/root/models/ \

Yikun · 2025-02-14T12:53:35Z

docs/source/tutorials.md

+-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
+-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
+-v /etc/ascend_install.info:/etc/ascend_install.info \
+-v /root/models:/root/models \


Suggested change

-v /root/models:/root/models \

-v /root/.cache:/root/.cache \

Yikun · 2025-02-14T12:53:43Z

docs/source/tutorials.md

+```bash
+# Use Modelscope mirror to speed up model download
+export VLLM_USE_MODELSCOPE=True
+export MODELSCOPE_CACHE=/root/models/


Suggested change

export MODELSCOPE_CACHE=/root/models/

Yikun · 2025-02-14T12:54:20Z

docs/source/tutorials.md

+def clean_up():
+    destroy_model_parallel()
+    destroy_distributed_environment()
+    gc.collect()
+    torch.npu.empty_cache()


Looks like a little bit wired, would you mind taking a look? @wangxiyuan

since this is only a simple example, no need to do

del llm clean_up()

When we using mp as distributed_executor_backend, the clean up must be done by hand, otherwise will raise error when exiting process. This is a bug in vLLM.

Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com>

Yikun · 2025-02-17T06:35:08Z

After this PR merged, pls also backport this to v0.7.1 branch.

### What this PR does / why we need it? 1. Add vllm-ascend tutorial doc for Qwen/Qwen2.5-7B-Instruct model serving doc 2. fix format of files in `docs` dir, e.g. format tables, add underline for links, add line feed... ### Does this PR introduce _any_ user-facing change?  no. ### How was this patch tested? doc CI passed --------- Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com>

fix bugs caused by variable name old_placemet

shen-shanshan marked this pull request as draft February 12, 2025 09:04

shen-shanshan force-pushed the doc branch from c71743c to ebc859c Compare February 13, 2025 12:04

shen-shanshan marked this pull request as ready for review February 13, 2025 12:04

shen-shanshan force-pushed the doc branch from ebc859c to f0816bf Compare February 14, 2025 02:53

Yikun reviewed Feb 14, 2025

View reviewed changes

MengqingCao reviewed Feb 14, 2025

View reviewed changes

docs/source/running_vllm_with_ascend.md Outdated Show resolved Hide resolved

Yikun reviewed Feb 14, 2025

View reviewed changes

docs/source/running_vllm_with_ascend.md Outdated Show resolved Hide resolved

add vllm-ascend tutorials

76fcb75

Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com>

shen-shanshan force-pushed the doc branch from cbebd7b to 76fcb75 Compare February 14, 2025 10:40

Yikun approved these changes Feb 14, 2025

View reviewed changes

update tutorials

8287ea8

Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com>

Yikun mentioned this pull request Feb 17, 2025

[v0.7.1rc1] FAQ & Feedback #19

Closed

wangxiyuan merged commit 2a67814 into vllm-project:main Feb 17, 2025
3 checks passed

ZhengWG pushed a commit to ZhengWG/vllm-ascend that referenced this pull request Jun 18, 2025

Merge pull request vllm-project#53 from raindaywhu/dev_whq_eplb1

60c87b0

fix bugs caused by variable name old_placemet

offline893 pushed a commit to offline893/vllm-ascend that referenced this pull request Sep 9, 2025

Merge pull request vllm-project#53 from raindaywhu/dev_whq_eplb1

339d3e8

fix bugs caused by variable name old_placemet

	-v /root/models:/root/models \
	-v /root/.cache:/root/.cache \

[Doc] Add vllm-ascend usage doc & fix doc format #53

[Doc] Add vllm-ascend usage doc & fix doc format #53

Uh oh!

Conversation

shen-shanshan commented Feb 12, 2025 • edited by Yikun Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

shen-shanshan commented Feb 12, 2025

Uh oh!

wangxiyuan commented Feb 12, 2025

Uh oh!

shen-shanshan commented Feb 12, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Yikun left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Yikun commented Feb 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

shen-shanshan commented Feb 12, 2025 •

edited by Yikun

Loading

Yikun commented Feb 17, 2025 •

edited

Loading