Skip to content

Conversation

@shen-shanshan
Copy link
Collaborator

@shen-shanshan shen-shanshan commented Feb 12, 2025

What this PR does / why we need it?

  1. Add vllm-ascend tutorial doc for Qwen/Qwen2.5-7B-Instruct model serving doc
  2. fix format of files in docs dir, e.g. format tables, add underline for links, add line feed...

Does this PR introduce any user-facing change?

no.

How was this patch tested?

no.

@shen-shanshan shen-shanshan marked this pull request as draft February 12, 2025 09:04
@shen-shanshan
Copy link
Collaborator Author

cc:

@Yikun @wangxiyuan @MengqingCao

@wangxiyuan
Copy link
Collaborator

No need to update installation and quick start doc. They will be updated in new PR.

@shen-shanshan
Copy link
Collaborator Author

No need to update installation and quick start doc. They will be updated in new PR.

ok.

Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com>
Copy link
Collaborator

@Yikun Yikun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall, it has been greatly improved compared to the previous version, thank you!

```bash
# Use Modelscope mirror to speed up model download
export VLLM_USE_MODELSCOPE=True
export MODELSCOPE_CACHE=/root/models/
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
export MODELSCOPE_CACHE=/root/models/

you can use default cache -v /root/.cache:/root/.cache

-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
-v /etc/ascend_install.info:/etc/ascend_install.info \
-v /root/models:/root/models \
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
-v /root/models:/root/models \
-v /root/.cache:/root/.cache \

-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
-v /etc/ascend_install.info:/etc/ascend_install.info \
-v /root/models:/root/models \
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
-v /root/models:/root/models \
-v /root/.cache:/root/.cache \

-v /root/models:/root/models \
-p 8000:8000 \
-e VLLM_USE_MODELSCOPE=True \
-e MODELSCOPE_CACHE=/root/models/ \
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
-e MODELSCOPE_CACHE=/root/models/ \

-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
-v /etc/ascend_install.info:/etc/ascend_install.info \
-v /root/models:/root/models \
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
-v /root/models:/root/models \
-v /root/.cache:/root/.cache \

```bash
# Use Modelscope mirror to speed up model download
export VLLM_USE_MODELSCOPE=True
export MODELSCOPE_CACHE=/root/models/
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
export MODELSCOPE_CACHE=/root/models/

Comment on lines 160 to 164
def clean_up():
destroy_model_parallel()
destroy_distributed_environment()
gc.collect()
torch.npu.empty_cache()
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like a little bit wired, would you mind taking a look? @wangxiyuan

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

since this is only a simple example, no need to do

del llm
clean_up()

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

When we using mp as distributed_executor_backend, the clean up must be done by hand, otherwise will raise error when exiting process. This is a bug in vLLM.

Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com>
@Yikun
Copy link
Collaborator

Yikun commented Feb 17, 2025

After this PR merged, pls also backport this to v0.7.1 branch.

@Yikun Yikun mentioned this pull request Feb 17, 2025
@wangxiyuan wangxiyuan merged commit 2a67814 into vllm-project:main Feb 17, 2025
3 checks passed
ttanzhiqiang pushed a commit to ttanzhiqiang/vllm-ascend that referenced this pull request Apr 27, 2025
### What this PR does / why we need it?
1. Add vllm-ascend tutorial doc for Qwen/Qwen2.5-7B-Instruct model
serving doc
2. fix format of files in `docs` dir, e.g. format tables, add underline
for links, add line feed...

### Does this PR introduce _any_ user-facing change?
<!--
Note that it means *any* user-facing change including all aspects such
as API, interface or other behavior changes.
Documentation-only updates are not considered user-facing changes.
-->

no.

### How was this patch tested?
doc CI passed

---------

Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com>
ZhengWG pushed a commit to ZhengWG/vllm-ascend that referenced this pull request Jun 18, 2025
fix bugs caused by variable name old_placemet
offline893 pushed a commit to offline893/vllm-ascend that referenced this pull request Sep 9, 2025
fix bugs caused by variable name old_placemet
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants