-
Notifications
You must be signed in to change notification settings - Fork 543
[Doc] Add Atlas 300I series tutorials #1341
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: leo-pony <nengjunma@outlook.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
310p.md ---> single_node_300i.md
docs/source/tutorials/310p.md
Outdated
| @@ -0,0 +1,312 @@ | |||
| # 310P (310I Duo) | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| # 310P (310I Duo) | |
| # Single Node (Atlas 300I series) |
docs/source/tutorials/310p.md
Outdated
| @@ -0,0 +1,312 @@ | |||
| # 310P (310I Duo) | |||
|
|
|||
| ## Run vllm-ascend with 310P | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| ## Run vllm-ascend with 310P | |
| ## Run vLLM on Altlas 300I series |
docs/source/tutorials/310p.md
Outdated
| -v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \ | ||
| -v /etc/ascend_install.info:/etc/ascend_install.info \ | ||
| -v /root/.cache:/root/.cache \ | ||
| -v /data/mnj/:/home/mnj/ \ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| -v /data/mnj/:/home/mnj/ \ |
Signed-off-by: leo-pony <nengjunma@outlook.com>
Signed-off-by: leo-pony <nengjunma@outlook.com>
Signed-off-by: leo-pony <nengjunma@outlook.com>
| ``` | ||
| :::: | ||
|
|
||
| ::::{tab-item} pangu-72b-MoE |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| ::::{tab-item} pangu-72b-MoE | |
| ::::{tab-item} Pangu-Pro-MoE-72B |
| ```{code-block} bash | ||
| :substitutions: | ||
| # Update the MODEL | ||
| export MODEL= |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| export MODEL= | |
| export MODEL=“/path/to/pangu-pro-moe-model” |
| ```{code-block} bash | ||
| :substitutions: | ||
| # Update the vllm-ascend image | ||
| export IMAGE=quay.io/ascend/vllm-ascend:main-310p |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| export IMAGE=quay.io/ascend/vllm-ascend:main-310p | |
| export IMAGE=quay.io/ascend/vllm-ascend:|vllm_ascend_version|-310p |
Signed-off-by: leo-pony <nengjunma@outlook.com>
Signed-off-by: leo-pony <nengjunma@outlook.com>
| gpu_memory_utilization=0.5, | ||
| max_num_seqs=4, | ||
| enforce_eager=True, | ||
| enforce_eager=True, # For ascend 310P eager mode, only single-operator model is supported. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| enforce_eager=True, # For ascend 310P eager mode, only single-operator model is supported. | |
| enforce_eager=True, # For 300I series, only eager mode is supported. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fix
Signed-off-by: leo-pony <nengjunma@outlook.com>
Signed-off-by: leo-pony <nengjunma@outlook.com>
### What this PR does / why we need it? Bump v0.9.1rc1 release Closes: #1341 Closes: #1334 ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com> Signed-off-by: Yikun Jiang <yikunkero@gmail.com> Signed-off-by: leo-pony <nengjunma@outlook.com> Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com> Co-authored-by: leo-pony <nengjunma@outlook.com> Co-authored-by: shen-shanshan <467638484@qq.com>
### What this PR does / why we need it? Bump v0.9.1rc1 release Closes: vllm-project#1341 Closes: vllm-project#1334 ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com> Signed-off-by: Yikun Jiang <yikunkero@gmail.com> Signed-off-by: leo-pony <nengjunma@outlook.com> Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com> Co-authored-by: leo-pony <nengjunma@outlook.com> Co-authored-by: shen-shanshan <467638484@qq.com>
### What this PR does / why we need it? Bump v0.9.1rc1 release Closes: vllm-project#1341 Closes: vllm-project#1334 ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com> Signed-off-by: Yikun Jiang <yikunkero@gmail.com> Signed-off-by: leo-pony <nengjunma@outlook.com> Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com> Co-authored-by: leo-pony <nengjunma@outlook.com> Co-authored-by: shen-shanshan <467638484@qq.com>
What this PR does / why we need it?
Add 310P tutorials
Does this PR introduce any user-facing change?
No Changes.