Commit 724c589

[v0.7.3] Upgrade CANN version to 8.1.rc1 (#746)
### What this PR does / why we need it?
Backport: #747

Upgrade CANN version to 8.1.rc1

### Does this PR introduce _any_ user-facing change?
no

### How was this patch tested?
CI passed

Co-authored-by: MengqingCao <cmq0113@163.com>
Signed-off-by: Yikun Jiang <yikunkero@gmail.com>
1 parent d491957 commit 724c589

8 files changed: +23 -23 lines changed

.github/workflows/vllm_ascend_test.yaml

Lines changed: 2 additions & 1 deletion
@@ -45,7 +45,8 @@ jobs:
     name: vLLM Ascend test v0.7.3-dev
     runs-on: linux-arm64-npu-1
     container:
-      image: quay.io/ascend/cann:8.0.0-910b-ubuntu22.04-py3.10
+      # TODO(yikun): Remove m.daocloud.io prefix when infra proxy ready
+      image: m.daocloud.io/quay.io/ascend/cann:8.1.rc1-910b-ubuntu22.04-py3.10
       env:
         HF_ENDPOINT: https://hf-mirror.com
         HF_TOKEN: ${{ secrets.HF_TOKEN }}
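
The workflow change above routes the image pull through a mirror by prepending a host to the unchanged image reference. A minimal Python sketch of that composition follows. The helper name `cann_image_ref` and the `mirror` parameter are hypothetical and not part of the repository; only the image path, tag, and mirror host come from the diff.

```python
from typing import Optional

# Repository path exactly as it appears in the diff.
REGISTRY_IMAGE = "quay.io/ascend/cann"


def cann_image_ref(tag: str, mirror: Optional[str] = None) -> str:
    """Return the image reference, optionally routed through a mirror host.

    Hypothetical helper: the workflow itself simply hard-codes the string.
    """
    ref = f"{REGISTRY_IMAGE}:{tag}"
    # The mirror is applied by prepending its host, as with m.daocloud.io in the diff.
    return f"{mirror}/{ref}" if mirror else ref


if __name__ == "__main__":
    print(cann_image_ref("8.1.rc1-910b-ubuntu22.04-py3.10", mirror="m.daocloud.io"))
```

Under this sketch, dropping the `mirror` argument yields the plain `quay.io` reference again, which is what the TODO in the workflow anticipates once the infra proxy is ready.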

Dockerfile

Lines changed: 1 addition & 1 deletion
@@ -15,7 +15,7 @@
 # limitations under the License.
 #

-FROM quay.io/ascend/cann:8.0.0-910b-ubuntu22.04-py3.10
+FROM quay.io/ascend/cann:8.1.rc1-910b-ubuntu22.04-py3.10

 ARG PIP_INDEX_URL="https://mirrors.tuna.tsinghua.edu.cn/pypi/web/simple"

Dockerfile.openEuler

Lines changed: 3 additions & 3 deletions
@@ -15,7 +15,7 @@
 # This file is a part of the vllm-ascend project.
 #

-FROM quay.io/ascend/cann:8.0.0-910b-openeuler22.03-py3.10
+FROM quay.io/ascend/cann:8.1.rc1-910b-openeuler22.03-py3.10

 ARG PIP_INDEX_URL="https://mirrors.tuna.tsinghua.edu.cn/pypi/web/simple"

@@ -25,7 +25,7 @@ RUN yum update -y && \

 WORKDIR /workspace

-COPY . /vllm-workspace/vllm-ascend/
+COPY . /vllm-workspace/vllm-ascend/

 RUN pip config set global.index-url ${PIP_INDEX_URL}

@@ -45,4 +45,4 @@ RUN python3 -m pip install -e /vllm-workspace/vllm-ascend/ --extra-index https:/
 RUN python3 -m pip install "modelscope<1.23.0" ray && \
     python3 -m pip cache purge

-CMD ["/bin/bash"]
+CMD ["/bin/bash"]

README.md

Lines changed: 1 addition & 1 deletion
@@ -35,7 +35,7 @@ By using vLLM Ascend plugin, popular open-source models, including Transformer-l
 - Hardware: Atlas 800I A2 Inference series, Atlas A2 Training series
 - Software:
   * Python >= 3.9, < 3.12
-  * CANN >= 8.0.0
+  * CANN >= 8.1.rc1
   * PyTorch >= 2.5.1, torch-npu >= 2.5.1
   * vLLM (the same version as vllm-ascend)
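
The requirement `CANN >= 8.1.rc1` mixes release-candidate and final version strings, so a plain string comparison would not order them correctly. The sketch below shows one way to compare them in Python. It rests on two assumptions that the commit does not state: that a release candidate such as `8.0.RC2` precedes the final release `8.0.0` of the same major.minor line, and that `rc1` and `RC1` are equivalent spellings. The function names are hypothetical, and only the two version shapes that appear in this commit are handled.

```python
import re
from typing import Tuple

# Matches "8.0.0" (final release) or "8.1.RC1" / "8.1.rc1" (release candidate).
_CANN_VERSION = re.compile(r"^(\d+)\.(\d+)\.(?:rc(\d+)|(\d+))$", re.IGNORECASE)


def cann_version_key(version: str) -> Tuple[int, int, int, int]:
    """Turn a CANN version string into a tuple that sorts in release order."""
    match = _CANN_VERSION.match(version.strip())
    if match is None:
        raise ValueError(f"unrecognized CANN version: {version!r}")
    major, minor, rc, patch = match.groups()
    if rc is not None:
        # Assumption: a release candidate sorts before any final release
        # with the same major.minor, hence the 0 in the third slot.
        return (int(major), int(minor), 0, int(rc))
    return (int(major), int(minor), 1, int(patch))


def meets_minimum(installed: str, minimum: str = "8.1.rc1") -> bool:
    """Check an installed version against the minimum stated in the README."""
    return cann_version_key(installed) >= cann_version_key(minimum)


if __name__ == "__main__":
    for candidate in ("8.0.RC2", "8.0.0", "8.1.RC1"):
        print(candidate, meets_minimum(candidate))
```

Version strings with extra suffixes are rejected with a `ValueError` rather than guessed at, since their ordering is not described anywhere in this commit.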

README.zh.md

Lines changed: 1 addition & 1 deletion
@@ -35,7 +35,7 @@ The vLLM Ascend plugin (`vllm-ascend`) is a ... that lets vLLM run seamlessly on Ascend NPU
 - Hardware: Atlas 800I A2 Inference series, Atlas A2 Training series
 - Software:
   * Python >= 3.9, < 3.12
-  * CANN >= 8.0.RC2
+  * CANN >= 8.1.RC1
   * PyTorch >= 2.5.1, torch-npu >= 2.5.1
   * vLLM (the same version as vllm-ascend)
4141

docs/source/conf.py

Lines changed: 1 addition & 1 deletion
@@ -73,7 +73,7 @@
     'pip_vllm_ascend_version': "0.7.3rc2",
     'pip_vllm_version': "0.7.3",
     # CANN image tag
-    'cann_image_tag': "8.0.0-910b-ubuntu22.04-py3.10",
+    'cann_image_tag': "8.1.rc1-910b-ubuntu22.04-py3.10",
 }

 # Add any paths that contain templates here, relative to this directory.
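
The `cann_image_tag` value packs several pieces of information into one hyphen-separated string. The sketch below splits it into named fields. The field names (`cann_version`, `chip`, `os`, `python`) are an interpretation of the tag layout seen in this commit, not something defined by the repository, and `parse_cann_image_tag` is a hypothetical helper.

```python
from typing import NamedTuple


class CannImageTag(NamedTuple):
    # Field names are assumptions based on the tags that appear in this commit.
    cann_version: str
    chip: str
    os: str
    python: str


def parse_cann_image_tag(tag: str) -> CannImageTag:
    """Split a tag such as '8.1.rc1-910b-ubuntu22.04-py3.10' into its four parts."""
    parts = tag.split("-")
    if len(parts) != 4:
        raise ValueError(f"expected 4 hyphen-separated fields, got {len(parts)}: {tag!r}")
    return CannImageTag(*parts)


if __name__ == "__main__":
    print(parse_cann_image_tag("8.1.rc1-910b-ubuntu22.04-py3.10"))
```

Seen this way, the commit changes only the first field of each tag; the chip, OS, and Python fields are the same before and after.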

docs/source/installation.md

Lines changed: 12 additions & 15 deletions
@@ -11,7 +11,7 @@ This document describes how to install vllm-ascend manually.

 | Software     | Supported version | Note                                   |
 | ------------ | ----------------- | -------------------------------------- |
-| CANN         | >= 8.0.0          | Required for vllm-ascend and torch-npu |
+| CANN         | >= 8.1.rc1        | Required for vllm-ascend and torch-npu |
 | torch-npu    | >= 2.5.1          | Required for vllm-ascend               |
 | torch        | >= 2.5.1          | Required for torch-npu and vllm        |

@@ -68,32 +68,28 @@ docker run --rm \
 :animate: fade-in-slide-down
 You can also install CANN manually:

-```{note}
-This guide takes aarch64 as an example. If you run on x86, you need to replace `aarch64` with `x86_64` for the package name shown below.
-```
-
 ```bash
 # Create a virtual environment
 python -m venv vllm-ascend-env
 source vllm-ascend-env/bin/activate

 # Install required python packages.
-pip3 install -i https://pypi.tuna.tsinghua.edu.cn/simple attrs numpy<2.0.0 decorator sympy cffi pyyaml pathlib2 psutil protobuf scipy requests absl-py wheel typing_extensions
+pip3 install -i https://pypi.tuna.tsinghua.edu.cn/simple attrs 'numpy<2.0.0' decorator sympy cffi pyyaml pathlib2 psutil protobuf scipy requests absl-py wheel typing_extensions

 # Download and install the CANN package.
-wget https://ascend-repo.obs.cn-east-2.myhuaweicloud.com/CANN/CANN%208.0.0/Ascend-cann-toolkit_8.0.0_linux-aarch64.run
-chmod +x ./Ascend-cann-toolkit_8.0.0_linux-aarch64.run
-./Ascend-cann-toolkit_8.0.0_linux-aarch64.run --full
+wget https://ascend-repo.obs.cn-east-2.myhuaweicloud.com/CANN/CANN%208.1.RC1/Ascend-cann-toolkit_8.1.RC1_linux-"$(uname -i)".run
+chmod +x ./Ascend-cann-toolkit_8.1.RC1_linux-"$(uname -i)".run
+./Ascend-cann-toolkit_8.1.RC1_linux-"$(uname -i)".run --full

 source /usr/local/Ascend/ascend-toolkit/set_env.sh

-wget https://ascend-repo.obs.cn-east-2.myhuaweicloud.com/CANN/CANN%208.0.0/Ascend-cann-kernels-910b_8.0.0_linux-aarch64.run
-chmod +x ./Ascend-cann-kernels-910b_8.0.0_linux-aarch64.run
-./Ascend-cann-kernels-910b_8.0.0_linux-aarch64.run --install
+wget https://ascend-repo.obs.cn-east-2.myhuaweicloud.com/CANN/CANN%208.1.RC1/Ascend-cann-kernels-910b_8.1.RC1_linux-"$(uname -i)".run
+chmod +x ./Ascend-cann-kernels-910b_8.1.RC1_linux-"$(uname -i)".run
+./Ascend-cann-kernels-910b_8.1.RC1_linux-"$(uname -i)".run --install

-wget https://ascend-repo.obs.cn-east-2.myhuaweicloud.com/CANN/CANN%208.0.0/Ascend-cann-nnal_8.0.0_linux-aarch64.run
-chmod +x ./Ascend-cann-nnal_8.0.0_linux-aarch64.run
-./Ascend-cann-nnal_8.0.0_linux-aarch64.run --install
+wget https://ascend-repo.obs.cn-east-2.myhuaweicloud.com/CANN/CANN%208.1.RC1/Ascend-cann-nnal_8.1.RC1_linux-"$(uname -i)".run
+chmod +x ./Ascend-cann-nnal_8.1.RC1_linux-"$(uname -i)".run
+./Ascend-cann-nnal_8.1.RC1_linux-"$(uname -i)".run --install

 source /usr/local/Ascend/nnal/atb/set_env.sh
 ```
@@ -194,6 +190,7 @@ docker run --rm \
     -it $IMAGE bash
 ```

+The default workdir is `/workspace`, vLLM and vLLM Ascend code are placed in `/vllm-workspace` and installed in [development mode](https://setuptools.pypa.io/en/latest/userguide/development_mode.html)(`pip install -e`) to help developer immediately take place changes without requiring a new installation.
 ::::

 :::::
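
The installation.md change replaces the hard-coded `aarch64` in each CANN package name with `"$(uname -i)"`, so the same commands work on both architectures. The Python sketch below builds the same download URLs from a package name, version, and architecture. `cann_package_url` is a hypothetical helper. Its default architecture comes from `platform.machine()`, which is an assumption on my part: it corresponds to `uname -m` rather than `uname -i`, and the two can differ on some systems.

```python
import platform
from urllib.parse import quote

# Base URL as it appears in the installation guide.
BASE_URL = "https://ascend-repo.obs.cn-east-2.myhuaweicloud.com/CANN"


def cann_package_url(package: str, version: str = "8.1.RC1", arch: str = "") -> str:
    """Build the download URL for a CANN .run package.

    `package` is the middle part of the file name, e.g. "toolkit",
    "kernels-910b", or "nnal". Hypothetical helper for illustration only.
    """
    # Assumption: platform.machine() yields "aarch64" or "x86_64" on the target host.
    arch = arch or platform.machine()
    # The directory name contains a space, which is percent-encoded in the URL.
    directory = quote(f"CANN {version}")
    return f"{BASE_URL}/{directory}/Ascend-cann-{package}_{version}_linux-{arch}.run"


if __name__ == "__main__":
    for name in ("toolkit", "kernels-910b", "nnal"):
        print(cann_package_url(name, arch="aarch64"))
```

Passing `arch` explicitly reproduces the file names from the old guide, while leaving it empty mirrors the new behaviour of detecting the architecture at run time.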

docs/source/quick_start.md

Lines changed: 2 additions & 0 deletions
@@ -31,6 +31,8 @@ docker run --rm \
     -it $IMAGE bash
 ```

+The default workdir is `/workspace`, vLLM and vLLM Ascend code are placed in `/vllm-workspace` and installed in [development mode](https://setuptools.pypa.io/en/latest/userguide/development_mode.html)(`pip install -e`) to help developer immediately take place changes without requiring a new installation.
+
 ## Usage

 You can use Modelscope mirror to speed up download:
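
The sentence added in this hunk says the code inside the container is installed in development mode (`pip install -e`). One way to confirm that from Python is to read the `direct_url.json` metadata that pip records for direct installs (PEP 610), where editable installs carry `"dir_info": {"editable": true}`. The sketch below assumes the distribution is named `vllm-ascend` and that the installing pip version wrote this metadata file. Both helper functions are hypothetical.

```python
import json
from importlib import metadata
from typing import Optional


def editable_from_direct_url(direct_url_json: Optional[str]) -> bool:
    """Return True if PEP 610 metadata marks the install as editable."""
    if not direct_url_json:
        # No direct_url.json means the package was not installed from a direct URL.
        return False
    data = json.loads(direct_url_json)
    return bool(data.get("dir_info", {}).get("editable", False))


def is_editable_install(dist_name: str) -> bool:
    """Check whether an installed distribution was installed with `pip install -e`."""
    try:
        dist = metadata.distribution(dist_name)
    except metadata.PackageNotFoundError:
        return False
    # read_text returns None when the metadata file does not exist.
    return editable_from_direct_url(dist.read_text("direct_url.json"))


if __name__ == "__main__":
    # "vllm-ascend" is the assumed distribution name inside the container.
    print(is_editable_install("vllm-ascend"))
```

A `False` result is not conclusive on its own, because older tooling may install in editable mode without writing `direct_url.json`.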
