This PR adds the setup instructions for Huawei Ascend NPU. Please refer
to the remainder of the guide for instructions on other devices.
---------
Co-authored-by: sjh <sjh1270@163.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Logan Adams <loadams@microsoft.com>
[Accelerator Setup Guide](accelerator-setup-guide.md) explains how to set up different accelerators for DeepSpeed. It also includes simple examples of how to run DeepSpeed on each accelerator. The following guides are provided:
1. Run DeepSpeed model on CPU
2. Run DeepSpeed model on XPU
3. Run DeepSpeed model on Huawei Ascend NPU
# Implement new accelerator extension
It is possible to implement a new DeepSpeed accelerator extension to support a new accelerator in DeepSpeed. An example to follow is _[Intel Extension For DeepSpeed](https://github.com/intel/intel-extension-for-deepspeed/)_. An accelerator extension contains the following components:
DeepSpeed supports different accelerators from different companies. The setup steps for running DeepSpeed may differ from one accelerator to another. This guide lets users look up setup instructions for the accelerator family and hardware they are using.
@@ -132,3 +133,115 @@ accelerator: xpu
## More examples for using DeepSpeed on Intel XPU
Refer to https://github.com/intel/intel-extension-for-pytorch/tree/release/xpu/2.1.40/examples/gpu/inference/python/llm for a more extensive guide.
# Huawei Ascend NPU
DeepSpeed has been verified on the following Huawei Ascend NPU products:
* Atlas 300T A2
## Installation steps for Huawei Ascend NPU
The following steps outline the process for installing DeepSpeed on a Huawei Ascend NPU:
1. Install the Huawei Ascend NPU Driver and Firmware
<details>
<summary>Click to expand</summary>
Before proceeding with the installation, please download the necessary files from [Huawei Ascend NPU Driver and Firmware](https://www.hiascend.com/en/hardware/firmware-drivers/commercial?product=4&model=11).
The instructions below are sourced from the [Ascend Community](https://www.hiascend.com/document/detail/en/canncommercial/700/quickstart/quickstart/quickstart_18_0002.html) (refer to the [Chinese version](https://www.hiascend.com/document/detail/zh/canncommercial/700/quickstart/quickstart/quickstart_18_0002.html)):
- Execute the following command to install the driver:
ascend_cann version .............. 8.0.RC2.alpha002
shared memory (/dev/shm) size .... 20.00 GB
```
## How to launch DeepSpeed on Huawei Ascend NPU
To verify that the Huawei Ascend NPU is available and that the correct accelerator is selected, here is an example (Huawei Ascend NPU detection is automatic starting with DeepSpeed v0.12.6):
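A minimal Python sketch of such a check, assuming DeepSpeed's `deepspeed.accelerator.get_accelerator()` interface and its `device_name()` and `is_available()` methods. The helper name `detect_accelerator` is hypothetical and is not part of DeepSpeed. On a correctly configured Ascend host, the reported name would be expected to be `npu`, but this sketch does not assert that.

```python
def detect_accelerator():
    """Report which accelerator DeepSpeed selects (hypothetical helper).

    Returns a (device_name, is_available) tuple, or None when DeepSpeed
    cannot be imported or no accelerator can be selected.
    """
    try:
        # Importing DeepSpeed also imports PyTorch, so either may be missing.
        from deepspeed.accelerator import get_accelerator
    except Exception:
        return None

    try:
        accelerator = get_accelerator()
        # device_name() with no index returns the bare device type string.
        return accelerator.device_name(), accelerator.is_available()
    except Exception:
        # Selection can fail on hosts without a usable accelerator runtime.
        return None


if __name__ == "__main__":
    result = detect_accelerator()
    if result is None:
        print("DeepSpeed is not importable or no accelerator was selected")
    else:
        name, available = result
        print(f"accelerator: {name}, available: {available}")
```

Returning `None` instead of raising keeps the check usable on machines where DeepSpeed is not installed yet, which is convenient while stepping through the installation.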
## Multi-card parallel training using Huawei Ascend NPU
To perform model training across multiple Huawei Ascend NPU cards using DeepSpeed, see the examples provided in [DeepSpeed Examples](https://github.com/microsoft/DeepSpeedExamples/blob/master/training/cifar/cifar10_deepspeed.py).
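As a rough illustration of how such a run might be started, the sketch below assembles a command line for the `deepspeed` launcher in Python. It rests on several assumptions: that the launcher's `--num_gpus` option also limits the number of NPU cards, that the linked `cifar10_deepspeed.py` script accepts the `--deepspeed` flag, and that 8 cards are present. The helper `build_launch_command` is hypothetical and only builds the argument list; it does not start training.

```python
import shlex


def build_launch_command(script, num_cards, script_args=()):
    """Build an argv list for the `deepspeed` launcher (hypothetical helper)."""
    if num_cards < 1:
        raise ValueError("num_cards must be at least 1")
    # Assumption: --num_gpus caps the per-node device count on NPU as well.
    return ["deepspeed", "--num_gpus", str(num_cards), script, *script_args]


if __name__ == "__main__":
    # 8 cards and the --deepspeed flag are illustrative assumptions.
    cmd = build_launch_command("cifar10_deepspeed.py", 8, ["--deepspeed"])
    print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` on a host where DeepSpeed and the Ascend software stack are installed.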