Skip to content

Commit e0068db

Browse files
committed
[Doc] Modify review comments.
Signed-off-by: menogrey <1299267905@qq.com>
1 parent fb31d2c commit e0068db

File tree

3 files changed

+14
-6
lines changed

3 files changed

+14
-6
lines changed

docs/source/developer_guide/evaluation/using_ais_bench.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -29,6 +29,12 @@ docker run --rm \
2929
-e PYTORCH_NPU_ALLOC_CONF=max_split_size_mb:256 \
3030
-it $IMAGE \
3131
/bin/bash
32+
```
33+
34+
Run the vLLM server in the docker.
35+
36+
```{code-block} bash
37+
:substitutions:
3238
vllm serve Qwen/Qwen2.5-0.5B-Instruct --max_model_len 35000 &
3339
```
3440

docs/source/developer_guide/evaluation/using_lm_eval.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -122,10 +122,10 @@ After 30 minutes, the output is as shown below:
122122
```
123123
The markdown format results is as below:
124124
125-
Tasks|Version| Filter |n-shot| Metric | |Value | |Stderr|
125+
|Tasks|Version| Filter |n-shot| Metric | |Value | |Stderr|
126126
|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
127127
|gsm8k| 3|flexible-extract| 5|exact_match|↑ |0.3215|± |0.0129|
128-
| | |strict-match | 5|exact_match|↑ |0.2077|± |0.0112|
128+
|gsm8k| 3|strict-match | 5|exact_match|↑ |0.2077|± |0.0112|
129129
130130
```
131131

@@ -187,7 +187,7 @@ The markdown format results is as below:
187187
Tasks|Version| Filter |n-shot| Metric | |Value | |Stderr|
188188
|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
189189
|gsm8k| 3|flexible-extract| 5|exact_match|↑ |0.3412|± |0.0131|
190-
| | |strict-match | 5|exact_match|↑ |0.3139|± |0.0128|
190+
|gsm8k| 3|strict-match | 5|exact_match|↑ |0.3139|± |0.0128|
191191
192192
```
193193

docs/source/tutorials/DeepSeek-V3.2-Exp.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,8 @@ DeepSeek-V3.2-Exp is a sparse attention model. The main architecture is similar
66

77
This document will show the main verification steps of the model, including supported features, feature configuration, environment preparation, single-node and multi-node deployment, accuracy and performance evaluation.
88

9+
The `DeepSeek-V3.2-Exp` model is first supported in `vllm-ascend:v0.11.0rc0`.
10+
911
## Supported Features
1012

1113
Refer to [supported features](../user_guide/support_matrix/supported_models.md) to get the model's supported feature matrix.
@@ -16,8 +18,8 @@ Refer to [feature guide](../user_guide/feature_guide/index.md) to get the featur
1618

1719
### Model Weight
1820

19-
- `DeepSeek-V3.2-Exp`: require 2 Atlas 800 A3 (64G × 16) nodes or 4 Atlas 800 A2 (64G × 8) nodes. [Download model weight](https://modelers.cn/models/Modelers_Park/DeepSeek-V3.2-Exp-BF16)
20-
- `DeepSeek-V3.2-Exp(Quantized version)`: require 1 Atlas 800 A3 (64G × 16) node or 2 Atlas 800 A2 (64G × 8) nodes. [Download model weight](https://modelers.cn/models/Modelers_Park/DeepSeek-V3.2-Exp-w8a8)
21+
- `DeepSeek-V3.2-Exp`(BF16 version): require 2 Atlas 800 A3 (64G × 16) nodes or 4 Atlas 800 A2 (64G × 8) nodes. [Download model weight](https://modelers.cn/models/Modelers_Park/DeepSeek-V3.2-Exp-BF16)
22+
- `DeepSeek-V3.2-Exp-w8a8`(Quantized version): require 1 Atlas 800 A3 (64G × 16) node or 2 Atlas 800 A2 (64G × 8) nodes. [Download model weight](https://modelers.cn/models/Modelers_Park/DeepSeek-V3.2-Exp-w8a8)
2123

2224
It is recommended to download the model weight to the shared directory of multiple nodes, such as `/root/.cache/`
2325

@@ -398,7 +400,7 @@ lm_eval \
398400
|Tasks|Version| Filter |n-shot| Metric | |Value | |Stderr|
399401
|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
400402
|gsm8k| 3|flexible-extract| 5|exact_match||0.9591|± |0.0055|
401-
| | |strict-match | 5|exact_match||0.9583|± |0.0055|
403+
|gsm8k| 3|strict-match | 5|exact_match||0.9583|± |0.0055|
402404

403405
## Performance
404406

0 commit comments

Comments
 (0)