You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
### What this PR does / why we need it?
Update FAQ.
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
CI passed
---------
Signed-off-by: Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com>
### 21. Failed with inferencing Qwen3 MoE due to `Alloc sq cq fail` issue?
189
+
190
+
When running Qwen3 MoE with tp/dp/ep, etc., you may encounter an error shown in [#2629](https://github.com/vllm-project/vllm-ascend/issues/2629).
191
+
192
+
This is more likely to happen when you're using A3. Please refer to the empirical formula below to estimate a suitable value for this argument:
193
+
194
+
```python
195
+
# pg_num: the number of process groups for communication
196
+
pg_num =sum(size >1for size in [
197
+
parallel_config.data_parallel_size,
198
+
parallel_config.tensor_parallel_size,
199
+
])
200
+
# num_hidden_layer: number of hidden layers of the model
0 commit comments