Description
### Your current environment
```shell
nohup python -m vllm.entrypoints.openai.api_server --model=/mnt/deepseek/DeepSeek-R1-W8A8-VLLM \
  --trust-remote-code \
  --distributed-executor-backend=mp \
  -tp=16 \
  -dp=1 \
  --port 8006 \
  --max-num-seqs 24 \
  --max-model-len 32768 \
  --max-num-batched-tokens 32768 \
  --block-size 128 \
  --enable-expert-parallel \
  --compilation_config 0 \
  --gpu-memory-utilization 0.96 \
  --additional-config '{"expert_tensor_parallel_size":16, "ascend_scheduler_config":{}}' &> run.log &
```
The crux of the problem: with etp16 (`expert_tensor_parallel_size=16`), the expert computation is not processed in parallel.
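To illustrate the difference, here is a minimal sketch (not vLLM internals; the expert count of 256 and the placement logic are illustrative assumptions) contrasting expert parallelism (EP), where devices hold disjoint experts and work concurrently, with expert tensor parallelism (ETP) at size 16, where every expert's weights are sharded across all 16 devices, so no device-level parallelism across experts remains:

```python
# Hypothetical illustration, not vLLM code: expert-to-device placement
# under EP vs. ETP for a MoE layer.

NUM_EXPERTS = 256   # assumed routed-expert count for illustration
NUM_DEVICES = 16    # matches -tp=16 in the launch command above

# EP: experts are partitioned across devices; each device owns a disjoint
# subset, so different experts can run concurrently on different devices.
ep_placement = {
    d: list(range(d * NUM_EXPERTS // NUM_DEVICES,
                  (d + 1) * NUM_EXPERTS // NUM_DEVICES))
    for d in range(NUM_DEVICES)
}

# ETP with expert_tensor_parallel_size=16: each expert's weight matrices are
# sharded across all 16 devices, so every device participates in every
# expert rather than processing different experts in parallel.
etp_placement = {d: list(range(NUM_EXPERTS)) for d in range(NUM_DEVICES)}

print(len(ep_placement[0]))   # experts owned per device under EP -> 16
print(len(etp_placement[0]))  # experts touched per device under ETP -> 256
```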

### 🐛 Describe the bug
```shell
nohup python -m vllm.entrypoints.openai.api_server --model=/mnt/deepseek/DeepSeek-R1-W8A8-VLLM \
  --trust-remote-code \
  --distributed-executor-backend=mp \
  -tp=16 \
  -dp=1 \
  --port 8006 \
  --max-num-seqs 24 \
  --max-model-len 32768 \
  --max-num-batched-tokens 32768 \
  --block-size 128 \
  --enable-expert-parallel \
  --compilation_config 0 \
  --gpu-memory-utilization 0.96 \
  --additional-config '{"expert_tensor_parallel_size":16, "ascend_scheduler_config":{}}' &> run.log &
```