Commit 2bb7e55
[Bugfix][PD]fix non-working disaggregated prefill (#2374)
### What this PR does / why we need it?
Mainline vLLM fixed disaggregated prefill in
vllm-project/vllm#22598, but the feature still does not
work in vllm-ascend.
Concretely, on Ascend devices, decoder instances crash without vLLM's fix
and hang with it.
This patch makes disaggregated prefill work on vllm-ascend.
### Does this PR introduce _any_ user-facing change?
No.
### How was this patch tested?
Tested with Qwen3-0.6B in a 1P1D setup (one prefill instance, one decode instance), tp=1, dp=1.
- vLLM version: v0.10.0
- vLLM main: vllm-project/vllm@0fe8508
---------
Signed-off-by: CaveNightingale <cavenightingale@foxmail.com>

1 parent 1b40665 commit 2bb7e55
1 file changed, +4 -3 lines changed

| Hunk | Context (original lines) | Context (new lines) | Change |
|---|---|---|---|
| 1 | 1636-1642 | 1636-1642 | line 1639 replaced (1 removed, 1 added) |
| 2 | 1838-1845 | 1838-1846 | lines 1841-1842 replaced by new lines 1841-1843 (2 removed, 3 added) |