Commit 47c3dad

chore(sglang): support dp attention routing (#1415)
1 parent d83633b commit 47c3dad

2 files changed, 3 additions and 3 deletions

container/Dockerfile.sglang

Lines changed: 2 additions & 3 deletions
@@ -135,9 +135,8 @@ RUN if [ "$ARCH" = "arm64" ]; then \
 
 # Install sglang
 # Once either 0.4.6post6 or 0.4.7 is released, we can switch back to using the published version
-# This commit references multiple perf fixes for DP attention and NIXL https://github.com/sgl-project/sglang/pull/6780
-# 6/2(ishan) - moving to ToT for performance purposes
-ARG SGLANG_COMMIT="6376b632eb4daef306b89ede0eabdcb89ddff728"
+# This commit references a fix to add DP attention based routing along with other perf fixes https://github.com/sgl-project/sglang/pull/6884
+ARG SGLANG_COMMIT="f1569876d54dd3b6601f5280f12652e9fbb1375c"
 RUN --mount=type=cache,target=/root/.cache/uv \
     git clone https://github.com/sgl-project/sglang.git && \
     cd sglang && \

examples/sglang/utils/protocol.py

Lines changed: 1 addition & 0 deletions
@@ -60,3 +60,4 @@ class DisaggPreprocessedRequest(BaseModel):
     bootstrap_host: str
     bootstrap_port: int
     bootstrap_room: int
+    data_parallel_rank: Optional[int] = None
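
The new field is optional and defaults to None, so requests built without it stay valid. The sketch below illustrates how such a field might be consumed. It is not code from this commit: `DisaggRequestSketch` is a stdlib dataclass standing in for the pydantic `DisaggPreprocessedRequest`, and `pick_dp_rank`, `dp_size`, and the modulo fallback are hypothetical names and behavior chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DisaggRequestSketch:
    """Stdlib stand-in for DisaggPreprocessedRequest (the real class is a pydantic BaseModel)."""

    bootstrap_host: str
    bootstrap_port: int
    bootstrap_room: int
    # Field added by this commit; None means no DP rank has been pinned.
    data_parallel_rank: Optional[int] = None


def pick_dp_rank(request: DisaggRequestSketch, dp_size: int) -> int:
    """Hypothetical helper: honor a pinned rank, otherwise fall back deterministically."""
    if dp_size <= 0:
        raise ValueError("dp_size must be positive")
    if request.data_parallel_rank is not None:
        if not 0 <= request.data_parallel_rank < dp_size:
            raise ValueError("data_parallel_rank out of range")
        return request.data_parallel_rank
    # Assumed fallback for illustration only: spread requests by bootstrap room.
    return request.bootstrap_room % dp_size


# A request built the old way (no rank) and one with a pinned rank.
legacy = DisaggRequestSketch("10.0.0.1", 8998, 42)
pinned = DisaggRequestSketch("10.0.0.1", 8998, 42, data_parallel_rank=3)
print(pick_dp_rank(legacy, dp_size=4), pick_dp_rank(pinned, dp_size=4))
```

Because the default is None, callers that never set `data_parallel_rank` need no changes, while a router that knows the target rank can pass it through the request.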