Skip to content

Commit 2d8b646

Browse files
krishung5jain-ria
authored andcommitted
fix: Revert the commit for DeepGEMM to fix vLLM WideEP (#2302)
1 parent b074aff commit 2d8b646

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

container/Dockerfile.vllm

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,9 @@ ARG TORCH_BACKEND="cu128"
1717

1818
# Match 0.10.0 vLLM release
1919
# https://github.com/vllm-project/vllm/releases/tag/v0.10.0
20-
ARG DEEPGEMM_REF="1876566"
20+
# Pinned to commit before https://github.com/deepseek-ai/DeepGEMM/pull/112 for DeepGEMM which seems to break on H100:
21+
# "RuntimeError: Failed: CUDA runtime error csrc/jit/kernel_runtime.hpp:108 '98'"
22+
ARG DEEPGEMM_REF="03d0be3"
2123
ARG FLASHINF_REF="v0.2.8rc1"
2224

2325
# Define general architecture ARGs for supporting both x86 and aarch64 builds.

0 commit comments

Comments
 (0)