1 parent 347620a commit eddee3a
container/Dockerfile.vllm
@@ -17,7 +17,9 @@ ARG TORCH_BACKEND="cu128"
 
 # Match 0.10.0 vLLM release
 # https://github.com/vllm-project/vllm/releases/tag/v0.10.0
-ARG DEEPGEMM_REF="1876566"
+# Pinned to commit before https://github.com/deepseek-ai/DeepGEMM/pull/112 for DeepGEMM which seems to break on H100:
+# "RuntimeError: Failed: CUDA runtime error csrc/jit/kernel_runtime.hpp:108 '98'"
+ARG DEEPGEMM_REF="03d0be3"
 ARG FLASHINF_REF="v0.2.8rc1"
 
 # Define general architecture ARGs for supporting both x86 and aarch64 builds.