Commit 0e4c980

GuanLuo and saturley-hall authored and committed
fix: WAR DeepGemm JIT compilation errors (#2937)
Signed-off-by: GuanLuo <gluo@nvidia.com>
Co-authored-by: Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
Signed-off-by: Indrajit Bhosale <iamindrajitb@gmail.com>
1 parent 11d7cd0 commit 0e4c980

1 file changed, +7 -1 lines changed


container/Dockerfile.vllm

Lines changed: 7 additions & 1 deletion
@@ -199,7 +199,9 @@ RUN apt-get update && \
  ninja-build \
  g++ \
  # prometheus dependencies
- ca-certificates && \
+ ca-certificates \
+ # DeepGemm uses 'cuobjdump' which does not come with CUDA image
+ cuda-command-line-tools-12-8 && \
  rm -rf /var/lib/apt/lists/*

  # Copy CUDA development tools (nvcc, headers, dependencies, etc.) from base devel image
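
The hunk above installs `cuda-command-line-tools-12-8` so that `cuobjdump` is available to DeepGemm. A minimal sketch of how one might confirm this inside the built image is shown here. `require_tool` is a hypothetical helper, not part of the commit, and the sketch assumes the package's binaries end up on `PATH`.

```shell
#!/bin/sh
# Hypothetical helper (not part of the commit): report whether a tool
# can be resolved on PATH, returning non-zero when it cannot.
require_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found: $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# Inside the built image, the expectation is that this succeeds:
#   require_tool cuobjdump
```

Running `require_tool cuobjdump` in a container started from the image should print `found: cuobjdump` if the assumption about `PATH` holds. A `missing:` result would suggest the tools were installed outside the search path.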
@@ -249,6 +251,10 @@ $NIXL_PLUGIN_DIR:\
  /usr/local/ucx/lib/ucx:\
  $LD_LIBRARY_PATH

+ # DeepGemm runs nvcc for JIT kernel compilation, however the CUDA include path
+ # is not properly set for complilation. Set CPATH to help nvcc find the headers.
+ ENV CPATH=/usr/local/cuda/include:$CPATH
+
  ### VIRTUAL ENVIRONMENT SETUP ###

  # Copy uv and entire virtual environment from framework container
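
One detail of the `ENV CPATH=...` line may be worth noting. If the base image does not already define `CPATH`, the value expands to `/usr/local/cuda/include:` with a trailing colon, and GCC treats an empty `CPATH` element as the current working directory. The sketch below illustrates a conditional prepend that avoids the empty element. `prepend_path` is a hypothetical helper for illustration only, not something the commit uses.

```shell
#!/bin/sh
# Hypothetical helper (not from the commit): prepend a directory to a
# colon-separated search path without leaving an empty element behind.
prepend_path() {
    # $1 = directory to add, $2 = current value (may be empty)
    printf '%s%s\n' "$1" "${2:+:$2}"
}

prepend_path /usr/local/cuda/include ""        # prints /usr/local/cuda/include
prepend_path /usr/local/cuda/include /opt/inc  # prints /usr/local/cuda/include:/opt/inc
```

The `${var:+word}` form is also accepted in Dockerfile `ENV` instructions, so the same idea could be written as `ENV CPATH=/usr/local/cuda/include${CPATH:+:$CPATH}`. Whether the trailing colon matters in practice depends on the working directory nvcc is invoked from.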
