[Misc] Define EP kernel arch list in Dockerfile #25635

simon-mo · 2025-09-25T04:51:21Z

Found by @rizar that our container's DeepEP doesn't work on Blackwell, TY!

Summary

define TORCH_CUDA_ARCH_LIST in the vllm-base stage so EP kernel installation defaults to Hopper and Blackwell support
revert the EP kernel installer to rely on the environment instead of forcing its own CUDA architectures

Testing

not run

https://chatgpt.com/codex/tasks/task_e_68d4c5364338832997fa34fc45f06432

gemini-code-assist

Code Review

This pull request defines the TORCH_CUDA_ARCH_LIST in the vllm-base stage of the Dockerfile, ensuring that EP kernel installation defaults to supporting Hopper and Blackwell architectures. It also updates a fallback value for the architecture list. The changes are logical and improve the build process's configurability and defaults. However, there's a redundancy that can be cleaned up.

gemini-code-assist · 2025-09-25T05:00:59Z

docker/Dockerfile

+RUN export TORCH_CUDA_ARCH_LIST="${TORCH_CUDA_ARCH_LIST:-9.0a 10.0a+PTX}" \
    && bash install_python_libraries.sh


Since TORCH_CUDA_ARCH_LIST is now set via an ENV instruction earlier in this build stage (line 288), this export command with a fallback is redundant. The environment variable will already be available to the install_python_libraries.sh script. You can simplify this RUN command by removing the export part.

RUN bash install_python_libraries.sh

docker/Dockerfile

Signed-off-by: Simon Mo <simon.mo@hey.com>

Signed-off-by: Simon Mo <simon.mo@hey.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Signed-off-by: Simon Mo <simon.mo@hey.com>

Signed-off-by: Simon Mo <simon.mo@hey.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Define EP kernel arch list in Dockerfile

3241753

simon-mo added the codex label Sep 25, 2025 — with ChatGPT Codex Connector

mergify bot added the ci/build label Sep 25, 2025

simon-mo changed the title ~~Define EP kernel arch list in Dockerfile~~ [Misc] Define EP kernel arch list in Dockerfile Sep 25, 2025

gemini-code-assist bot reviewed Sep 25, 2025

View reviewed changes

youkaichao approved these changes Sep 25, 2025

View reviewed changes

simon-mo commented Sep 26, 2025

View reviewed changes

docker/Dockerfile Outdated Show resolved Hide resolved

simon-mo added 2 commits September 26, 2025 10:21

Apply suggestion from @simon-mo

11c6159

Signed-off-by: Simon Mo <simon.mo@hey.com>

Merge branch 'main' into codex/fix-torch_cuda_arch_list-in-dockerfile

34df62c

simon-mo enabled auto-merge (squash) October 6, 2025 21:40

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 6, 2025

simon-mo merged commit 8229280 into main Oct 7, 2025
88 checks passed

simon-mo deleted the codex/fix-torch_cuda_arch_list-in-dockerfile branch October 7, 2025 00:05

southfreebird pushed a commit to southfreebird/vllm that referenced this pull request Oct 7, 2025

[Misc] Define EP kernel arch list in Dockerfile (vllm-project#25635)

459cda7

Signed-off-by: Simon Mo <simon.mo@hey.com>

mrasquinha-g pushed a commit to mrasquinha-g/vllm that referenced this pull request Oct 9, 2025

[Misc] Define EP kernel arch list in Dockerfile (vllm-project#25635)

9558aae

Signed-off-by: Simon Mo <simon.mo@hey.com>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025

[Misc] Define EP kernel arch list in Dockerfile (vllm-project#25635)

d73b8d7

Signed-off-by: Simon Mo <simon.mo@hey.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025

[Misc] Define EP kernel arch list in Dockerfile (vllm-project#25635)

ed749b6

Signed-off-by: Simon Mo <simon.mo@hey.com>

alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025

[Misc] Define EP kernel arch list in Dockerfile (vllm-project#25635)

28c688b

Signed-off-by: Simon Mo <simon.mo@hey.com>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025

[Misc] Define EP kernel arch list in Dockerfile (vllm-project#25635)

4b23bc2

Signed-off-by: Simon Mo <simon.mo@hey.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Misc] Define EP kernel arch list in Dockerfile #25635

[Misc] Define EP kernel arch list in Dockerfile #25635

Uh oh!

simon-mo commented Sep 25, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Sep 25, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		RUN export TORCH_CUDA_ARCH_LIST="${TORCH_CUDA_ARCH_LIST:-9.0a 10.0a+PTX}" \
		&& bash install_python_libraries.sh

Uh oh!

[Misc] Define EP kernel arch list in Dockerfile #25635

[Misc] Define EP kernel arch list in Dockerfile #25635

Uh oh!

Conversation

simon-mo commented Sep 25, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

simon-mo commented Sep 25, 2025 •

edited by github-actions bot

Loading