[Build] Fix cuda link target of cumem_allocator in CPU env #12863

guoyuhong · 2025-02-07T03:05:24Z

It looks like that the build system could not find -lcuda correctly in the CPU container.
list(APPEND CUMEM_LIBS cuda) is too simple for a CPU build env, ld cannot find the location. It is better to use CUDA::cuda_driver which is a full path.

github-actions · 2025-02-07T03:05:35Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: YuhongGuo <yuhong.gyh@antgroup.com>

guoyuhong · 2025-02-08T02:55:39Z

@tlrmchlsmth do you some comments on this code change?

tlrmchlsmth · 2025-02-10T14:29:57Z

CMakeLists.txt

@@ -192,7 +192,7 @@ set_gencode_flags_for_srcs(
 if(VLLM_GPU_LANG STREQUAL "CUDA")
  message(STATUS "Enabling cumem allocator extension.")
  # link against cuda driver library
-  list(APPEND CUMEM_LIBS cuda)
+  list(APPEND CUMEM_LIBS CUDA::cuda_driver)


This seems reasonable to me -- @youkaichao WDYT?

Actually can we just delete this line completely? We have this link line in define_gpu_extension_target already:

vllm/cmake/utils.cmake

Lines 437 to 443 in 2ae8890

# Don't use `TORCH_LIBRARIES` for CUDA since it pulls in a bunch of

# dependencies that are not necessary and may not be installed.

if (GPU_LANGUAGE STREQUAL "CUDA")

target_link_libraries(${GPU_MOD_NAME} PRIVATE CUDA::cudart CUDA::cuda_driver)

else()

target_link_libraries(${GPU_MOD_NAME} PRIVATE ${TORCH_LIBRARIES})

endif()

Per my analysis, only when GPU_LANGUAGE STREQUAL "CUDA" the CUDA::cudart lib is linked in define_gpu_extension_target. However, the GPU_LANGUAGE for cumem_allocator is CXX. I think that is why extra libcuda is added to CUMEM_LIBS.

tlrmchlsmth

Thanks for the contribution! Verified that this doesn't break my existing build process. I also feel comfortable with the change as this is the official way to do it -see https://cmake.org/cmake/help/latest/module/FindCUDAToolkit.html.

Going to see if this avoids the ld issues that popped up when I tried #12424

guoyuhong · 2025-02-11T01:49:11Z

@tlrmchlsmth Thanks. I also tested whether we could remove this line of code. The building process was fine but there will be undefined symbol as follows.

youkaichao

LGTM, thanks for the fix!

…ect#12863) Signed-off-by: YuhongGuo <yuhong.gyh@antgroup.com> Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com> Signed-off-by: SzymonOzog <szymon.ozog@aleph-alpha.com>

…ect#12863) Signed-off-by: YuhongGuo <yuhong.gyh@antgroup.com> Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>

guoyuhong requested a review from tlrmchlsmth as a code owner February 7, 2025 03:05

mergify bot added the ci/build label Feb 7, 2025

guoyuhong force-pushed the fix_cumem_allocator_cmake branch from 38bf907 to 3b09f0c Compare February 7, 2025 03:06

Fix cuda link target of cumem_allocator

3b09f0c

Signed-off-by: YuhongGuo <yuhong.gyh@antgroup.com>

guoyuhong changed the title ~~Fix cuda link target of cumem_allocator in CPU env~~ [Build] Fix cuda link target of cumem_allocator in CPU env Feb 7, 2025

tlrmchlsmth reviewed Feb 10, 2025

View reviewed changes

tlrmchlsmth approved these changes Feb 10, 2025

View reviewed changes

Merge branch 'main' into fix_cumem_allocator_cmake

c3c60ab

tlrmchlsmth added the ready ONLY add when PR is ready to merge/full CI is needed label Feb 10, 2025

youkaichao approved these changes Feb 11, 2025

View reviewed changes

youkaichao merged commit da31719 into vllm-project:main Feb 11, 2025
51 of 71 checks passed

guoyuhong deleted the fix_cumem_allocator_cmake branch February 12, 2025 02:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Build] Fix cuda link target of cumem_allocator in CPU env #12863

[Build] Fix cuda link target of cumem_allocator in CPU env #12863

guoyuhong commented Feb 7, 2025 •

edited by github-actions bot

Loading

github-actions bot commented Feb 7, 2025

guoyuhong commented Feb 8, 2025

tlrmchlsmth Feb 10, 2025

tlrmchlsmth Feb 10, 2025

guoyuhong Feb 10, 2025

tlrmchlsmth left a comment

guoyuhong commented Feb 11, 2025

youkaichao left a comment

	# Don't use `TORCH_LIBRARIES` for CUDA since it pulls in a bunch of
	# dependencies that are not necessary and may not be installed.
	if (GPU_LANGUAGE STREQUAL "CUDA")
	target_link_libraries(${GPU_MOD_NAME} PRIVATE CUDA::cudart CUDA::cuda_driver)
	else()
	target_link_libraries(${GPU_MOD_NAME} PRIVATE ${TORCH_LIBRARIES})
	endif()

[Build] Fix cuda link target of cumem_allocator in CPU env #12863

[Build] Fix cuda link target of cumem_allocator in CPU env #12863

Conversation

guoyuhong commented Feb 7, 2025 • edited by github-actions bot Loading

github-actions bot commented Feb 7, 2025

guoyuhong commented Feb 8, 2025

tlrmchlsmth Feb 10, 2025

Choose a reason for hiding this comment

tlrmchlsmth Feb 10, 2025

Choose a reason for hiding this comment

guoyuhong Feb 10, 2025

Choose a reason for hiding this comment

tlrmchlsmth left a comment

Choose a reason for hiding this comment

guoyuhong commented Feb 11, 2025

youkaichao left a comment

Choose a reason for hiding this comment

guoyuhong commented Feb 7, 2025 •

edited by github-actions bot

Loading