@Sergei-Lebedev
Contributor

What?

Detect CUDA version and build device code for supported GPU architectures

@Sergei-Lebedev force-pushed the topic/detect_cuda_version branch 2 times, most recently from 090d4fa to 442d758 (September 17, 2025 13:19)
@Sergei-Lebedev force-pushed the topic/detect_cuda_version branch 2 times, most recently from 0351eb9 to 933f83b (September 25, 2025 20:28)
@Sergei-Lebedev force-pushed the topic/detect_cuda_version branch from 933f83b to 2b83a52 (October 6, 2025 14:27)
@michal-shalev
Contributor

This branch was tested on EOS. It fixed the perftest error there, but it adds a lot of time to the build (a few minutes). Is there anything that could be done to make it shorter?

@Sergei-Lebedev
Contributor Author

> This branch was tested on EOS. It fixed the perftest error there, but it adds a lot of time to the build (a few minutes). Is there anything that could be done to make it shorter?

This PR simply makes the UCX build compile device code for every GPU architecture supported by the installed NVCC compiler. Because there are many architectures to build, compilation takes longer. If you know which GPU model you are going to use, you can still set a specific architecture with the --with-nvcc-gencode configure option.
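To illustrate why the build time grows with the number of architectures, here is a minimal Python sketch, not the PR's actual configure logic. It assumes the architecture list comes from text shaped like the output of `nvcc --list-gpu-arch` (one `compute_XX` entry per line) and that each entry is turned into one `-gencode` pair. The function name `gencode_flags` and the sample input are hypothetical.

```python
import re


def gencode_flags(list_gpu_arch_output):
    """Turn `nvcc --list-gpu-arch`-style text into a list of -gencode flags.

    Assumption: each relevant line looks like `compute_80`; anything else
    (blank lines, warnings) is ignored. Duplicates are emitted only once.
    """
    flags = []
    seen = set()
    for line in list_gpu_arch_output.splitlines():
        match = re.fullmatch(r"compute_(\d+[a-z]?)", line.strip())
        if match is None:
            continue
        arch = match.group(1)
        if arch in seen:
            continue
        seen.add(arch)
        # One flag per architecture: every extra entry is one more
        # device-code compilation pass, hence the longer build.
        flags.append(f"-gencode=arch=compute_{arch},code=sm_{arch}")
    return flags


# Hypothetical sample; a real toolchain typically lists many more entries.
sample = "compute_80\ncompute_90\n"
print(" ".join(gencode_flags(sample)))
# prints: -gencode=arch=compute_80,code=sm_80 -gencode=arch=compute_90,code=sm_90
```

To shorten the build when the target GPU is known, a single pair can be passed to configure instead, for example `./configure --with-nvcc-gencode="-gencode=arch=compute_90,code=sm_90"`. The exact value depends on your GPU, and the value format shown here follows standard nvcc `-gencode` syntax rather than anything stated in this thread.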

@michal-shalev enabled auto-merge (squash) October 12, 2025 10:11
@michal-shalev merged commit 63be744 into openucx:master Oct 12, 2025
141 checks passed
zzhang37 pushed a commit to zzhang37/ucx that referenced this pull request Oct 16, 2025
3 participants