[SYCL][NVPTX] Obey -fcuda-short-ptr when compiling SYCL for NVPTX #15642

frasercrmck · 2024-10-09T14:28:06Z

This flag turns pointers to CUDA's shared, const, and local address spaces into 32-bit pointers. This can potentially save on registers used for addressing calculations.

This option was being accepted by the frontend when compiling SYCL code, but was then reporting an error that the backend datalayout doesn't match the expected target description. This was because the option wasn't being caught by all parts of the toolchain, leading to inconsistencies.

This PR allows users to pass the option if they wish. They will see a warning that the compiler is linking against a libclc/libspirv that hasn't been compiled with this option, but this is likely harmless since libspirv doesn't manipulate pointers.

This makes pointers to CUDA shared, const, and local address spaces as being 32-bit pointers. This should bring decent performance improvements in certain programs.

hdelan · 2024-10-09T14:49:56Z

https://www.youtube.com/watch?v=koJlIGDImiU

konradkusiak97

This is great! And it looks good to me but maybe other people from @intel/llvm-reviewers-cuda want to also double-check.

clang/lib/Driver/ToolChains/Cuda.cpp

clang/lib/Driver/ToolChains/Clang.cpp

Naghasan · 2024-10-22T15:06:03Z

libclc/CMakeLists.txt

@@ -450,10 +450,15 @@ foreach( t ${LIBCLC_TARGETS_TO_BUILD} )
      list(APPEND flags -D__unix__)
    endif()

+    set(spirv_flags ${flags})
+    if( ARCH STREQUAL nvptx OR ARCH STREQUAL nvptx64 )
+      list(APPEND spirv_flags -Xclang -fcuda-short-ptr -mllvm -nvptx-short-ptr)


I think this is ok, but you could end up linking the libspirv (with short ptr) to a program compiled without. As builtins don't write pointers, I don't think this is an issue but would be good to test prior to merging and document this.

Note I've now reduced the scope of this PR and the option is no longer enabled by default. Thus libclc/libspirv is no longer compiled with this option. If a user passes -fcuda-short-ptr they'll see a warning while linking libclc/libspirv which I think is correct to do.

But yes in general we need to consider whether this is okay. I also think it's okay, and I've run several benchmarks with it and not seen a problem. Ideally we'd compile two versions of libspirv for NVPTX, but I don't know if it's going to be worth it for this relatively obscure option.

clang/test/Driver/sycl-nvptx-short-ptr.cpp

clang/test/CodeGenSYCL/nvptx-short-ptr.cpp

frasercrmck · 2024-12-11T10:33:08Z

@intel/llvm-gatekeepers this is ready to merge, thank you

[SYCL][NVPTX] Enable -fcuda-short-ptr by default

fdc2ad6

This makes pointers to CUDA shared, const, and local address spaces as being 32-bit pointers. This should bring decent performance improvements in certain programs.

frasercrmck requested review from a team as code owners October 9, 2024 14:28

frasercrmck requested a review from konradkusiak97 October 9, 2024 14:28

frasercrmck temporarily deployed to WindowsCILock October 9, 2024 14:29 — with GitHub Actions Inactive

konradkusiak97 approved these changes Oct 9, 2024

View reviewed changes

mdtoguchi reviewed Oct 9, 2024

View reviewed changes

clang/lib/Driver/ToolChains/Cuda.cpp Outdated Show resolved Hide resolved

srividya-sundaram reviewed Oct 9, 2024

View reviewed changes

clang/lib/Driver/ToolChains/Clang.cpp Show resolved Hide resolved

frasercrmck temporarily deployed to WindowsCILock October 10, 2024 04:27 — with GitHub Actions Inactive

Naghasan reviewed Oct 22, 2024

View reviewed changes

frasercrmck added 2 commits December 3, 2024 12:23

Merge remote-tracking branch 'origin/sycl' into sycl-nvptx-short-ptrs

9ec2d4c

reduce scope of PR

9ceb7d3

frasercrmck had a problem deploying to WindowsCILock December 3, 2024 12:26 — with GitHub Actions Error

frasercrmck changed the title ~~[SYCL][NVPTX] Enable -fcuda-short-ptr by default~~ [SYCL][NVPTX] Obey -fcuda-short-ptr when compiling SYCL for NVPTX Dec 3, 2024

add tests

4b1ce3b

frasercrmck requested a review from a team as a code owner December 3, 2024 12:51

frasercrmck temporarily deployed to WindowsCILock December 3, 2024 12:51 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock December 3, 2024 13:27 — with GitHub Actions Inactive

mdtoguchi reviewed Dec 3, 2024

View reviewed changes

clang/test/Driver/sycl-nvptx-short-ptr.cpp Outdated Show resolved Hide resolved

remove requires from test

4243e07

frasercrmck temporarily deployed to WindowsCILock December 3, 2024 17:47 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock December 3, 2024 21:16 — with GitHub Actions Inactive

elizabethandrews reviewed Dec 5, 2024

View reviewed changes

clang/test/CodeGenSYCL/nvptx-short-ptr.cpp Show resolved Hide resolved

frasercrmck added 2 commits December 9, 2024 10:51

Merge remote-tracking branch 'origin/sycl' into sycl-nvptx-short-ptrs

b2301c3

update test with comments and more checks

516261c

frasercrmck temporarily deployed to WindowsCILock December 9, 2024 11:21 — with GitHub Actions Inactive

frasercrmck temporarily deployed to WindowsCILock December 9, 2024 12:02 — with GitHub Actions Inactive

mdtoguchi approved these changes Dec 9, 2024

View reviewed changes

elizabethandrews approved these changes Dec 10, 2024

View reviewed changes

ldrumm merged commit 83fe1c1 into intel:sycl Dec 11, 2024
14 checks passed

frasercrmck deleted the sycl-nvptx-short-ptrs branch December 11, 2024 11:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL][NVPTX] Obey -fcuda-short-ptr when compiling SYCL for NVPTX #15642

[SYCL][NVPTX] Obey -fcuda-short-ptr when compiling SYCL for NVPTX #15642

frasercrmck commented Oct 9, 2024 •

edited

Loading

hdelan commented Oct 9, 2024

konradkusiak97 left a comment

Naghasan Oct 22, 2024

frasercrmck Dec 3, 2024

frasercrmck commented Dec 11, 2024

[SYCL][NVPTX] Obey -fcuda-short-ptr when compiling SYCL for NVPTX #15642

[SYCL][NVPTX] Obey -fcuda-short-ptr when compiling SYCL for NVPTX #15642

Conversation

frasercrmck commented Oct 9, 2024 • edited Loading

hdelan commented Oct 9, 2024

konradkusiak97 left a comment

Choose a reason for hiding this comment

Naghasan Oct 22, 2024

Choose a reason for hiding this comment

frasercrmck Dec 3, 2024

Choose a reason for hiding this comment

frasercrmck commented Dec 11, 2024

frasercrmck commented Oct 9, 2024 •

edited

Loading