[SYCL][Driver] Fix regression that enabled Cuda-mode in cc1 and defined __CUDA_ARCH__ #15441

GeorgeWeb · 2024-09-19T10:58:38Z

The CudaToolChain set -fcuda-is-device unconditionally which made InitializePredefinedMacros (called from clang::InitializePreprocessor) to define __CUDA_ARCH__ (default-init to 1). As such, the driver assumed Cuda mode while in also SYCL mode, but we don't properly support Cuda device-code compatibility and we want to avoid having the __CUDA_ARCH__ macro defined altogether for SYCL offload.

…ed __CUDA_ARCH__

clang/lib/Driver/ToolChains/Cuda.cpp

clang/test/Preprocessor/sycl-macro.cpp

…e Cuda toolchain

clang/test/Preprocessor/sycl-macro.cpp

GeorgeWeb · 2024-09-23T15:48:08Z

@intel/llvm-gatekeepers This looks good to merge now. Thanks.

bader · 2024-09-25T20:55:55Z

clang/test/Preprocessor/sycl-macro.cpp

-// RUN: %clang_cc1 %s  -triple nvptx64-nvidia-cuda -target-cpu sm_80 -fsycl-is-device -E -dM | FileCheck --check-prefix=CHECK-CUDA %s
+// RUN: %clang_cc1 %s  -triple nvptx64-nvidia-cuda -target-cpu sm_80 -fsycl-is-device -E -dM | FileCheck \
+// RUN: --check-prefix=CHECK-CUDA %s -DARCH_CODE=800
+// RUN: %clangxx %s -fsycl -nocudalib -fsycl-targets=nvptx64-nvidia-cuda -Xsycl-target-backend --offload-arch=sm_80 -E -dM | FileCheck \


This is the preprocessor test, so we should only call %clang_cc1. Please, move the driver test to clang/test/Driver/ directory.

While you are doing this, please, make sure you either have libspirv input files or pass -fno-sycl-libspirv option. I don't build NVPTX target and this test fails with:

clang: error: cannot find 'remangled-l64-signed_char.libspirv-nvptx64-nvidia-cuda.bc'; provide path to libspirv library via '-fsycl-libspirv-path', or pass '-fno-sycl-libspirv' to build without linking with libspirv clang: error: cannot find 'remangled-l64-signed_char.libspirv-nvptx64-nvidia-cuda.bc'; provide path to libspirv library via '-fsycl-libspirv-path', or pass '-fno-sycl-libspirv' to build without linking with libspirv

Noted. I'll sort that out asap once I am back on a work PC.

Addressed in #15521

… Preprocessor (#15521) This PR moves the driver invocation test that checks `__CUDA_ARCH__` does not get defined and ensures that it doesn't require the `libspirv-nvptx64-nvidia-cuda` bitcode files by passing `-fno-sycl-libspirv` to the `%clangxx` command. Link to the comment in related PR that reported this issue: #15441 (comment) Additionally, an extra test is added to check that the `-fcuda-is-device` option is not supplied in the CC1 invocation targeting `nvptx64-nvidia-cuda`, which enables `LangOptions.CudaIsDevice` and was the cause of defining the `__CUDA_ARCH__` macro.

GeorgeWeb had a problem deploying to WindowsCILock September 19, 2024 10:59 — with GitHub Actions Error

[SYCL][Driver] Fix regression that enabled Cuda-mode in cc1 and defin…

a188a0b

…ed __CUDA_ARCH__

GeorgeWeb force-pushed the georgi/cuda-arch-define branch from 186d6bd to a188a0b Compare September 19, 2024 11:30

GeorgeWeb changed the title ~~Fix regression that enabled Cuda-mode speific cc1 args~~ [SYCL][Driver] Fix regression that enabled Cuda-mode in cc1 and defined __CUDA_ARCH__ Sep 19, 2024

GeorgeWeb marked this pull request as ready for review September 19, 2024 11:30

GeorgeWeb requested review from a team as code owners September 19, 2024 11:30

GeorgeWeb temporarily deployed to WindowsCILock September 19, 2024 11:30 — with GitHub Actions Inactive

GeorgeWeb had a problem deploying to WindowsCILock September 19, 2024 12:04 — with GitHub Actions Failure

frasercrmck reviewed Sep 19, 2024

View reviewed changes

clang/lib/Driver/ToolChains/Cuda.cpp Outdated Show resolved Hide resolved

frasercrmck reviewed Sep 19, 2024

View reviewed changes

clang/lib/Driver/ToolChains/Cuda.cpp Outdated Show resolved Hide resolved

Follow the no parens coding standard for one-liner if-statements

c9e6676

GeorgeWeb temporarily deployed to WindowsCILock September 19, 2024 13:52 — with GitHub Actions Inactive

GeorgeWeb temporarily deployed to WindowsCILock September 19, 2024 14:26 — with GitHub Actions Inactive

mdtoguchi reviewed Sep 19, 2024

View reviewed changes

clang/test/Preprocessor/sycl-macro.cpp Outdated Show resolved Hide resolved

Remove unnecessary define from the -cc1 line

3c9c60c

GeorgeWeb temporarily deployed to WindowsCILock September 19, 2024 15:46 — with GitHub Actions Inactive

GeorgeWeb temporarily deployed to WindowsCILock September 19, 2024 16:20 — with GitHub Actions Inactive

Add -fsycl-is-host and -std=c++17 only for .cu SYCL compilation in th…

9dd55b9

…e Cuda toolchain

GeorgeWeb temporarily deployed to WindowsCILock September 19, 2024 21:42 — with GitHub Actions Inactive

GeorgeWeb temporarily deployed to WindowsCILock September 19, 2024 22:15 — with GitHub Actions Inactive

mdtoguchi approved these changes Sep 19, 2024

View reviewed changes

frasercrmck reviewed Sep 20, 2024

View reviewed changes

clang/test/Preprocessor/sycl-macro.cpp Outdated Show resolved Hide resolved

frasercrmck approved these changes Sep 20, 2024

View reviewed changes

Update the test check lines

316a1b0

GeorgeWeb temporarily deployed to WindowsCILock September 20, 2024 11:31 — with GitHub Actions Inactive

GeorgeWeb temporarily deployed to WindowsCILock September 20, 2024 12:15 — with GitHub Actions Inactive

elizabethandrews approved these changes Sep 20, 2024

View reviewed changes

smanna12 approved these changes Sep 20, 2024

View reviewed changes

sarnex merged commit 9fd767d into intel:sycl Sep 23, 2024
12 checks passed

bader reviewed Sep 25, 2024

View reviewed changes

GeorgeWeb mentioned this pull request Sep 26, 2024

[SYCL] Move driver related __CUDA_ARCH__ test to Driver folder from Preprocessor #15521

Merged

npmiller mentioned this pull request Sep 30, 2024

__CUDA_ARCH__ is defined when compiling AOT for HIP #15544

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL][Driver] Fix regression that enabled Cuda-mode in cc1 and defined __CUDA_ARCH__ #15441

[SYCL][Driver] Fix regression that enabled Cuda-mode in cc1 and defined __CUDA_ARCH__ #15441

GeorgeWeb commented Sep 19, 2024 •

edited

Loading

GeorgeWeb commented Sep 23, 2024

bader Sep 25, 2024

GeorgeWeb Sep 25, 2024

GeorgeWeb Sep 26, 2024

[SYCL][Driver] Fix regression that enabled Cuda-mode in cc1 and defined __CUDA_ARCH__ #15441

[SYCL][Driver] Fix regression that enabled Cuda-mode in cc1 and defined __CUDA_ARCH__ #15441

Conversation

GeorgeWeb commented Sep 19, 2024 • edited Loading

GeorgeWeb commented Sep 23, 2024

bader Sep 25, 2024

Choose a reason for hiding this comment

GeorgeWeb Sep 25, 2024

Choose a reason for hiding this comment

GeorgeWeb Sep 26, 2024

Choose a reason for hiding this comment

GeorgeWeb commented Sep 19, 2024 •

edited

Loading