`__CUDA_ARCH__` is defined when compiling AOT for HIP #15544

AuroraPerego · 2024-09-27T23:16:32Z

Describe the bug

During ahead-of-time compilation for the HIP backend, the macro __CUDA_ARCH__ is defined as 0, while it should not be defined at all. Usually, libraries check only if __CUDA_ARCH__ is defined to enable CUDA specific code, without checking its value, causing compilation for the HIP backend to fail.

To reproduce

Include code snippet as short as possible

`test.cpp`

#include <sycl/sycl.hpp>
int main()
{
    return 0;
}

Specify the command which should be used to compile the program

icpx -fsycl -fsycl-targets=amd_gpu_gfx90a -dM -E test.cpp | grep CUDA_ARCH

Indicate what is wrong and what was expected
the output of the command above is

#define __CUDA_ARCH__ 0

while it should not be defined.

Environment

OS: RHEL 8.10
Target device and vendor: AMD GPU MI250X

icpx version:

Intel(R) oneAPI DPC++/C++ Compiler 2024.2.1 (2024.2.1.20240711)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/intel/oneapi/compiler/2024.2/bin/compiler
Configuration file: /opt/intel/oneapi/compiler/2024.2/bin/compiler/../icpx.cfg

Dependencies version:

HIP version: 6.2.41133-dd7f95766
AMD clang version 18.0.0git (https://github.com/RadeonOpenCompute/llvm-project roc-6.2.0 24292 26466ce804ac523b398608f17388eb6d605a3f09)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/rocm-6.2.0/lib/llvm/bin
Configuration file: /opt/rocm-6.2.0/lib/llvm/bin/clang++.cfg

Additional context

No response

The text was updated successfully, but these errors were encountered:

AuroraPerego · 2024-09-27T23:18:02Z

FYI @fwyzard @ivorobts

GeorgeWeb · 2024-09-28T13:18:51Z

Hi, good timing with this and good catch. I also noticed that happening while fixing another regression defining this macro in the SYCL for Cuda AOT compilation (where we want 'SYCL_CUDA_ARCH' instead) and intended to follow up with a HIP patch-fix afterwards.
I'll ping you when it's up. :)

npmiller · 2024-09-30T09:00:35Z

~~This should be fixed by: #15441~~

Nevermind I got my patches mixed up, the patches for this isn't up yet

This commit updates NVIDIA CCCL to version 2.7.0 and starts following the official repository rather than my fork. In order to make this work, I had to incorporate a temporary workaround for intel/llvm#15544 until we start using a release with intel/llvm#15443. Closes acts-project#660.

AuroraPerego added bug Something isn't working hip Issues related to execution on HIP backend. labels Sep 27, 2024

bader mentioned this issue Sep 27, 2024

[SYCL] Move driver related __CUDA_ARCH__ test to Driver folder from Preprocessor #15521

Merged

GeorgeWeb self-assigned this Sep 28, 2024

GeorgeWeb mentioned this issue Oct 2, 2024

[SYCL][Driver][HIP] Do not define __CUDA_ARCH__ for HIP-AMD targets #15443

Merged

sarnex closed this as completed in #15443 Oct 17, 2024

sarnex closed this as completed in 65e642e Oct 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`__CUDA_ARCH__` is defined when compiling AOT for HIP #15544

`__CUDA_ARCH__` is defined when compiling AOT for HIP #15544

AuroraPerego commented Sep 27, 2024

AuroraPerego commented Sep 27, 2024

GeorgeWeb commented Sep 28, 2024 •

edited

Loading

npmiller commented Sep 30, 2024 •

edited

Loading

__CUDA_ARCH__ is defined when compiling AOT for HIP #15544

__CUDA_ARCH__ is defined when compiling AOT for HIP #15544

Comments

AuroraPerego commented Sep 27, 2024

Describe the bug

To reproduce

test.cpp

Environment

Additional context

AuroraPerego commented Sep 27, 2024

GeorgeWeb commented Sep 28, 2024 • edited Loading

npmiller commented Sep 30, 2024 • edited Loading

`__CUDA_ARCH__` is defined when compiling AOT for HIP #15544

`__CUDA_ARCH__` is defined when compiling AOT for HIP #15544

`test.cpp`

GeorgeWeb commented Sep 28, 2024 •

edited

Loading

npmiller commented Sep 30, 2024 •

edited

Loading