During ahead-of-time compilation for the HIP backend, the macro __CUDA_ARCH__ is defined as 0, while it should not be defined at all. Libraries usually check only whether __CUDA_ARCH__ is defined to enable CUDA-specific code, without checking its value, so compilation for the HIP backend fails.
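The difference between a presence-only check and a value-aware check can be sketched as follows. This is a minimal illustration, not code from any particular library: the two function names are hypothetical, and the `#define __CUDA_ARCH__ 0` at the top only simulates the reported state on an ordinary host compiler.

```cpp
// Simulation only: mimic the state reported for the HIP AOT pass on a plain
// host compiler. A real build would never define this macro by hand.
#ifndef __CUDA_ARCH__
#define __CUDA_ARCH__ 0
#endif

// Presence-only check, the pattern the report attributes to many libraries.
// It selects the CUDA path whenever the macro exists, even with value 0.
constexpr bool presence_check_enables_cuda_path() {
#if defined(__CUDA_ARCH__)
    return true;
#else
    return false;
#endif
}

// Hypothetical stricter check: the macro must also have a nonzero value.
constexpr bool value_check_enables_cuda_path() {
#if defined(__CUDA_ARCH__) && (__CUDA_ARCH__ > 0)
    return true;
#else
    return false;
#endif
}
```

With the macro simulated as 0, the presence-only check takes the CUDA path while the value-aware check does not. Whether a value-aware check is an acceptable workaround for a given library is an assumption that has to be verified against that library's code.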
To reproduce
test.cpp
#include <sycl/sycl.hpp>

int main()
{
    return 0;
}
Hi, good timing and good catch. I noticed the same thing while fixing another regression that defined this macro during SYCL AOT compilation for CUDA (where we want __SYCL_CUDA_ARCH__ instead), and I intended to follow up with a HIP fix afterwards.
I'll ping you when it's up. :)
This commit updates NVIDIA CCCL to version 2.7.0 and starts following
the official repository rather than my fork.
In order to make this work, I had to incorporate a temporary workaround
for intel/llvm#15544 until we start using a
release with intel/llvm#15443.
Closes acts-project#660.
To reproduce
test.cpp
icpx -fsycl -fsycl-targets=amd_gpu_gfx90a -dM -E test.cpp | grep CUDA_ARCH
The output of the command above is
#define __CUDA_ARCH__ 0
while it should not be defined.
Environment
Additional context
No response