You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Dec 9, 2018. It is now read-only.
I noticed that the nvptx repo was using wrapping_add, which was curious. In looking into it, I noticed that the type of the intrinsics seems to differ from what the CUDA guide states. Is this intentional?
The LLVM definition use i32 so I'm sticking to that. If you are using C/C++ (and CUDA is C/C++ + some language extensions) I suppose it doesn't matter what LLVM (the backend) uses because the compiler will implicitly cast between integer types.
Hi,
I noticed that the nvptx repo was using wrapping_add, which was curious. In looking into it, I noticed that the type of the intrinsics seems to differ from what the CUDA guide states. Is this intentional?
http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#built-in-variables
The text was updated successfully, but these errors were encountered: