Commit
Fix nvvm.compile_program() failure for CUDA version 12.0
The original datalayout string lacked explicit size and alignment entries for i1, i8, i16, f32, f64, v64, and v128. CUDA 12.0 enforces a stricter layout and needs these entries for LLVM-based compilation; later CUDA versions are more forgiving when they are omitted. Adding the entries explicitly should resolve the failure on CUDA 12.0 without breaking compatibility with later versions.
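As a rough illustration of the check described above, the sketch below reports which of the listed types have no explicit entry in a datalayout string. The helper `missing_specs` and both example strings are assumptions made for illustration; they are not taken from this commit's diff, and the exact string used in the fix may differ.

```python
# Types the commit message lists as needing explicit entries.
REQUIRED = ("i1", "i8", "i16", "f32", "f64", "v64", "v128")


def missing_specs(datalayout, required=REQUIRED):
    """Return the required type specs that have no explicit entry.

    Hypothetical helper: each '-'-separated field of an LLVM datalayout
    string starts with a type key such as 'i64' or 'v128', followed by
    ':'-separated size/alignment values.
    """
    present = {field.split(":", 1)[0] for field in datalayout.split("-")}
    return [t for t in required if t not in present]


# Illustrative strings only (assumed, not copied from the commit).
SHORT_LAYOUT = "e-i64:64-i128:128-v16:16-v32:32-n16:32:64"
EXPLICIT_LAYOUT = (
    "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-i128:128:128-"
    "f32:32:32-f64:64:64-v16:16:16-v32:32:32-v64:64:64-v128:128:128-n16:32:64"
)

print(missing_specs(SHORT_LAYOUT))     # all seven listed types are absent
print(missing_specs(EXPLICIT_LAYOUT))  # nothing is missing
```

A check like this only confirms that the entries are present in the string; whether a given CUDA toolkit accepts the layout still has to be verified by actually calling nvvm.compile_program() against that version.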