Release 1.0.0-beta4
elliottslaughter
released this
22 Apr 20:35
·
124 commits
to master
since this release
This is a beta for the upcoming 1.0.0 release.
Major changes since the last beta:
- Added support for LLVM 12 and 13
- Support for (previously deprecated) LLVM 3.5, 3.6 and 3.7 has been removed
- Added support for fast-math flags
- Added support for setting calling convention on functions
- Experimental support for atomicrmw instruction
- Experimental support for AMD GPU code generation
- Build process for the released binaries has been improved and updated
- Nix derivation has been substantially upgraded
- Fixes for performance regressions in CUDA code generation since LLVM 3.8
- Various long-standing documentation issues have been addressed
Binaries for this release were produced in the following configurations:
- Linux: x86_64 on Ubuntu 18.04 with LLVM 13 and CUDA 11.6 (verified to work on Ubuntu 20.04 and 22.04)
- macOS: x86_64 on macOS 10.15 with LLVM 13
- Windows: x86_64 on Server 2022 with LLVM 11 and CUDA 11.6
Note that, while we do our best to make the binaries compatible across OS versions/distros, in general we cannot guarantee compatibility outside of the specified OS release (except where otherwise noted).