This repository has been archived by the owner on Mar 21, 2024. It is now read-only.
libcu++ 1.6.0 (CUDA Toolkit 11.5)
libcu++ 1.6.0 (CUDA Toolkit 11.5)
libcu++ 1.6.0 is a major release. It changes the default alignment of cuda::std::complex
for
better code generation and changes cuda::std::atomic
to use <nv/target>
as the primary dispatch mechanism.
This release adds cuda::annotated_ptr
and cuda::access_property
, two APIs that allow associating an address
space and an explicit caching policy with a pointer, and the related cuda::apply_access_property
,
cuda::associate_access_property
and cuda::discard_memory
APIs.
This release introduces ABI version 4, which is now the default.
Supported ABI Versions: 4 (default), 3, and 2.
Issues Fixed
- #194:
<cuda/std/barrier>
and<cuda/std/atomic>
failed to compile with NVRTC. - #179: Refactors the atomic layer to allow for layering the host device/host abstractions.
- #189: Changed pragmas for silencing chrono long double warnings.
- #186: Allows
<nv/target>
to be used under NVRTC. - #177: Allows
<nv/target>
to build when compiled under C and C++98.- Thanks to David Olsen for this contribution.
- #172: Introduces ABI version 4.
- Forces
cuda::std::complex
alignment for enhanced performance. - Sets the internal representation of
cuda::std::chrono
literals todouble
.
- Forces
- #165: For tests on some older distributions keep using Python 3, but downgrade lit.
- #164: Fixes testing issues related to Python 2/3 switch for lit.
- Thanks to Royil Damer for this contribution.