Skip to content
This repository has been archived by the owner on Mar 21, 2024. It is now read-only.

libcu++ 1.6.0 (CUDA Toolkit 11.5)

Compare
Choose a tag to compare
@wmaxey wmaxey released this 09 Aug 04:49
· 708 commits to main since this release

libcu++ 1.6.0 (CUDA Toolkit 11.5)

libcu++ 1.6.0 is a major release. It changes the default alignment of cuda::std::complex for
better code generation and changes cuda::std::atomic to use <nv/target> as the primary dispatch mechanism.

This release adds cuda::annotated_ptr and cuda::access_property, two APIs that allow associating an address
space and an explicit caching policy with a pointer, and the related cuda::apply_access_property,
cuda::associate_access_property and cuda::discard_memory APIs.

This release introduces ABI version 4, which is now the default.

Supported ABI Versions: 4 (default), 3, and 2.

Issues Fixed

  • #194: <cuda/std/barrier> and <cuda/std/atomic> failed to compile with NVRTC.
  • #179: Refactors the atomic layer to allow for layering the host device/host abstractions.
  • #189: Changed pragmas for silencing chrono long double warnings.
  • #186: Allows <nv/target> to be used under NVRTC.
  • #177: Allows <nv/target> to build when compiled under C and C++98.
    • Thanks to David Olsen for this contribution.
  • #172: Introduces ABI version 4.
    • Forces cuda::std::complex alignment for enhanced performance.
    • Sets the internal representation of cuda::std::chrono literals to double.
  • #165: For tests on some older distributions keep using Python 3, but downgrade lit.
  • #164: Fixes testing issues related to Python 2/3 switch for lit.
    • Thanks to Royil Damer for this contribution.