Skip to content

Releases: NVIDIA/MatX

v0.9.2

29 Jul 19:13
fa9e872
Compare
Choose a tag to compare

New operator: interp

Other Additions:

  • Improvements to sparse support including new batched tri-diagonal solver
  • Automatic vectorization and ILP support
  • DLPack updated to 1.1
  • Many bug fixes

What's Changed

New Contributors

Full Changelog: v0.9.1...v0.9.2

v0.9.1

14 May 15:43
4475c22
Compare
Choose a tag to compare

Sparse support + bugfixes

  • New operators: argminmax, dense2sparse, sparse2dense, interp1, normalize, argsort
  • Removed requirement for --relaxed-constexpr
  • Added MatX NVTX domain
  • Significantly improved speed of svd and inv
  • Python integration sample
  • Experimental sparse tensor support (SpMM and solver routines supported)
  • Significantly reduced FFT memory usage

What's Changed

Read more

v0.9.0

15 Oct 18:12
af55b57
Compare
Choose a tag to compare

Version v0.9.0 adds comprehensive support for more host CPU transforms such as BLAS and LAPACK, including multi-threaded versions.

Beyond the CPU support, there are many more minor improvements:

  • Added several new operators include vector_norm, matrix_norm, frexp, diag, and more
  • Many compiler fixes to support a wider range of older and newer compilers
  • Performance improvements to avoid overhead of permutation operators when unnecessary
  • Much more!

A full changelist is below

What's Changed

Read more

v0.8.0

04 Apr 17:27
7719779
Compare
Choose a tag to compare

Release highlights:

  • Features
    • Updated cuTENSOR and cuTensorNet versions
    • Added configurable print formatting
    • ARM FFT support via NVPL
    • New operators: abs2(), outer(), isnan(), isinf()
    • Many more unit tests for CPU tests
  • Bug fixes for matmul on Hopper, 2D FFTs, and more

Full changelist:

What's Changed

New Contributors

Full Changelog: v0.7.0...v0.8.0

v0.7.0

04 Jan 21:06
Compare
Choose a tag to compare

Features

Fixes

Full Changelog: v0.6.0...v0.7.0

v0.6.0

02 Oct 16:50
Compare
Choose a tag to compare

Notable Updates

Full changelog below:

What's Changed

New Contributors

Full Changelog: v0.5.0...v0.6.0

v0.5.0

03 Jul 21:38
Compare
Choose a tag to compare

Notable Updates

  • Documentation rewritten to include working examples for every function based on unit tests
  • Polyphase resampler based on SciPy/cuSignal's resample_poly

Full changelog below:

What's Changed

New Contributors

Full Changelog: v0.4.1...v0.5.0

v0.4.1

02 Jun 15:17
Compare
Choose a tag to compare

This is a minor release mostly focused on bug fixes for different compilers and CUDA versions. One major feature added was all reductions are supported on the host using a single threaded executor. Multi-threaded executor support coming soon.

What's Changed

Full Changelog: v0.4.0...v0.4.1

v0.4.0

03 Apr 15:55
Compare
Choose a tag to compare

New Features

Bug Fixes

New Contributors

Full Changelog: v0.3.0...v0.4.0

v0.3.0

20 Jan 19:43
20e00a2
Compare
Choose a tag to compare

v0.3.0 marks a major release with over 100 features and bug fixes. Release cadence will occur more frequently after this release to support users not living at the HEAD.

What's Changed

Read more