Releases: pghysels/STRUMPACK
Releases · pghysels/STRUMPACK
STRUMPACK v8.0.0
- New experimental sparse symmetric solver (CUDA only)
- SZ3 compression as an alternative to ZFP
- Many BLR improvements
- GPU performance improvements, making KBLaS optional
- Fixes in BLR COLWISE compression
- Added ZFP compression accuracy mode as alternative to precision mode.
- Accuracy mode is now the default. (set accuracy to negative value to use precision mode)
- Various HIP and SYCL fixes
- Update (PT)Scotch dependency, now requiring scotch >= 7.0.4 (build with CMake)
- Update ButterflyPACK interface, now requiring ButterflyPACK >= 3.2.0.
- Remove dynamic_cast from header files
STRUMPACK 7.2.0
- Fix for sparse solve with MAGMA with 1 MPI rank
- suppress several warnings
- small bugfix in BLR partitioning in sparse solver
- SYCL CMake compilation fix
- In StructuredMatrix interface, construct_from_elements now takes
point geometry for faster HSS compression - In StructuredMatrix interface, BLR compress_and_factor is now
supported
STRUMPACK v7.1.4
- Memory leak fix from MPI_Datatypes
- Set RPATH in strumpack library
- Add counting of subnormal number in the sparse factors
- Changes in the BLR compression tolerances in the sparse solver,
using a scaled absolute tolerance - Fixes in the sparse solver using MAGMA, resetting error codes
which show up in larger systems, but are not actually errors. - Set SOVERSION
- Update CMake for HIP, using enable_language(HIP), requires CMake 3.21
STRUMPACK 7.1.3
Workaround for SLATE <= 20220700
STRUMPACK 7.1.2
Small bugfix
STRUMPACK 7.1.1
ROCm compilation fix
STRUMPACK 7.1.0
- Bugfix in matrix equilibration code
- Several bugfixes, especially for SLATE and large problems on GPU
- Sparse triangular solve on the GPU when using MAGMA
- Other MAGMA fixes for the sparse direct solver (MAGMA still optional)
- New HSS random sketching operators based on sparse Johnson Lindestrauss
- Fix for HODLRMatrix construction from elements, or blocks
- Compilation fixes for NVHPC compiler
- SYCL updates
- Add lapmr routines, which are not available in Mac LAPACK implementations
- Support newer ( >= 1.0) ZFP versions
- Fixes for clang 15
- Add NDBFS GPU matrix ordering code
STRUMPACK 7.0.1
v7.0.1 STRUMPACK v7.0.1
STRUMPACK 7.0.0
- Many bugfixes and general improvements.
- Important fixes in the GPU code, and in the usage of SLATE
(GPU capable ScaLAPACK replacement). - The default ordering now uses METIS_NodeND, instead of the
(undocumented) METIS_NodeNDP routine. This can impact performance,
or for some problems lead to stack overflow, but for others it
drastically reduces memory usage. The old behavior can be restored
with --sp_enable_METIS_NodeNDP.
STRUMPACK 6.3.1
- Fix for setting CUDA/HIP device when there are multiple, but MPI was not initialized
- Memory leak fix in distributed memory GPU code
- Fixed small memory leaks from MPI datatypes
- Change in BLR algorithm selection options
- Changed default blocksize for 2D block cyclic distribution when using SLATE to 512
- Add 64bit support in the matching (MC64)
- Fix installation of Fortran modules