- bug-fixes and numerous performance improvements
- added real-to-real sin and cos transformations of type I and II
- using multi-threading for the FFTW backend
- improved DPC++/SYCL synchronization logic
- improved CMake build system, now using CUDA and HIP as a CMake language, requires CMake 3.19/3.21