heFFTe 2.3
- added option for batch-FFTs (multiple signals with one command) which significantly lowers latency
- added option to work on a sub-communicator (subset of the mpi ranks) which reduces communication
- improved ROCm memory buffering and synchronization
- improved OneAPI support
- numerous performance improvements