v0.1.2
New features
- Aarch64 support (e.g. Apple Silicon, Raspberry Pi 4, ...) with nightly Rust and the crate feature aarch64_neon
Performance
- Another speedup on pure ASCII data
- Aligned reads have been removed as the performance was worse overall.
- Prefetch is used selectively on AVX 2, where it provides a slight benefit on some Intel CPUs.
Comparison vs v0.1.1 on x86-64
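To illustrate the kind of fast path that helps on pure ASCII data, here is a minimal scalar sketch, not the crate's actual SIMD implementation. The function name is_ascii_fast and the 8-byte word size are assumptions made for this example. It checks eight bytes at a time by testing the high bit of every byte in a u64, because any byte with the high bit set is non-ASCII.

```rust
/// Returns true if every byte in `data` is ASCII (high bit clear).
/// Hypothetical helper for illustration only; real SIMD code would
/// apply the same test to 16- or 32-byte vectors instead of a u64.
fn is_ascii_fast(data: &[u8]) -> bool {
    // One 0x80 mask per byte lane of the 64-bit word.
    const HIGH_BITS: u64 = 0x8080_8080_8080_8080;

    let mut chunks = data.chunks_exact(8);
    for chunk in &mut chunks {
        // Copy the 8-byte chunk into an array so it can be read as one word.
        // No alignment is required for this copy.
        let mut buf = [0u8; 8];
        buf.copy_from_slice(chunk);
        // Byte order does not matter: the mask is the same in every lane.
        let word = u64::from_ne_bytes(buf);
        if word & HIGH_BITS != 0 {
            return false;
        }
    }

    // Check the trailing 0..=7 bytes one at a time.
    chunks.remainder().iter().all(|b| b & 0x80 == 0)
}
```

If a function like this returns true, the input is also valid UTF-8, so a validator can skip the multi-byte checks for that block. Whether such a path pays off depends on the data and the CPU.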
Other
- Refactored SIMD integration to allow easy implementation for new architectures
- Full test coverage
- Thoroughly fuzz-tested