Add AArch64-optimized SYMV kernels #5221

tetsuzo-usui · 2025-04-11T12:22:09Z

This pull request adds [SD]SYMV kernels optimized for arm64.
Previously, generic/symv_k.c has been used on arm64 systems, in which the calculation falls back to using the GEMV kernels. This approach involves accessing each matrix element twice. To address this inefficiency, I’ve implemented new kernels by using a technique analogous to those employed in the x86_64 SYMV kernels.
As shown in the graphs below, performance is improved by about 2x on the A64FX, Graviton3E, Grace, and Ampere Altra Max platforms, respectively.

SYMV is an important component in symmetric matrix eigenvalue computations. Consequently, this PR yields performance improvements in higher-level routines such as DSYEVD. Specifically, the execution time of DSYTRD (tridiagonalization), which is the initial step within DSYEVD, is reduced as shown in the graph.

Add symv kernels for arm64

d711906

martin-frbg added this to the 0.3.30 milestone Apr 11, 2025

martin-frbg merged commit afb6645 into OpenMathLib:develop Apr 16, 2025
86 checks passed

tetsuzo-usui mentioned this pull request Jul 1, 2025

Improve [SD]SYEVD performance by parallelizing [SD]LAED3 #5355

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add AArch64-optimized SYMV kernels #5221

Add AArch64-optimized SYMV kernels #5221

Uh oh!

tetsuzo-usui commented Apr 11, 2025

Uh oh!

Uh oh!

Uh oh!

Add AArch64-optimized SYMV kernels #5221

Add AArch64-optimized SYMV kernels #5221

Uh oh!

Conversation

tetsuzo-usui commented Apr 11, 2025

Uh oh!

Uh oh!

Uh oh!