This patch adds a more efficient lowering for vecreduce.min/max under NEON,
using sequences of pairwise vpmin/vpmax to reduce to a single value.
This nearly resolves issues such as #50466, #40981, #38190.
Differential Revision: https://reviews.llvm.org/D146404
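The pairwise reduction described in the patch can be illustrated with a small scalar model. This is only a sketch: `vpmax` here is a plain-Python stand-in for the NEON pairwise-max operation (adjacent pairs of the first operand, then adjacent pairs of the second), and `reduce_max_pairwise` is a hypothetical helper name. The exact instruction sequence the patch emits is not shown in this text.

```python
def vpmax(a, b):
    """Scalar model of a pairwise max (assumed semantics: maxima of
    adjacent pairs of `a`, followed by maxima of adjacent pairs of `b`)."""
    assert len(a) == len(b) and len(a) % 2 == 0

    def pairs(v):
        return [max(v[i], v[i + 1]) for i in range(0, len(v), 2)]

    return pairs(a) + pairs(b)


def reduce_max_pairwise(v):
    """Reduce a power-of-two-length vector using log2(n) pairwise steps.

    Each step feeds the vector to both operands, so the distinct
    candidates halve every round and lane 0 ends up holding the maximum.
    """
    n = len(v)
    assert n >= 2 and n & (n - 1) == 0, "length must be a power of two"
    steps = n.bit_length() - 1  # log2(n)
    for _ in range(steps):
        v = vpmax(v, v)
    return v[0]


if __name__ == "__main__":
    print(reduce_max_pairwise([3, 9, 1, 7, 4, 8, 2, 6]))
```

An 8-lane vector needs three pairwise steps in this model. The same structure works for a minimum by swapping `max` for `min`.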
Extended Description
The binary reduction intrinsics on AArch64 (and ARM) generate suboptimal code for vectors of i1. This issue is similar to #38188.
produces
when it could instead use vmaxvq (or vpmax on ARM).
The same goes for vector.reduce.and with vminvq (or vpmin on ARM).
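The reason a max or min reduction can stand in for an or/and reduction over boolean lanes can be checked exhaustively in a small model. The sketch below assumes each i1 lane has been widened to a byte that is either 0 or all-ones (`LANE_TRUE` is a hypothetical name for that value); under that assumption an OR reduction equals an unsigned maximum and an AND reduction equals an unsigned minimum.

```python
from functools import reduce
from itertools import product
import operator

# Assumption: a true i1 lane is represented as an all-ones byte.
LANE_TRUE = 0xFF


def reduce_or(lanes):
    """Bitwise OR across all lanes."""
    return reduce(operator.or_, lanes)


def reduce_and(lanes):
    """Bitwise AND across all lanes."""
    return reduce(operator.and_, lanes)


def check_equivalence(num_lanes=4):
    """Exhaustively compare or/and reductions with max/min for every
    combination of boolean lanes of the given width."""
    for lanes in product([0, LANE_TRUE], repeat=num_lanes):
        if reduce_or(lanes) != max(lanes):
            return False
        if reduce_and(lanes) != min(lanes):
            return False
    return True


if __name__ == "__main__":
    print(check_equivalence(4))
```

The equivalence holds only because every lane takes one of two values; for arbitrary integer lanes, OR and max generally differ.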