Build aarch64-apple-darwin wheels natively to reduce CI time because cross-compiling is generally slower.

### This is currently being blocked by

- [x] https://github.com/actions/runner-images/issues/2187
- [x] https://github.com/github/roadmap/issues/528