M2 Compatibility #5

sfjohnson · 2023-03-08T00:08:42Z

Hi,

On my M2 (2022 MacBook Air) I'm getting the following:

AMX_LDX: fail
AMX_LDY: fail
AMX_LDZ: pass
AMX_LDZI: pass
AMX_STX: pass
AMX_STY: pass
AMX_STZ: pass
AMX_STZI: pass
AMX_EXTRX: fail
AMX_EXTRY: fail
AMX_MAC16: pass
AMX_FMA16: pass
AMX_FMA32: pass
AMX_FMA64: pass
AMX_FMS16: pass
AMX_FMS32: pass
AMX_FMS64: pass
AMX_VECINT: fail
AMX_VECFP: fail
AMX_MATINT: pass
AMX_MATFP: fail
AMX_GENLUT: fail

This is with clang 14.0.0 on macOS 13.1. I could be doing something wrong here, or there might be a minor architectural difference between AMX on M1 and M2.

I'm going to investigate further to see if I can get everything to pass on M2, but first I was wondering if there has been any existing work done on M2 yet?

Thanks.


corsix · 2023-03-08T22:48:48Z

I've just rented an M2 machine in the cloud for a month, and I see the same thing. My money would be on "minor architectural difference"...

sfjohnson · 2023-03-09T00:48:40Z

Ok cool, I'll explore a bit and let you know if I find anything!

corsix · 2023-03-10T23:30:45Z

Some of the changes seem to be:

- extrh, extrv, vecfp, matfp, genlut gaining bf16 modes (the CPU cores also gain BF16 support in M2)
- extrh and extrv gaining some f32 -> f16 (or bf16) mixed lane-width modes
- ldx and ldy gaining "load four at a time" mode
- extrh, extrv, vecint, vecfp gaining support for operating on two or four vectors in a single instruction, rather than just a single vector
- vecint and vecfp gaining some new ALU modes (albeit not particularly interesting modes, e.g. z = x * y, z = z + x, z = z + y)
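For readers unfamiliar with bf16: it is the top 16 bits of an IEEE 754 f32 (same sign and 8-bit exponent, 7-bit mantissa). Below is a minimal C sketch of a software f32 -> bf16 narrowing and the widening back. The helper names `f32_to_bf16` and `bf16_to_f32` are hypothetical, and round-to-nearest-even is an assumption made for illustration; this thread does not say which rounding the M2 hardware applies in its mixed lane-width modes.

```c
#include <stdint.h>
#include <string.h>

/* Narrow an f32 to bf16 (hypothetical helper, not part of any AMX API).
 * Assumes round-to-nearest-even; the hardware's rounding may differ. */
static uint16_t f32_to_bf16(float f) {
    uint32_t bits;
    memcpy(&bits, &f, sizeof bits);           /* reinterpret without aliasing UB */
    if ((bits & 0x7fffffffu) > 0x7f800000u) {
        /* NaN: keep the top bits and force the quiet bit so it stays a NaN */
        return (uint16_t)((bits >> 16) | 0x0040u);
    }
    /* Add 0x7fff, plus 1 if the kept low bit is odd, so ties round to even */
    uint32_t rounding = 0x7fffu + ((bits >> 16) & 1u);
    return (uint16_t)((bits + rounding) >> 16);
}

/* Widen bf16 to f32: exact, just place the 16 bits in the upper half. */
static float bf16_to_f32(uint16_t h) {
    uint32_t bits = (uint32_t)h << 16;
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}
```

A helper like this can be handy for building reference values to compare against when checking emulated bf16 modes, provided the rounding assumption is verified against real hardware first.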

corsix · 2023-03-12T18:09:13Z

Commits pushed to reflect the changes in the above comment, in both the documentation and the emulation code.

56789KD · 2024-09-13T16:30:19Z

Add SIMD comment

corsix self-assigned this Mar 8, 2023

corsix closed this as completed Mar 12, 2023
