-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[API Proposal]: Arm64: FEAT_F32MM #94024
Comments
Tagging subscribers to this area: @dotnet/area-system-numerics Issue Detailsnamespace System.Runtime.Intrinsics.Arm
/// VectorT Summary
public abstract class Sve : AdvSimd /// Feature: FEAT_F32MM
{
public static unsafe Vector<float> MatrixMultiplyAccumulate(Vector<float> op1, Vector<float> op2, Vector<float> op3);
/// total method signatures: 1
}
/// Full API
public abstract class Sve : AdvSimd /// Feature: FEAT_F32MM
{
/// MatrixMultiplyAccumulate : Matrix multiply-accumulate
/// svfloat32_t svmmla[_f32](svfloat32_t op1, svfloat32_t op2, svfloat32_t op3) : "FMMLA Ztied1.S, Zop2.S, Zop3.S" or "MOVPRFX Zresult, Zop1; FMMLA Zresult.S, Zop2.S, Zop3.S"
public static unsafe Vector<float> MatrixMultiplyAccumulate(Vector<float> op1, Vector<float> op2, Vector<float> op3);
/// total method signatures: 1
/// total method names: 1
}
/// Total ACLE covered across API: 1
|
This contributes to #93095 It covers all of the instructions in FEAT_F32MM. This an optional 8.2 feature but is not yet available in any hardware. This list was auto generated from the C ACLE for SVE, and is in three parts: The methods list reduced down to Vector versions. All possible varaints of T are given above the method. Many of the C functions include predicate argument(s), of type svbool_t as the first argument. These are missing from the C# method. It is expected that the Jit will create predicates where required, or combine with uses of conditionalSelect(). For more discussion see #88140 comment. |
Updated to reflect review comments from other API proposals. |
Tagging subscribers to this area: @dotnet/area-system-runtime-intrinsics Issue Detailsnamespace System.Runtime.Intrinsics.Arm
/// VectorT Summary
public abstract class SveF32mm : AdvSimd /// Feature: FEAT_F32MM
{
public static unsafe Vector<float> MatrixMultiplyAccumulate(Vector<float> op1, Vector<float> op2, Vector<float> op3); // FMMLA // MOVPRFX
/// total method signatures: 1
}
/// Full API
public abstract class SveF32mm : AdvSimd /// Feature: FEAT_F32MM
{
/// MatrixMultiplyAccumulate : Matrix multiply-accumulate
/// svfloat32_t svmmla[_f32](svfloat32_t op1, svfloat32_t op2, svfloat32_t op3) : "FMMLA Ztied1.S, Zop2.S, Zop3.S" or "MOVPRFX Zresult, Zop1; FMMLA Zresult.S, Zop2.S, Zop3.S"
public static unsafe Vector<float> MatrixMultiplyAccumulate(Vector<float> op1, Vector<float> op2, Vector<float> op3);
/// total method signatures: 1
/// total method names: 1
}
/// Total ACLE covered across API: 1
|
Updated to match implemented SVE1 methods. |
This feature is not yet available on any existing Arm hardware. I don't recommend implementing this for .NET10 |
The text was updated successfully, but these errors were encountered: