Implement Shift and Inserts scalar and SIMD intrinsics. #36818

TamarChristinaArm · 2020-05-21T14:55:52Z

This implements the remaining shift and insert intrinsics.

/cc @tannergooding @echesakovMSFT @CarolEidt

Dotnet-GitSync-Bot · 2020-05-21T14:56:04Z

Note regarding the new-api-needs-documentation label:

This serves as a reminder for when your PR is modifying a ref *.cs file and adding/modifying public APIs, to please make sure the API implementation in the src *.cs file is documented with triple slash comments, so the PR reviewers can sign off that change.

TamarChristinaArm · 2020-05-21T14:57:23Z

@echesakovMSFT @tannergooding I'm trying to figure out why my tests are throwing

ERROR!!!-System.PlatformNotSupportedException: Operation is not supported on this platform.

I know my constants aren't compile time constants, but I thought it would generate a lookup table then?

tannergooding · 2020-05-21T15:15:36Z

src/coreclr/src/jit/hwintrinsicarm64.cpp

+ case NI_AdvSimd_ShiftLeftLogicalAndInsertScalar:
+ case NI_AdvSimd_ShiftRightAndInsert:
+ case NI_AdvSimd_ShiftRightLogicalAndInsertScalar:
+ immLowerBound = 1;


immLowerBound == 0 is a MOVI variant? (Trying to make sure I'm reading the architecture manual correctly here 😄)

yeah 0 would be a MOV or ORR (not MOVI since that's a bitmask).

echesakov · 2020-05-21T17:27:43Z

@TamarChristinaArm You would also need something like this echesakov@0fe0cf9 to make sure that in non-const imm case we are checking against lower bound.

Would you mind waiting a little bit before I merge #36552 and changes in my private branch https://github.com/echesakovMSFT/runtime/compare/Arm64-ASIMD-Shift-Intrinsics that I plan to publish today.

#36552 is needed to fix an issue in emitter where INS_sri was not encoding shift amount properly (see #36552 (comment)) and the changes in private will add the check in addRangeCheckIfNeeded.

TamarChristinaArm · 2020-05-21T17:32:01Z

@echesakovMSFT ahh thanks, No I don't mind waiting at all. I was just really confused why it wasn't working :)

echesakov · 2020-05-21T17:44:03Z

I have one question - in #33398 we decided to explicitly specify what type of right shifts an intrinsic does - logical right shift or arithmetic right shift.

The operations in this PR are slightly different though - they are neither logical nor arithmetic since the bits "created" by shifting are filled with the corresponding bits in Vd.

Would it make sense to name them ShiftLeftAndInsert ShiftLeftAndInsertScalar ShiftRightAndInsert ShiftRightAndInsertScalar? @TamarChristinaArm @tannergooding

TamarChristinaArm · 2020-05-21T17:49:14Z

Would it make sense to name them ShiftLeftAndInsert ShiftLeftAndInsertScalar ShiftRightAndInsert ShiftRightAndInsertScalar? @TamarChristinaArm @tannergooding

I do agree here. When implementing them I did find the naming a bit confusing and inconsistent. But since they were approved like that I went with it. I find your proposed name clearer since as you say regardless of whether the shift is logical or arithmetic the bits you shift in will be overwritten.

tannergooding · 2020-05-21T18:10:06Z

That makes sense. This is also an element shift rather than a bitwise shift, so we might consider a different parameter name to make that clear.

That is, if you right shift by 1 then element 0 of the source is lost, element 1 of the source becomes element 0 of the destination, element 2 becomes element 1, etc. The last element of the destination is then preserved

TamarChristinaArm · 2020-05-21T18:20:50Z

That makes sense. This is also an element shift rather than a bitwise shift, so we might consider a different parameter name to make that clear.

It's a bitwise shift though, but every element is shifted by the same amount. I have also just realized that this is a RMW and forgot to modify it as such..

tannergooding · 2020-05-21T18:42:04Z

It's a bitwise shift though, but every element is shifted by the same amount. I have also just realized that this is a RMW and forgot to modify it as such..

Ah, I misread the diagram in the manual. I see now that the boxes are for the individual bits, my mistake.

In any case, we'd just need to take this to API review for discussion.

echesakov · 2020-05-26T17:47:27Z

ahh thanks, No I don't mind waiting at all. I was just really confused why it wasn't working :)

@TamarChristinaArm I merged #36552 and #36830 last week, so sri and sli instructions should work as expected and all the infrastructure needed for non-zero lower bound intrinsics are in place. Please let me know if you need help with resolving the merge conflicts.

TamarChristinaArm · 2020-05-27T15:44:41Z

ahh thanks, No I don't mind waiting at all. I was just really confused why it wasn't working :)

@TamarChristinaArm I merged #36552 and #36830 last week, so sri and sli instructions should work as expected and all the infrastructure needed for non-zero lower bound intrinsics are in place. Please let me know if you need help with resolving the merge conflicts.

@echesakovMSFT awesome thanks! I'm rebasing it now, will let you know if I run into any trouble.

TamarChristinaArm · 2020-06-08T07:44:20Z

That should do it @echesakovMSFT @tannergooding @CarolEidt

TamarChristinaArm · 2020-06-08T07:45:57Z

aside from the naming issue @echesakovMSFT raised in #36818 (comment) but @tannergooding mentioned it needs API review, so I have left them as the last reviewed names.

TamarChristinaArm · 2020-06-11T11:53:40Z

rebased with changes from master

echesakov · 2020-06-11T18:24:06Z

src/libraries/System.Private.CoreLib/src/System/Runtime/Intrinsics/Arm/AdvSimd.cs

@@ -7442,6 +7442,230 @@ public new abstract class Arm64 : ArmBase.Arm64
 /// </summary>
 public static Vector128<float> ReciprocalStep(Vector128<float> left, Vector128<float> right) => ReciprocalStep(left, right);

+ /// <summary>
+ /// uint8x8_t vsli_n_u8(uint8x8_t a, uint8x8_t b, __builtin_constant_p(n))


@TamarChristinaArm Everything looks good.

Just one question here - what is __builtin_constant_p(n) and
should we use uint8x8_t vsri_n_u8 (uint8x8_t a, uint8x8_t b, const int n) comment here instead as we did for the remaining intrinsics with an immediate operand?

Just one question here - what is __builtin_constant_p(n) and

It's a compiler builtin supported by clang and gcc to indicate that the value of n must be a compile time constant. the problem is that in C there's no real way to indicate this as const only means the value is read-only in the function.

should we use uint8x8_t vsri_n_u8 (uint8x8_t a, uint8x8_t b, const int n) comment here instead as we did for the remaining intrinsics with an immediate operand?

probably yes since in CLR it doesn't have to be since you generate that lookup table. I generate these signatures through a script that reads the ACLE sources so they got in automatically :)

echesakov · 2020-06-11T18:44:49Z

I am going to merge this to avoid future conflicts (e.g. with #36916)

If we decide to update the summary comments/method names we can do this later.

terrajobst · 2020-06-25T17:30:44Z

I have one question - in #33398 we decided to explicitly specify what type of right shifts an intrinsic does - logical right shift or arithmetic right shift.

The operations in this PR are slightly different though - they are neither logical nor arithmetic since the bits "created" by shifting are filled with the corresponding bits in Vd.

Would it make sense to name them ShiftLeftAndInsert ShiftLeftAndInsertScalar ShiftRightAndInsert ShiftRightAndInsertScalar?

Works for us.

Dotnet-GitSync-Bot added area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI new-api-needs-documentation labels May 21, 2020

tannergooding reviewed May 21, 2020

View reviewed changes

This was referenced May 21, 2020

Errors installing the SDK during builds #34015

Closed

OSX machines are de-provisioned during CI / PR runs leading to failures #34472

Closed

TamarChristinaArm force-pushed the implement-arm64-shift-and-inserts-insn branch from cf57e2a to ce29c95 Compare June 8, 2020 07:37

TamarChristinaArm changed the title ~~WIP: Implement Shift and Inserts scalar and SIMD intrinsics.~~ Implement Shift and Inserts scalar and SIMD intrinsics. Jun 8, 2020

Arm64: Implement shift and insert intrinsics

3fcae9a

TamarChristinaArm force-pushed the implement-arm64-shift-and-inserts-insn branch from ce29c95 to 3fcae9a Compare June 11, 2020 11:52

tannergooding approved these changes Jun 11, 2020

View reviewed changes

echesakov approved these changes Jun 11, 2020

View reviewed changes

echesakov reviewed Jun 11, 2020

View reviewed changes

echesakov merged commit d2f9adc into dotnet:master Jun 11, 2020

echesakov mentioned this pull request Jul 1, 2020

[Arm64] ASIMD InsertScalar and rename to ShiftLeftAndInsert ShiftRightAndInsert #38680

Merged

ghost locked as resolved and limited conversation to collaborators Dec 9, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Shift and Inserts scalar and SIMD intrinsics. #36818

Implement Shift and Inserts scalar and SIMD intrinsics. #36818

TamarChristinaArm commented May 21, 2020

Dotnet-GitSync-Bot commented May 21, 2020

TamarChristinaArm commented May 21, 2020

tannergooding May 21, 2020

TamarChristinaArm May 21, 2020

echesakov commented May 21, 2020

TamarChristinaArm commented May 21, 2020

echesakov commented May 21, 2020

TamarChristinaArm commented May 21, 2020

tannergooding commented May 21, 2020

TamarChristinaArm commented May 21, 2020

tannergooding commented May 21, 2020

echesakov commented May 26, 2020

TamarChristinaArm commented May 27, 2020

TamarChristinaArm commented Jun 8, 2020

TamarChristinaArm commented Jun 8, 2020

TamarChristinaArm commented Jun 11, 2020

echesakov Jun 11, 2020 •

edited

Loading

TamarChristinaArm Jun 12, 2020

echesakov commented Jun 11, 2020

terrajobst commented Jun 25, 2020

Implement Shift and Inserts scalar and SIMD intrinsics. #36818

Implement Shift and Inserts scalar and SIMD intrinsics. #36818

Conversation

TamarChristinaArm commented May 21, 2020

Dotnet-GitSync-Bot commented May 21, 2020

TamarChristinaArm commented May 21, 2020

tannergooding May 21, 2020

Choose a reason for hiding this comment

TamarChristinaArm May 21, 2020

Choose a reason for hiding this comment

echesakov commented May 21, 2020

TamarChristinaArm commented May 21, 2020

echesakov commented May 21, 2020

TamarChristinaArm commented May 21, 2020

tannergooding commented May 21, 2020

TamarChristinaArm commented May 21, 2020

tannergooding commented May 21, 2020

echesakov commented May 26, 2020

TamarChristinaArm commented May 27, 2020

TamarChristinaArm commented Jun 8, 2020

TamarChristinaArm commented Jun 8, 2020

TamarChristinaArm commented Jun 11, 2020

echesakov Jun 11, 2020 • edited Loading

Choose a reason for hiding this comment

TamarChristinaArm Jun 12, 2020

Choose a reason for hiding this comment

echesakov commented Jun 11, 2020

terrajobst commented Jun 25, 2020

echesakov Jun 11, 2020 •

edited

Loading