Fold const WithElement to CNS_VEC #86212

jasper-d · 2023-05-13T18:30:15Z

Closes #84543

VN ternary SIMD ops and fold const WithElement to CNS_VEC for base-type float (#84543).

I believe regressions are a result of different register allocation.

OSX x64 failure is a known issue (#86612). I assume SIGSEGV on Linux arm32 is unrelated.

ghost · 2023-05-13T18:30:27Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

Fold const WithElement to CNS_VEC #84543.

@EgorBo Could you please take a look at this? Currently, it only handles TYP_FLOAT, but if the approach looks generally ok, I would try to generalize it to support other types as well.

Codegen now looks like this:

[MethodImpl(MethodImplOptions.NoInlining)]
internal static Vector4 Vector4Fields() => new Vector4 { X = 1, Y = 2, Z = 3, W = 4};

Emitting data sections: 16 total bytes

RWD00  	dq	400000003F800000h, 4080000040400000h
  section   0, size 16, RWD 0:	00 00 80 3f 00 00 00 40 00 00 40 40 00 00 80 40 

Allocated method code size =   19 , actual size =   19, unused size =    0

*************** After end code gen, before unwindEmit()
G_M24095_IG01:        ; func=00, offs=000000H, size=0003H, bbWeight=1, PerfScore 1.00, gcrefRegs=0000 {}, byrefRegs=0000 {}, byref, nogc <-- Prolog IG

IN0004: 000000 vzeroupper 

G_M24095_IG02:        ; offs=000003H, size=000FH, bbWeight=1, PerfScore 5.25, gcrefRegs=0000 {}, byrefRegs=0002 {rcx}, BB01 [0000], byref

IN0001: 000003 vmovups  xmm0, xmmword ptr [reloc @RWD00]
IN0002: 00000B vmovups  xmmword ptr [rcx], xmm0
IN0003: 00000F mov      rax, rcx

G_M24095_IG03:        ; offs=000012H, size=0001H, bbWeight=1, PerfScore 1.00, epilog, nogc, extend

IN0005: 000012 ret

Author:	jasper-d
Assignees:	-
Labels:	`area-CodeGen-coreclr`
Milestone:	-

jasper-d · 2023-05-13T20:55:14Z

Failures are related, will look into them tomorrow.

Whoopsie:

[000018] ----------- morphing withelement    *  HWINTRINSIC simd16 float WithElement
[000007] -----+-----                         +--*  CNS_VEC   simd8 <0x3f86872b, 0x40033333>
[000017] -----+-----                         +--*  CNS_INT   int    2
[000010] -----+-----                         \--*  CNS_DBL   float  3.4779999256134033
[000007] -----+----- morphed into this       *  CNS_VEC   simd8 <0x3f86872b, 0x40033333>

EgorBo · 2023-05-13T21:19:12Z

but if the approach looks generally ok,

I think you need to generalize it for any type, not just float/double. overall the approach looks good. Ideally we'd want to do this in VN but it needs extra efforts (we don't number intrinsics with more than 2 args).

And the failing test yes, you need to check source vector's type

EgorBo · 2023-05-14T13:52:15Z

Actually, no, this should be handled in VN phase (easier and can handle more cases), here is a quick prototype you neeed to extend with type checks, etc:

--- a/src/coreclr/jit/valuenum.cpp
+++ b/src/coreclr/jit/valuenum.cpp
@@ -11356,8 +11356,37 @@ void Compiler::fgValueNumberHWIntrinsic(GenTreeHWIntrinsic* tree)
     ValueNumPair excSetPair = ValueNumStore::VNPForEmptyExcSet();
     ValueNumPair normalPair = ValueNumPair();
 
-    if ((tree->GetOperandCount() > 2) || ((JitConfig.JitDisableSimdVN() & 2) == 2))
+    const bool disableSimdVN = (JitConfig.JitDisableSimdVN() & 2) == 2;
+    if ((tree->GetOperandCount() > 2) || disableSimdVN)
     {
+        if (!disableSimdVN && intrinsicId == NI_Vector128_WithElement)
+        {
+            assert(tree->GetOperandCount() == 3);
+            GenTree* op1 = tree->Op(1);
+            GenTree* op2 = tree->Op(2);
+            GenTree* op3 = tree->Op(3);
+            if (op1->gtVNPair.BothEqual() && vnStore->IsVNConstant(op1->gtVNPair.GetLiberal()))
+            {
+                if (op2->gtVNPair.BothEqual() && vnStore->IsVNConstant(op2->gtVNPair.GetLiberal()))
+                {
+                    if (op3->gtVNPair.BothEqual() && vnStore->IsVNConstant(op3->gtVNPair.GetLiberal()))
+                    {
+                        ValueNum constVecVN  = op1->gtVNPair.GetLiberal();
+                        ValueNum elemIndexVN = op2->gtVNPair.GetLiberal();
+                        ValueNum elemValueVN = op3->gtVNPair.GetLiberal();
+
+                        simd16_t constVec = vnStore->GetConstantSimd16(constVecVN);
+                        constVec.f32[vnStore->GetConstantInt32(elemIndexVN)] = vnStore->GetConstantSingle(elemValueVN);
+
+                        ValueNum newSimdVN = vnStore->VNForSimd16Con(constVec);
+                        tree->gtVNPair = vnStore->VNPWithExc(ValueNumPair(newSimdVN, newSimdVN), excSetPair);
+                        return;
+                    }
+                }
+            }
+        }
+
+
         // TODO-CQ: allow intrinsics with > 2 operands to be properly VN'ed.
         normalPair = vnStore->VNPairForExpr(compCurBB, tree->TypeGet());

jasper-d · 2023-05-14T18:41:35Z

here is a quick prototype you neeed to extend with type checks, etc

Thank you! To clarify, it would be ok to special case WithElement like you did and not resolve

runtime/src/coreclr/jit/valuenum.cpp

Line 11361 in f107b63

// TODO-CQ: allow intrinsics with > 2 operands to be properly VN'ed.

?

Because that looks rather complicated.

EgorBo · 2023-05-14T18:46:47Z

here is a quick prototype you neeed to extend with type checks, etc

Thank you! To clarify, it would be ok to special case WithElement like you did and not resolve

runtime/src/coreclr/jit/valuenum.cpp

Line 11361 in f107b63

// TODO-CQ: allow intrinsics with > 2 operands to be properly VN'ed.

?
Because that looks rather complicated.

Yes, special casing is fine in this case, to enable proper numbering for args>2 we need to check what kind of intrinsics we'll work with. Bur, presumably, it should be beneficial and not complicated

EgorBo · 2023-05-20T16:02:38Z

@jasper-d hint: if diffs won't find a lot of diffs and all of them about Vector2/3/4 with float fields - I think it will be fine to simplify the impl to only handle those since your current impl could be a bit an overkill if we only deal with floats

jasper-d · 2023-05-20T19:24:52Z

Looks like https://github.com/dotnet/runtime/blob/main/src/tests/JIT/HardwareIntrinsics/X86/Regression/GitHub_17957/GitHub_17957.cs is the only diff for non-float. Will change.

src/coreclr/jit/valuenum.cpp

jasper-d · 2023-06-08T14:25:24Z

@EgorBo PTAL

src/coreclr/jit/valuenum.h

src/coreclr/jit/valuenum.cpp

EgorBo

LGTM, thanks!

Co-authored-by: Egor Bogatov <egorbo@gmail.com>

jasper-d · 2023-06-11T15:00:26Z

LGTM, thanks!

Thank you for your help!

Fold const WithElement to CNS_VEC

c426c89

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label May 13, 2023

ghost added the community-contribution Indicates that the PR has been added by a community member label May 13, 2023

jasper-d added 2 commits May 13, 2023 20:44

Run jit-format

f3503e6

Fix x86 build

8811921

EgorBo and others added 5 commits May 14, 2023 20:57

Add prototype for folding WithElement in VN

8cca177

Revert changes in morph

7d55585

VN ternary SIMD ops

1dd981d

Fold constant withelement

5af97c4

Merge remote-tracking branch 'upstream/main' into jasper-d/84543

99ff48c

Fix gcc build

b386719

jasper-d added 2 commits May 21, 2023 19:47

Optimize float only

e451bf7

Use actual values where we have them

22925aa

runfoapp bot mentioned this pull request May 22, 2023

Infra improvements for Helix #68176

Closed

build-analysis bot mentioned this pull request May 22, 2023

Assert failure in GC/API/NoGCRegion/Callback_Svr test #86612

Closed

jasper-d commented May 23, 2023

View reviewed changes

src/coreclr/jit/valuenum.cpp Outdated Show resolved Hide resolved

jasper-d marked this pull request as ready for review May 23, 2023 12:14

jasper-d changed the title ~~[WIP] Fold const WithElement to CNS_VEC~~ Fold const WithElement to CNS_VEC May 23, 2023

jasper-d commented May 23, 2023

View reviewed changes

src/coreclr/jit/valuenum.cpp Outdated Show resolved Hide resolved

JulieLeeMSFT assigned EgorBo and jasper-d May 30, 2023

EgorBo reviewed Jun 8, 2023

View reviewed changes

src/coreclr/jit/valuenum.h Show resolved Hide resolved

Do not extend vectors

4260e9b

EgorBo reviewed Jun 10, 2023

View reviewed changes

src/coreclr/jit/valuenum.cpp Outdated Show resolved Hide resolved

EgorBo reviewed Jun 10, 2023

View reviewed changes

src/coreclr/jit/valuenum.cpp Show resolved Hide resolved

jasper-d added 2 commits June 11, 2023 15:29

Remove unused parameter

52dbdf5

Add bound check

6233e21

EgorBo reviewed Jun 11, 2023

View reviewed changes

src/coreclr/jit/valuenum.cpp Outdated Show resolved Hide resolved

EgorBo approved these changes Jun 11, 2023

View reviewed changes

Update src/coreclr/jit/valuenum.cpp

5cbbe22

Co-authored-by: Egor Bogatov <egorbo@gmail.com>

EgorBo merged commit a052348 into dotnet:main Jun 13, 2023

ghost locked as resolved and limited conversation to collaborators Jul 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fold const WithElement to CNS_VEC #86212

Fold const WithElement to CNS_VEC #86212

jasper-d commented May 13, 2023 •

edited by EgorBo

Loading

ghost commented May 13, 2023

jasper-d commented May 13, 2023 •

edited

Loading

EgorBo commented May 13, 2023 •

edited

Loading

EgorBo commented May 14, 2023

jasper-d commented May 14, 2023

EgorBo commented May 14, 2023 •

edited

Loading

EgorBo commented May 20, 2023 •

edited

Loading

jasper-d commented May 20, 2023

jasper-d commented Jun 8, 2023

EgorBo left a comment

jasper-d commented Jun 11, 2023

Fold const WithElement to CNS_VEC #86212

Fold const WithElement to CNS_VEC #86212

Conversation

jasper-d commented May 13, 2023 • edited by EgorBo Loading

ghost commented May 13, 2023

jasper-d commented May 13, 2023 • edited Loading

EgorBo commented May 13, 2023 • edited Loading

EgorBo commented May 14, 2023

jasper-d commented May 14, 2023

EgorBo commented May 14, 2023 • edited Loading

EgorBo commented May 20, 2023 • edited Loading

jasper-d commented May 20, 2023

jasper-d commented Jun 8, 2023

EgorBo left a comment

Choose a reason for hiding this comment

jasper-d commented Jun 11, 2023

jasper-d commented May 13, 2023 •

edited by EgorBo

Loading

jasper-d commented May 13, 2023 •

edited

Loading

EgorBo commented May 13, 2023 •

edited

Loading

EgorBo commented May 14, 2023 •

edited

Loading

EgorBo commented May 20, 2023 •

edited

Loading