JIT: Mark replacements as dirty after setting GTF_VAR_DEATH #88669

jakobbotsch · 2023-07-11T16:40:29Z

Physically promoted struct locals are marked as dying when their remainder dies since all other state is stored in other locals. However, when we do this we must also mark all replacements as stale; otherwise a future struct use could skip writebacks and effectively introduce new uses, invalidating the previously marked last use bit.

Fix #88616

Note that the regression test introduced here requires #88665 to actually fail.

Physically promoted struct locals are marked as dying when their remainder dies since all other state is stored in other locals. However, when we do this we must also mark all replacements as stale; otherwise a future struct use could skip writebacks and effectively introduce new uses, invalidating the previously marked last use bit. Fix dotnet#88616

ghost · 2023-07-11T16:40:41Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

Physically promoted struct locals are marked as dying when their remainder dies since all other state is stored in other locals. However, when we do this we must also mark all replacements as stale; otherwise a future struct use could skip writebacks and effectively introduce new uses, invalidating the previously marked last use bit.

Fix #88616

Note that the regression test introduced here requires #88665 to actually fail.

Author:	jakobbotsch
Assignees:	-
Labels:	`area-CodeGen-coreclr`
Milestone:	-

jakobbotsch · 2023-07-11T19:14:22Z

cc @dotnet/jit-contrib PTAL @AndyAyersMS

No diffs.

Looks like I'll need to rerun CI; license/cla seems to be stuck.

Diff in test case:

@@ -1,30 +1,30 @@
 ; Assembly listing for method Runtime_88616:TestEntryPoint():int (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX - Windows
 ; FullOpts code
 ; optimized code
 ; rsp based frame
 ; partially interruptible
 ; No PGO data
 ; Final local variable assignments
 ;
-;  V00 loc0         [V00    ] (  7,  7   )  struct (40) [rsp+20H]   do-not-enreg[XSF] must-init addr-exposed ld-addr-op
+;  V00 loc0         [V00    ] ( 12, 12   )  struct (40) [rsp+20H]   do-not-enreg[XSF] must-init addr-exposed ld-addr-op
 ;* V01 loc1         [V01    ] (  0,  0   )    long  ->  zero-ref   
 ;  V02 OutArgs      [V02    ] (  1,  1   )  struct (32) [rsp+00H]   do-not-enreg[XS] addr-exposed "OutgoingArgSpace"
 ;* V03 tmp1         [V03    ] (  0,  0   )    long  ->  zero-ref    "dup spill"
 ;* V04 tmp2         [V04    ] (  0,  0   )    long  ->  zero-ref    "impAppendStmt"
 ;* V05 tmp3         [V05    ] (  0,  0   )    long  ->  zero-ref    "impAppendStmt"
 ;* V06 tmp4         [V06    ] (  0,  0   )    long  ->  zero-ref    "impAppendStmt"
 ;* V07 tmp5         [V07    ] (  0,  0   )    long  ->  zero-ref    "dup spill"
 ;* V08 tmp6         [V08    ] (  0,  0   )    long  ->  zero-ref    "dup spill"
 ;* V09 tmp7         [V09    ] (  0,  0   )    long  ->  zero-ref    "dup spill"
 ;* V10 tmp8         [V10    ] (  0,  0   )    long  ->  zero-ref    "impAppendStmt"
 ;* V11 tmp9         [V11    ] (  0,  0   )    long  ->  zero-ref    "impAppendStmt"
 ;* V12 tmp10        [V12    ] (  0,  0   )    long  ->  zero-ref    "impAppendStmt"
 ;* V13 tmp11        [V13    ] (  0,  0   )    long  ->  zero-ref    "dup spill"
 ;* V14 tmp12        [V14    ] (  0,  0   )    long  ->  zero-ref    "dup spill"
 ;* V15 tmp13        [V15    ] (  0,  0   )    long  ->  zero-ref    "dup spill"
 ;* V16 tmp14        [V16    ] (  0,  0   )    long  ->  zero-ref    "impAppendStmt"
 ;* V17 tmp15        [V17    ] (  0,  0   )    long  ->  zero-ref    "impAppendStmt"
 ;* V18 tmp16        [V18    ] (  0,  0   )    long  ->  zero-ref    "impAppendStmt"
 ;* V19 tmp17        [V19    ] (  0,  0   )    long  ->  zero-ref    "dup spill"
 ;* V20 tmp18        [V20    ] (  0,  0   )    long  ->  zero-ref    "dup spill"
@@ -47,38 +47,37 @@
 ;* V37 tmp35        [V37    ] (  0,  0   )    long  ->  zero-ref    "V00.[032..040)"
 ;
 ; Lcl frame size = 72
 
 G_M45268_IG01:  ;; offset=0000H
        sub      rsp, 72
        vxorps   xmm4, xmm4, xmm4
        vmovdqa  xmmword ptr [rsp+20H], xmm4
        vmovdqa  xmmword ptr [rsp+30H], xmm4
        xor      eax, eax
        mov      qword ptr [rsp+40H], rax
 						;; size=27 bbWeight=1 PerfScore 5.83
 G_M45268_IG02:  ;; offset=001BH
        mov      qword ptr [rsp+20H], 50
        mov      qword ptr [rsp+28H], 100
        mov      qword ptr [rsp+30H], 150
        mov      qword ptr [rsp+38H], 200
        mov      qword ptr [rsp+40H], 250
        lea      rcx, [rsp+20H]
        call     [Runtime_88616:Mutate(Runtime_88616+S)]
+       mov      qword ptr [rsp+20H], 50
+       mov      qword ptr [rsp+28H], 100
+       mov      qword ptr [rsp+30H], 150
+       mov      qword ptr [rsp+38H], 200
+       mov      qword ptr [rsp+40H], 250
        lea      rcx, [rsp+20H]
        call     [Runtime_88616:Check(Runtime_88616+S):int]
        nop      
-						;; size=68 bbWeight=1 PerfScore 12.25
-G_M45268_IG03:  ;; offset=005FH
+						;; size=113 bbWeight=1 PerfScore 17.25
+G_M45268_IG03:  ;; offset=008CH
        add      rsp, 72
        ret      
 						;; size=5 bbWeight=1 PerfScore 1.25
 
-; Total bytes of code 100, prolog size 27, PerfScore 29.33, instruction count 18, allocated bytes for code 100 (MethodHash=35b94f2b) for method Runtime_88616:TestEntryPoint():int (FullOpts)
+; Total bytes of code 145, prolog size 27, PerfScore 38.83, instruction count 23, allocated bytes for code 145 (MethodHash=35b94f2b) for method Runtime_88616:TestEntryPoint():int (FullOpts)
 ; ============================================================
 
-FAIL: s.A == -42
-Xunit.Sdk.EqualException: Assert.Equal() Failure
-Expected: 100
-Actual:   -1
-   at Xunit.Assert.Equal[T](T expected, T actual, IEqualityComparer`1 comparer) in /_/src/xunit.assert/Asserts/EqualityAsserts.cs:line 96
-   at __GeneratedMainWrapper.Main()

AndyAyersMS

I am confused about something here -- are we somehow avoiding making a copy of foo to pass to Mutate?

AndyAyersMS · 2023-07-12T23:47:19Z

I am confused about something here -- are we somehow avoiding making a copy of foo to pass to Mutate?

Your repro case passes for me w/o your fix and makes a copy to pass to Mutate

jakobbotsch · 2023-07-13T03:04:42Z

I am confused about something here -- are we somehow avoiding making a copy of foo to pass to Mutate?

Your repro case passes for me w/o your fix and makes a copy to pass to Mutate

Yes, we avoid the copy, but the particular test I have introduced requires #88665 to actually avoid the copy and expose the problem. Without that change we do not mark foo as a last use when passed to the first call because we think the second appearance uses the remainder (even though there is none).

The Fuzzlyn found test case is more complex but repros the bug without #88665. The found test case there has an actual unpromoted remainder and redefines it between the two calls which has the effect of both uses of vr37 being marked as last uses.

jakobbotsch · 2023-07-13T03:10:27Z

/azp run runtime, runtime-coreclr superpmi-diffs

azure-pipelines · 2023-07-13T03:10:57Z

Azure Pipelines successfully started running 2 pipeline(s).

jakobbotsch · 2023-07-13T06:39:07Z

There's a couple of diffs now with this change after #88665 was merged. The diffs failure is some internal server error from Helix. The other failures are the known file check ones, it seems.

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Jul 11, 2023

ghost assigned jakobbotsch Jul 11, 2023

Nit and rerun CI

c87d378

jakobbotsch requested a review from AndyAyersMS July 11, 2023 19:14

This was referenced Jul 11, 2023

simpleruntimeeventvalidation test failing in CI #88499

Closed

Test failure readytorun/HardwareIntrinsics/X86/CpuId_R2R_Avx/CpuId_R2R_Avx.sh #88582

Closed

Tracking issue for CI build timeouts #76454

Closed

AndyAyersMS reviewed Jul 12, 2023

View reviewed changes

build-analysis bot mentioned this pull request Jul 13, 2023

File check failures in x64 jit tests #88783

Closed

AndyAyersMS approved these changes Jul 13, 2023

View reviewed changes

jakobbotsch merged commit ddbce91 into dotnet:main Jul 13, 2023

jakobbotsch deleted the fix-88616 branch July 13, 2023 20:36

This was referenced Jul 14, 2023

Test failure System.Diagnostics.Metrics.Tests.AggregationStoreTests.GetThreeLabel #88890

Closed

Test failure JIT/Performance/CodeQuality/BenchmarksGame/regex-redux/regex-redux-1/regex-redux-1.dll #88882

Closed

ghost locked as resolved and limited conversation to collaborators Aug 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JIT: Mark replacements as dirty after setting GTF_VAR_DEATH #88669

JIT: Mark replacements as dirty after setting GTF_VAR_DEATH #88669

jakobbotsch commented Jul 11, 2023

ghost commented Jul 11, 2023

jakobbotsch commented Jul 11, 2023 •

edited

Loading

AndyAyersMS left a comment

AndyAyersMS commented Jul 12, 2023

jakobbotsch commented Jul 13, 2023 •

edited

Loading

jakobbotsch commented Jul 13, 2023

azure-pipelines bot commented Jul 13, 2023

jakobbotsch commented Jul 13, 2023

JIT: Mark replacements as dirty after setting GTF_VAR_DEATH #88669

JIT: Mark replacements as dirty after setting GTF_VAR_DEATH #88669

Conversation

jakobbotsch commented Jul 11, 2023

ghost commented Jul 11, 2023

jakobbotsch commented Jul 11, 2023 • edited Loading

AndyAyersMS left a comment

Choose a reason for hiding this comment

AndyAyersMS commented Jul 12, 2023

jakobbotsch commented Jul 13, 2023 • edited Loading

jakobbotsch commented Jul 13, 2023

azure-pipelines bot commented Jul 13, 2023

jakobbotsch commented Jul 13, 2023

jakobbotsch commented Jul 11, 2023 •

edited

Loading

jakobbotsch commented Jul 13, 2023 •

edited

Loading