Add a VN-based dead store removal phase #77990

SingleAccretion · 2022-11-07T20:45:44Z

This new phase will iterate over the stores referenced through SSA descriptors, and delete those which do not change the local's value, determined via VN's selection mechanism.

TP: ~0.1%.
SPMI diffs (Win-x64): -5K for benchmarks.run, -24K for libraries.pmi.

Overall minor impact on both counts, but the amount of code is not large too.

Detailed TP regression breakdown:

Base: 52494649362, Diff: 52555823204, +0.1165%

Compiler::optVNBasedDeadStoreRemoval                             : 20273376 : NA          : 28.04% : +0.0386%
Compiler::fgValueNumber                                          : 11366343 : +13.93%     : 15.72% : +0.0217%
ValueNumStore::VNForFunc                                         : 8873142  : +1.52%      : 12.27% : +0.0169%
ValueNumStore::GetAllocChunk                                     : 3852024  : +1.90%      : 5.33%  : +0.0073%
JitHashTable<unsigned int,unsigned int,>::Set                    : 3062006  : +6.79%      : 4.23%  : +0.0058%
ValueNumStore::VNForMapSelectWork                                : 3023527  : +3.65%      : 4.18%  : +0.0058%
ValueNumStore::VnForConst<int,ValueNumStore::VNMap<int,>>        : 2481512  : +8.15%      : 3.43%  : +0.0047%
ArenaAllocator::allocateMemory                                   : 1951190  : +0.19%      : 2.70%  : +0.0037%
JitExpandArray<unsigned __int64 *>::EnsureCoversInd              : 1326343  : +0.83%      : 1.83%  : +0.0025%
ValueNumStore::GetVNFunc1Map                                     : 1275570  : +11.92%     : 1.76%  : +0.0024%
DoPhase                                                          : 1221592  : +0.88%      : 1.69%  : +0.0023%
LIR::Range::Delete                                               : 912609   : +164434.05% : 1.26%  : +0.0017%
JitHashTable<VNDefFuncApp<1>, unsigned int>::Grow                : 833936   : +7.69%      : 1.15%  : +0.0016%
JitExpandArray<ValueNumStore::VNDefFuncApp<2> >::EnsureCoversInd : 808398   : +10.77%     : 1.12%  : +0.0015%
LclVarDsc::HasGCPtr                                              : 777500   : +353.94%    : 1.08%  : +0.0015%
ValueNumStore::VNZeroForType                                     : -1151448 : -3.39%      : 1.59%  : -0.0022%

The impact of the zero-init fix is more than I'd like, but it is not clear if it can be ameliorated without compromising correctness; the logic is not trivial.

The impact of fetching ZeroObj VNs is also somewhat surprising; one more reason to rework the "init block" IR shape as it exists today to have something STRUCT-typed on the RHS.

All that said, the improvements from #77655 should cover for this, with a bit left still.

ghost · 2022-11-07T20:45:54Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

This new phase will iterate over the stores references through SSA descriptors, and delete those which do not change the local's value, determined via VN's selection mechanism.

TP impact: ~0.05%.
SPMI diffs (Win-x64): -4K for benchmarks.run, -23K for libraries.pmi.

Author:	SingleAccretion
Assignees:	-
Labels:	`area-CodeGen-coreclr`
Milestone:	-

src/coreclr/jit/optimizer.cpp

Fewer regressions; a bit faster TP-wise. Diffs (libraries.pmi) for the full implementation: --------------------------------------------------------------------------------- 2,236 contexts with diffs (1,871 improvements, 111 regressions) -25,583/+1,467 bytes Diffs (libraries.pmi) for partial definitions only: --------------------------------------------------------------------------------- 1,687 contexts with diffs (1,683 improvements, 3 regressions) -23,661/+25 bytes TP impact about the same, ~0.05% against ~0.045%.

This reverts commit 127dccf.

SingleAccretion · 2022-11-07T23:35:23Z

As one would suspect, we're bumping here into bugs with how simplistic is VN's logic for whether something will be zero-initialized in the prolog is. Will take some iterations to sort this out.

Edit: done.

(The "lvaIsOSRLocal" check in codegen was redundant)

SingleAccretion · 2022-11-08T13:26:08Z

@dotnet/jit-contrib

(Edit: naturally, we should stress and fuzz this)

jakobbotsch · 2022-11-08T15:33:33Z

/azp run runtime-coreclr jitstress, runtime-coreclr libraries-jitstress, Fuzzlyn

azure-pipelines · 2022-11-08T15:34:02Z

Azure Pipelines successfully started running 3 pipeline(s).

jakobbotsch · 2022-11-08T21:00:43Z

Not sure what happened in the win-arm64 Fuzzlyn run, the artifacts were apparently not published in the expected location. I checked the partitions manually and saw no errors.

SingleAccretion · 2022-11-08T22:05:18Z

Looks like there are stress failures that will need to be investigated.

System.Drawing on ARM64.
baseservices.threading on macOS ARM64 (possibly unrelated).

SingleAccretion · 2022-11-09T13:41:09Z

The System.Drawing tests are a clear case of silent bad codegen. Unfortunately, without an ARM64 device, it is not trivial to understand where exactly the failure comes from. The pattern of tests failures suggests the problem is somewhere in the auto-generated interop code, and it is of a very particular nature (most test results are 2x of what they should be).

Will try PMIing the assembly manually next.

Edit: notably, the tests in question only fail with TC on...

Edit: the local debugging plan didn't pan out. Will have to use the CI to get more information.

Edit: more CI debugging did not reproduce the issue. Curious.

SingleAccretion · 2022-11-10T23:18:54Z

@jakobbotsch Let's run another libraries stress here. I've run the System.Drawing tests 3 times in the supposedly same configuration in #PR, and couldn't reproduce the failures. So either:

The failure is intermittent (and infrequent); quite possible given TC=1.
It was, somehow, a one-off.
The setup in #PR is wrong.

So, if we see it fail again, it was most likely 3. If not... Well, I am not sure at this point.

I have also pushed a debug-only commit to capture the dump if we do see it fail.

jakobbotsch · 2022-11-10T23:24:00Z

/azp run runtime-coreclr libraries-jitstress

azure-pipelines · 2022-11-10T23:24:11Z

Azure Pipelines successfully started running 1 pipeline(s).

jakobbotsch · 2022-11-10T23:25:17Z

I'll see if I can repro the failure on win-arm64 some time on the weekend or next week.

SingleAccretion · 2022-11-11T12:22:35Z

No luck, the stress was clean. Will revert the debug commit.

This reverts commit 797dc55.

jakobbotsch · 2022-11-12T12:20:43Z

I was not able to repro the failures after running the exact bits that failed in CI on my win-arm64 machine in a loop for a while, so I think it can be ignored (maybe some kind of GDI bug?).

SingleAccretion · 2022-11-12T12:57:56Z

Thank you for looking! In the meantime I found a reference to the second failure here: #72365 (comment).

jakobbotsch · 2022-11-14T08:43:48Z

/azp run runtime-coreclr superpmi-diffs, runtime-coreclr superpmi-replay

azure-pipelines · 2022-11-14T08:44:07Z

Azure Pipelines successfully started running 2 pipeline(s).

src/coreclr/jit/jitconfigvalues.h

src/coreclr/jit/rationalize.cpp

jakobbotsch · 2022-11-14T16:49:12Z

Awesome work, thank you!

EgorBo · 2022-11-24T20:38:36Z

Improvements on win-x64: dotnet/perf-autofiling-issues#10064

SingleAccretion added 3 commits November 7, 2022 23:23

Add a flag for explicit inits

127dccf

Move optConservativeNormalVN

cd3f489

Fix call rationalization

83ff819

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Nov 7, 2022

ghost added the community-contribution Indicates that the PR has been added by a community member label Nov 7, 2022

Add VN-based dead store removal

1844134

SingleAccretion force-pushed the SSA-Dead-Code branch from 670cca0 to 84ea3ab Compare November 7, 2022 21:17

SingleAccretion commented Nov 7, 2022

View reviewed changes

src/coreclr/jit/optimizer.cpp Show resolved Hide resolved

SingleAccretion added 2 commits November 8, 2022 02:18

Revert "Add a flag for explicit inits"

553906f

This reverts commit 127dccf.

runfoapp bot mentioned this pull request Nov 7, 2022

Infra improvements for Helix #68176

Closed

Fix VN logic for zero-init

d6547a3

(The "lvaIsOSRLocal" check in codegen was redundant)

SingleAccretion force-pushed the SSA-Dead-Code branch from 84ea3ab to d6547a3 Compare November 8, 2022 00:23

SingleAccretion marked this pull request as ready for review November 8, 2022 13:25

build-analysis bot mentioned this pull request Nov 8, 2022

Tracking issue for CI build timeouts #76454

Closed

build-analysis bot mentioned this pull request Nov 9, 2022

Test failure JIT\\HardwareIntrinsics\\General\\Vector256\\Vector256_r\\Vector256_r.cmd #76280

Closed

SingleAccretion mentioned this pull request Nov 9, 2022

Disallow IND<struct> except as a source of STORE_DYN_BLK #74784

Merged

Add a fail fast to collect dumps

797dc55

Revert "Add a fail fast to collect dumps"

ea49c0a

This reverts commit 797dc55.

jakobbotsch reviewed Nov 14, 2022

View reviewed changes

src/coreclr/jit/jitconfigvalues.h Show resolved Hide resolved

jakobbotsch approved these changes Nov 14, 2022

View reviewed changes

jakobbotsch reviewed Nov 14, 2022

View reviewed changes

src/coreclr/jit/rationalize.cpp Show resolved Hide resolved

SingleAccretion added 2 commits November 14, 2022 13:24

Add JitEnableVNBasedDeadStoreRemovalRange

b9e9559

Update phase description

15f9816

jakobbotsch merged commit 73f2bd4 into dotnet:main Nov 14, 2022

SingleAccretion deleted the SSA-Dead-Code branch November 14, 2022 17:59

SingleAccretion mentioned this pull request Dec 16, 2022

Remove GT_ADDR nodes #11057

Closed

ghost locked as resolved and limited conversation to collaborators Jan 2, 2023

jeffhandley added the blog-candidate Completed PRs that are candidate topics for blog post coverage label Mar 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a VN-based dead store removal phase #77990

Add a VN-based dead store removal phase #77990

SingleAccretion commented Nov 7, 2022 •

edited

Loading

ghost commented Nov 7, 2022

SingleAccretion commented Nov 7, 2022 •

edited

Loading

SingleAccretion commented Nov 8, 2022 •

edited

Loading

jakobbotsch commented Nov 8, 2022

azure-pipelines bot commented Nov 8, 2022

jakobbotsch commented Nov 8, 2022

SingleAccretion commented Nov 8, 2022

SingleAccretion commented Nov 9, 2022 •

edited

Loading

SingleAccretion commented Nov 10, 2022 •

edited

Loading

jakobbotsch commented Nov 10, 2022

azure-pipelines bot commented Nov 10, 2022

jakobbotsch commented Nov 10, 2022

SingleAccretion commented Nov 11, 2022

jakobbotsch commented Nov 12, 2022

SingleAccretion commented Nov 12, 2022

jakobbotsch commented Nov 14, 2022

azure-pipelines bot commented Nov 14, 2022

jakobbotsch commented Nov 14, 2022

EgorBo commented Nov 24, 2022

Add a VN-based dead store removal phase #77990

Add a VN-based dead store removal phase #77990

Conversation

SingleAccretion commented Nov 7, 2022 • edited Loading

ghost commented Nov 7, 2022

SingleAccretion commented Nov 7, 2022 • edited Loading

SingleAccretion commented Nov 8, 2022 • edited Loading

jakobbotsch commented Nov 8, 2022

azure-pipelines bot commented Nov 8, 2022

jakobbotsch commented Nov 8, 2022

SingleAccretion commented Nov 8, 2022

SingleAccretion commented Nov 9, 2022 • edited Loading

SingleAccretion commented Nov 10, 2022 • edited Loading

jakobbotsch commented Nov 10, 2022

azure-pipelines bot commented Nov 10, 2022

jakobbotsch commented Nov 10, 2022

SingleAccretion commented Nov 11, 2022

jakobbotsch commented Nov 12, 2022

SingleAccretion commented Nov 12, 2022

jakobbotsch commented Nov 14, 2022

azure-pipelines bot commented Nov 14, 2022

jakobbotsch commented Nov 14, 2022

EgorBo commented Nov 24, 2022

SingleAccretion commented Nov 7, 2022 •

edited

Loading

SingleAccretion commented Nov 7, 2022 •

edited

Loading

SingleAccretion commented Nov 8, 2022 •

edited

Loading

SingleAccretion commented Nov 9, 2022 •

edited

Loading

SingleAccretion commented Nov 10, 2022 •

edited

Loading