JIT: add OSR patchpoint strategy, inhibit tail duplication #66208

AndyAyersMS · 2022-03-04T19:22:14Z

Two changes for OSR:

add new strategies for placing patchpoints -- either at
backedge sources (instead of targets) or adaptive. depending
on number of backedges. Change default to adaptive, since this
works better with the flow we see from C# for loops.
inhibit tail duplication for OSR as it may end up interfering
with loop recognition.

We may not be able to place patchpoints at sources, for various reasons;
if so we fall back to placing them at targets.

ghost · 2022-03-04T19:22:21Z

Tagging subscribers to this area: @JulieLeeMSFT
See info in area-owners.md if you want to be subscribed.

Issue Details

Two changes for OSR:

add new strategies for placing patchpoints -- either at
backedge sources (instead of targets) or adaptive depending
on number of backedges. Change default to sources, since this
works better with the flow we see from C# for loops.
inhibit tail duplication for OSR as it may end up interfering
with loop recognition.

Adaptive placement may end up working out better overall, but
needs further evaluation.

Author:	AndyAyersMS
Assignees:	AndyAyersMS
Labels:	`area-CodeGen-coreclr`
Milestone:	-

AndyAyersMS · 2022-03-04T19:22:39Z

@BruceForstall PTAL
cc @dotnet/jit-contrib

am11 · 2022-03-04T19:36:23Z

src/coreclr/jit/importer.cpp

                    //
-                    assert(!block->hasHndIndex());
+                    const int patchpointStrategy = JitConfig.TC_PatchpointStrategy();


Would it be clearer to have enum PatchpointStrategy { BackedgeSource, BackedgeTarget, Adaptive }; and return enum value from TC_PatchpointStrategy()?

The config system is kind of primitive, so probably not.

AndyAyersMS · 2022-03-04T19:40:44Z

Looks like this will need some tweaking. We may not be at stack empty point at the top of a source block and so might miss out setting some needed patchpoints.

IIRC ecma requires that IL be stack empty at backedge targets, so the issue hasn't come up before.

Also we may see backedge sources in handler regions; can't put patchpoints there.

Two changes for OSR: * add new strategies for placing patchpoints -- either at backedge sources (instead of targets) or adaptive. depending on number of backedges. Change default to adaptive, since this works better with the flow we see from C# `for` loops. * inhibit tail duplication for OSR as it may end up interfering with loop recognition. We may not be able to place patchpoints at sources, for various reasons; if so we fall back to placing them at targets.

AndyAyersMS · 2022-03-04T22:08:10Z

Ok, should work a bit better now.

BruceForstall · 2022-03-04T22:29:10Z

src/coreclr/jit/block.h

@@ -551,6 +551,8 @@ enum BasicBlockFlags : unsigned __int64
    BBF_HAS_ALIGN            = MAKE_BBFLAG(39), // BB ends with 'align' instruction
    BBF_TAILCALL_SUCCESSOR   = MAKE_BBFLAG(40), // BB has pred that has potential tail call

+    BBF_BACKWARD_JUMP_SOURCE = MAKE_BBFLAG(41), // Block is a source of a backward jump


It's unfortunate that we have/need bits for this (and BBF_BACKWARD_JUMP_TARGET). Lots of code just compares bbNum, but that depends on bbNum being up-to-date; presumably you can't depend on that? Oh, I guess in addition, you don't want to depend on the preds list being up-to-date and walking the preds list? These bits can obviously themselves get out of date if the flow graph is reordered. Also, do we properly update these bits when, say, we merge blocks and delete the one with the bit?

The def/use happens very early, in the flow graph build and importer, before anything else can mess with the flow graph. I could look at source/target nums instead, but that would require enumeration of succs (for sources) and preds (for targets) for every block which would be more costly, and I'd have to build cheap preds.

Nothing later depends on these flag bits. I could scrub them out (at least in debug) if you think leaving them around and potentially gone stale is an attractive nuisance.

Or we could figure out how to have limited-lifetime flags that automatically expire at some point.

It would be useful if we knew when any of the block bits were expected to be valid, either as documentation here, or better, with some kind of checking. Scrubbing them might not be possible (they're only 0/1 already, so scrubbing to 0 doesn't help), unless we add a "valid/invalid" bit for each one, with a debug accessor/checker. Certainly not required with this change (unless you wanted to add comments describing their lifetime).

src/coreclr/jit/importer.cpp

BruceForstall · 2022-03-04T22:47:39Z

src/coreclr/jit/importer.cpp

+
+                            if (succBlock->bbNum <= block->bbNum)
+                            {
+                                // The succBlock had better agree it's as target.


src/coreclr/jit/importer.cpp

AndyAyersMS · 2022-03-05T01:17:58Z

Most failures look like some kind of CI outage or configuration change

##[warning]Docker pull failed with exit code 1, back off 9.857 seconds before retry.
Unable to find image 'mcr.microsoft.com/dotnet-buildtools/prereqs:rhel-7-rpmpkg-c982313-20174116044113' locally

Test failure almost certainly unrelated as OSR is not enabled innerloop except in a handful of runtime tests.

src/coreclr/jit/importer.cpp

Co-authored-by: Bruce Forstall <brucefo@microsoft.com>

AndyAyersMS · 2022-03-05T02:01:08Z

Sigh, using github to edit code has its downsides. Will fix.

AndyAyersMS · 2022-03-05T03:03:23Z

/azp run runtime-jit-experimental

azure-pipelines · 2022-03-05T03:03:38Z

Azure Pipelines successfully started running 1 pipeline(s).

AndyAyersMS · 2022-03-05T19:15:33Z

OSR stress testing revealed a few issues:

despite me claiming the flow graph isn't altered and so the flags can be trusted, eh canons can introduce blocks.
we're setting patchpoints at non-stack-empty block entries, this leads to invalid program errors when the OSR method is jitted because of stack underflows. We need to avoid such patchpoints, which in some IL cases means we will have loops we can't break out of.

Fixes forthcoming.

AndyAyersMS · 2022-03-05T21:48:43Z

/azp run runtime-jit-experimental

azure-pipelines · 2022-03-05T21:48:59Z

Azure Pipelines successfully started running 1 pipeline(s).

AndyAyersMS · 2022-03-06T00:15:46Z

Several of bindhandle tests failing in jit-experimental. Can't repro locally so far

      BindHandle call succeeded
      Got wrong error: System.ApplicationException: Error in the application.
         at System.Threading.PortableThreadPool.RegisterForIOCompletionNotifications(IntPtr handle)
         at System.Threading.ThreadPool.BindHandle(SafeHandle osHandle)
         at BindHandle1.RunTest()

There is a concurrent run in main going on right now, will see if it hits those as well.

jakobbotsch · 2022-03-06T00:18:55Z

The bindhandle failures seem to be somewhat widespread, we hit them in #65682 as well (#65682 (comment)).

AndyAyersMS · 2022-03-06T01:47:51Z

Yeah -- the jit-experimental run on main also seeing those failures.

AndyAyersMS · 2022-03-06T04:10:13Z

Looks like both this change and main see 146 failures in jit-experimental.

Spot checked a bunch and all matched, so assuming they exactly overlap.

AndyAyersMS · 2022-03-06T16:25:14Z

OSX runs seem to be consistently timing out. Nothing here is arch/os specific, and it's almost all off by default. Will ignore.

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Mar 4, 2022

ghost assigned AndyAyersMS Mar 4, 2022

AndyAyersMS mentioned this pull request Mar 4, 2022

On Stack Replacement Next Steps #33658

Open

72 tasks

am11 reviewed Mar 4, 2022

View reviewed changes

AndyAyersMS force-pushed the PatchpointStrategy branch from 734a821 to d0228c8 Compare March 4, 2022 22:06

BruceForstall approved these changes Mar 4, 2022

View reviewed changes

AndyAyersMS commented Mar 5, 2022

View reviewed changes

src/coreclr/jit/importer.cpp Outdated Show resolved Hide resolved

Apply suggestions from code review

5221e36

Co-authored-by: Bruce Forstall <brucefo@microsoft.com>

fix syntax

0b0346c

fixes for issues uncovered by stress

590b7b4

AndyAyersMS merged commit f9da3db into dotnet:main Mar 6, 2022

EgorBo mentioned this pull request Mar 6, 2022

Fix address exposure in forward sub (x86) #66253

Merged

AndyAyersMS mentioned this pull request Mar 7, 2022

[Perf] Changes at 3/2/2022 3:17:47 PM dotnet/perf-autofiling-issues#3864

Closed

ghost locked as resolved and limited conversation to collaborators Apr 5, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JIT: add OSR patchpoint strategy, inhibit tail duplication #66208

JIT: add OSR patchpoint strategy, inhibit tail duplication #66208

AndyAyersMS commented Mar 4, 2022 •

edited

Loading

ghost commented Mar 4, 2022

AndyAyersMS commented Mar 4, 2022

am11 Mar 4, 2022

AndyAyersMS Mar 4, 2022

AndyAyersMS commented Mar 4, 2022 •

edited

Loading

AndyAyersMS commented Mar 4, 2022

BruceForstall Mar 4, 2022

AndyAyersMS Mar 5, 2022

BruceForstall Mar 5, 2022

BruceForstall Mar 4, 2022

AndyAyersMS commented Mar 5, 2022

AndyAyersMS commented Mar 5, 2022

AndyAyersMS commented Mar 5, 2022

azure-pipelines bot commented Mar 5, 2022

AndyAyersMS commented Mar 5, 2022

AndyAyersMS commented Mar 5, 2022

azure-pipelines bot commented Mar 5, 2022

AndyAyersMS commented Mar 6, 2022

jakobbotsch commented Mar 6, 2022

AndyAyersMS commented Mar 6, 2022

AndyAyersMS commented Mar 6, 2022

AndyAyersMS commented Mar 6, 2022

JIT: add OSR patchpoint strategy, inhibit tail duplication #66208

JIT: add OSR patchpoint strategy, inhibit tail duplication #66208

Conversation

AndyAyersMS commented Mar 4, 2022 • edited Loading

ghost commented Mar 4, 2022

AndyAyersMS commented Mar 4, 2022

am11 Mar 4, 2022

Choose a reason for hiding this comment

AndyAyersMS Mar 4, 2022

Choose a reason for hiding this comment

AndyAyersMS commented Mar 4, 2022 • edited Loading

AndyAyersMS commented Mar 4, 2022

BruceForstall Mar 4, 2022

Choose a reason for hiding this comment

AndyAyersMS Mar 5, 2022

Choose a reason for hiding this comment

BruceForstall Mar 5, 2022

Choose a reason for hiding this comment

BruceForstall Mar 4, 2022

Choose a reason for hiding this comment

AndyAyersMS commented Mar 5, 2022

AndyAyersMS commented Mar 5, 2022

AndyAyersMS commented Mar 5, 2022

azure-pipelines bot commented Mar 5, 2022

AndyAyersMS commented Mar 5, 2022

AndyAyersMS commented Mar 5, 2022

azure-pipelines bot commented Mar 5, 2022

AndyAyersMS commented Mar 6, 2022

jakobbotsch commented Mar 6, 2022

AndyAyersMS commented Mar 6, 2022

AndyAyersMS commented Mar 6, 2022

AndyAyersMS commented Mar 6, 2022

AndyAyersMS commented Mar 4, 2022 •

edited

Loading

AndyAyersMS commented Mar 4, 2022 •

edited

Loading