JIT: Fix profiler tail call insertion logic for FIELD_LIST #76883

jakobbotsch · 2022-10-11T16:37:41Z

This logic was not handling FIELD_LIST and was also not handling linear order appropriately.

The logic is still a bit odd; it would probably be better to use the same kind of logic as CFG (moving PUTARG nodes ahead of the profiler hook instead), but in practice this seems to be ok.

Fix #76879

ghost · 2022-10-11T16:37:56Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Author:	jakobbotsch
Assignees:	-
Labels:	`area-CodeGen-coreclr`
Milestone:	-

jakobbotsch · 2022-10-11T17:47:41Z

src/coreclr/jit/codegenarmarch.cpp

-        // GT_RELOAD/GT_COPY use the child node
-        argNode = argNode->gtSkipReloadOrCopy();


Seeing/skipping these here would be illegal IIUC. SPMI replay does not find any occurrences of this, and codegenxarch.cpp also does not have the equivalent here.

jakobbotsch · 2022-10-11T18:04:29Z

/azp run runtime-coreclr jitstress, runtime-coreclr libraries-jitstress

azure-pipelines · 2022-10-11T18:04:51Z

Azure Pipelines successfully started running 2 pipeline(s).

jakobbotsch · 2022-10-12T00:27:33Z

cc @dotnet/jit-contrib PTAL @BruceForstall @kunalspathak

Failures are #76880, #76836 and #76910

BruceForstall · 2022-10-12T15:08:45Z

src/coreclr/jit/lower.cpp

+//
+GenTree* Lowering::FindEarliestPutArg(GenTreeCall* call)
+{
+    size_t numMarkedNodes = 0;


Do we need a 2-pass system? Is it the case that we have a required ordering where the putarg under the field_list are in order, such that we can (1) look for the first field_list and then (2) look for the first putarg under that, in list order?

I guess your solution seems more robust and doesn't require strict list ordering, but does require more traversals.

jakobbotsch: Personally I'd prefer that we don't make any assumptions on linear order vs operand order in LIR after rationalization. For example, it might be reasonable in the future to allow some trivial scheduling/reordering of the PUTARG_REG order in lowering. One place today we are moving these nodes already is when control-flow guard is enabled (MoveCFGCallArg -- but I think this ends up keeping the nodes in order).

SingleAccretion: One place where the operand order does not match the execution order is on x86, where the use list is reversed.

jakobbotsch: Is that really true? I think the order matches execution order, but the arguments are pushed in that order so they end up on stack in opposite order of the other platforms.

SingleAccretion: Pretty sure?

    // The code generator will push these fields in reverse order by offset. Reorder the list here s.t. the order
    // of uses is visible to LSRA.
    assert(fieldList->Uses().IsSorted());
    fieldList->Uses().Reverse();

jakobbotsch: Ah, for the FIELD_LIST operands. Ok, interesting, was not aware of this. This logic skips x86 anyway.
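To make the two-pass idea discussed above concrete, here is a minimal, self-contained sketch on a toy IR. It is not the code from this PR: `ToyNode`, `MarkPutArgs` and `FindEarliestPutArgSketch` are hypothetical names, and the linear order is modeled as a plain vector. Pass one marks every argument node, looking through field lists; pass two walks the linear order backwards and stops at the last mark it clears, so the result does not depend on operand order matching linear order.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a JIT IR node; this is not the real GenTree.
struct ToyNode
{
    bool                  isFieldList = false; // models a FIELD_LIST wrapper
    bool                  marked      = false; // models a per-node mark flag
    std::vector<ToyNode*> operands;            // uses, in operand order
};

// Pass 1: mark the nodes that feed the call, looking through field lists.
void MarkPutArgs(ToyNode* node, std::size_t& numMarked)
{
    if (node->isFieldList)
    {
        for (ToyNode* field : node->operands)
        {
            MarkPutArgs(field, numMarked);
        }
        return;
    }

    assert(!node->marked); // assumption: no stale marks are left in the IR
    node->marked = true;
    numMarked++;
}

// Pass 2: walk backwards in linear order, clearing marks as they are seen.
// The node whose mark is cleared last is the earliest argument node.
ToyNode* FindEarliestPutArgSketch(const std::vector<ToyNode*>& linearOrder, ToyNode* call)
{
    std::size_t numMarked = 0;
    for (ToyNode* arg : call->operands)
    {
        MarkPutArgs(arg, numMarked);
    }

    if (numMarked == 0)
    {
        return call; // no argument nodes; the sketch falls back to the call itself
    }

    ToyNode* earliest = call;
    for (auto it = linearOrder.rbegin(); it != linearOrder.rend(); ++it)
    {
        ToyNode* node = *it;
        if (!node->marked)
        {
            continue;
        }

        node->marked = false;
        earliest     = node;
        if (--numMarked == 0)
        {
            break;
        }
    }

    assert(numMarked == 0); // every marked node must appear in the linear order
    return earliest;
}
```

The sketch costs one extra traversal compared with trusting list order, which is the trade-off raised in the review: more walking, but no assumption about how operands are ordered.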

BruceForstall · 2022-10-12T15:09:15Z

src/coreclr/jit/lower.cpp

+        }
+    }
+
+    if (numMarkedNodes <= 0)


nit: size_t is unsigned, so < is unnecessary (maybe confusing? Or just "defense in depth"?)

jakobbotsch: This is just a habit of mine. I always write this check like this.
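As a small illustration of the nit, the following sketch shows that for an unsigned count `<= 0` behaves exactly like `== 0`. `NoNodesMarked` is a hypothetical helper written for this example, not a function from the JIT.

```cpp
#include <cassert>
#include <cstddef>
#include <type_traits>

static_assert(std::is_unsigned<std::size_t>::value, "size_t is an unsigned type");

// Hypothetical helper that mirrors the shape of the reviewed check.
bool NoNodesMarked(std::size_t numMarkedNodes)
{
    // For an unsigned value the "<" half can never be true,
    // so this is equivalent to numMarkedNodes == 0.
    return numMarkedNodes <= 0;
}
```

Either spelling is correct here; the choice is stylistic, as the reply says.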

BruceForstall · 2022-10-12T15:10:29Z

src/coreclr/jit/lower.cpp

+    }
+    else
+    {
+        node->gtLIRFlags |= LIR::Flags::Mark;


Would it be appropriate to add:

assert((node->gtLIRFlags & LIR::Flags::Mark) == 0);

I.e., how do we know the LIR doesn't have any Mark set at this point of compilation?

jakobbotsch: Yes, that would be appropriate; I can add the assert.

On "how do we know the LIR doesn't have any Mark set at this point of compilation?": the same assumption is made by LIR::Range::GetTreeRange, which we are already using from lowering. This code is somewhat like LIR::Range::GetMarkedRange.
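The invariant being discussed (a mark bit is clear before it is set, and cleared again by whoever set it) can be sketched on a toy flag word. `FlaggedNode`, `kMark`, `MarkNode` and `UnmarkNode` are hypothetical names for illustration only and do not reflect the real `LIR::Flags` layout.

```cpp
#include <cassert>

// Hypothetical mark bit; the value is arbitrary for this sketch.
constexpr unsigned kMark = 0x1;

struct FlaggedNode
{
    unsigned flags = 0;
};

bool IsMarked(const FlaggedNode& node)
{
    return (node.flags & kMark) != 0;
}

void MarkNode(FlaggedNode& node)
{
    // Precondition suggested in the review: nobody left a stale mark behind.
    assert(!IsMarked(node));
    node.flags |= kMark;
}

void UnmarkNode(FlaggedNode& node)
{
    // Each user of the mark bit clears what it set, keeping the invariant.
    assert(IsMarked(node));
    node.flags &= ~kMark;
}
```

With this discipline the assert in `MarkNode` is cheap to keep and catches any pass that forgets to clean up after itself.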

jakobbotsch · 2022-10-13T12:23:09Z

A few diffs where we now place the profiler hook a bit earlier, and some TP improvements from removing the gtSkipReloadOrCopy call.

The failures look known according to build analysis.

dotnet-issue-labeler bot added the area-CodeGen-coreclr label (CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI) Oct 11, 2022

ghost assigned jakobbotsch Oct 11, 2022

jakobbotsch and others added 4 commits October 11, 2022 18:39

Nit (3f0f797)

Nit (278ed35)

Add an assert (d8e881a)

Remove illegal skip (bd01df5)

build-analysis bot mentioned this pull request Oct 11, 2022

Test failure in System.Transactions.Tests.OleTxTests.* tests #76836

Closed

jakobbotsch requested review from BruceForstall and kunalspathak October 12, 2022 00:27

BruceForstall approved these changes Oct 12, 2022

Update lower.cpp

8190ad5

This was referenced Oct 12, 2022

System.Data.OleDb.Test - timeout and hangs #74488

Open

Assertion failed: (*card_word)==0 in DynamicGenerics tests #76801

Closed

jakobbotsch merged commit 8b999ee into dotnet:main Oct 13, 2022

jakobbotsch deleted the fix-76879 branch October 13, 2022 12:23

ghost locked as resolved and limited conversation to collaborators Nov 12, 2022
