add a wrapper around Mir with an exit node, dominators returns error when nodes are unreachable #34556

scottcarr · 2016-06-29T16:49:39Z

We want the Mir CFG to have an exit node to calculate post dominators. The MirWithExit type allows us to add the exit node on demand.

When nodes are unreachable from the start node, dominators are undefined, so dominators now returns an error in that case.


rust-highfive · 2016-06-29T16:49:53Z

Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @Aatch (or someone else) soon.

If any changes to this PR are deemed necessary, please add them as extra commits. This ensures that the reviewer can see what has changed since they last reviewed the code. Due to the way GitHub handles out-of-date commits, this should also make it reasonably obvious what issues have or haven't been addressed. Large or tricky changes may require several passes of review and changes.

Please see the contribution instructions for more information.

scottcarr · 2016-06-29T16:53:00Z

r? @nikomatsakis

arielb1 · 2016-06-29T21:06:45Z

What is the motivation for postdoms here? LLVM seems to use them only for its region analysis.

BTW, this does not handle loops without an exit.

scottcarr · 2016-06-29T22:12:48Z

> What is the motivation for postdoms here? LLVM seems to use them only for its region analysis.

For the "move up propagation" optimization I'm working on (which eliminates a temporary), it only fires if this particular use post dominates the temporary's definition.

> BTW, this does not handle loops without an exit.

The way I was thinking to handle it is:

If the Mir CFG has loops without an exit, then there is no "the exit node" and the post-dominators are undefined. When we calculate the dominators of the transposed MirWithExit CFG (which are the post-dominators of the original Mir CFG), that graph will have unreachable nodes and dominators returns an error in that case.
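A minimal sketch of that idea, assuming a toy CFG stored as adjacency lists rather than the real `Mir` or `MirWithExit` types. The names `Unreachable`, `dominators`, and `post_dominators` are hypothetical and exist only for illustration. It adds a synthetic exit node, reverses every edge, and runs a simple set-based dominator computation from the exit; a loop without an exit shows up as an unreachable node and yields an `Err`.

```rust
use std::collections::HashSet;

/// Hypothetical error type: the nodes that cannot be reached from the root.
#[derive(Debug, PartialEq)]
struct Unreachable(Vec<usize>);

/// For each node, the set of nodes that dominate it (itself included).
/// Returns an error if any node is unreachable from `root`.
fn dominators(succ: &[Vec<usize>], root: usize) -> Result<Vec<HashSet<usize>>, Unreachable> {
    let n = succ.len();

    // Depth-first search to find everything reachable from the root.
    let mut seen = vec![false; n];
    let mut stack = vec![root];
    seen[root] = true;
    while let Some(u) = stack.pop() {
        for &v in &succ[u] {
            if !seen[v] {
                seen[v] = true;
                stack.push(v);
            }
        }
    }
    let missing: Vec<usize> = (0..n).filter(|&i| !seen[i]).collect();
    if !missing.is_empty() {
        return Err(Unreachable(missing));
    }

    // Build predecessor lists.
    let mut pred: Vec<Vec<usize>> = vec![Vec::new(); n];
    for (u, vs) in succ.iter().enumerate() {
        for &v in vs {
            pred[v].push(u);
        }
    }

    // Iterate dom(v) = {v} ∪ ⋂ dom(p) over predecessors p until a fixed point.
    let all: HashSet<usize> = (0..n).collect();
    let mut dom = vec![all; n];
    dom[root] = HashSet::from([root]);
    let mut changed = true;
    while changed {
        changed = false;
        for v in (0..n).filter(|&v| v != root) {
            let mut new: Option<HashSet<usize>> = None;
            for &p in &pred[v] {
                new = Some(match new {
                    None => dom[p].clone(),
                    Some(s) => s.intersection(&dom[p]).copied().collect(),
                });
            }
            let mut new = new.unwrap_or_default();
            new.insert(v);
            if new != dom[v] {
                dom[v] = new;
                changed = true;
            }
        }
    }
    Ok(dom)
}

/// Post-dominators of the original graph are the dominators of the reversed
/// graph, rooted at a synthetic exit node (index `succ.len()`).
fn post_dominators(succ: &[Vec<usize>], exits: &[usize]) -> Result<Vec<HashSet<usize>>, Unreachable> {
    let exit = succ.len();
    let mut rev: Vec<Vec<usize>> = vec![Vec::new(); exit + 1];
    for (u, vs) in succ.iter().enumerate() {
        for &v in vs {
            rev[v].push(u); // edge u -> v becomes v -> u
        }
    }
    for &e in exits {
        rev[exit].push(e); // edge e -> exit becomes exit -> e
    }
    dominators(&rev, exit)
}

fn main() {
    // 0 -> 1 -> 2, where 2 returns: node 1 post-dominates node 0.
    let chain = post_dominators(&[vec![1], vec![2], vec![]], &[2]).unwrap();
    println!("1 post-dominates 0: {}", chain[0].contains(&1));

    // 0 -> {1, 2}, 1 -> 1 (a loop with no exit), 2 returns.
    let looping = post_dominators(&[vec![1, 2], vec![1], vec![]], &[2]);
    println!("loop without exit: {:?}", looping);
}
```

The set-based fixed point is chosen here for brevity; it is not a claim about which algorithm the PR's `dominators` uses.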

arielb1 · 2016-06-30T06:21:41Z

Use post-dominates definition? That means it won't work if there are panics in the middle, right?

scottcarr · 2016-06-30T14:06:17Z

> Use post-dominates definition?

I'm not 100% sure what you're asking. For some variable v, I want to know whether a particular use (e.g. x = ... v ...) post-dominates some definition of v (e.g. v = ...).

> That means it won't work if there are panics in the middle, right?

Dominators::dominators doesn't panic when it encounters unreachable nodes; it returns a Result. Callers should check the result if the graph might have unreachable nodes. Mir's CFG shouldn't have unreachable nodes, AFAIK, but I can change Mir::dominators to return the Result if needed.

arielb1 · 2016-06-30T17:24:52Z

@scottcarr

The problem is that if you consider panic edges, your analysis will be reduced to be basically local, as every call has a panic edge which means that nothing post-dominates anything.

scottcarr · 2016-06-30T22:11:00Z

@arielb1

Let me make sure I understand what you mean. If we have:

bb0: {
  tmp0 = 5;
  tmp1 = 42;
  tmp2 = foo(tmp1) -> [return: bb1, unwind: bb2] 
}

bb1: {
  tmp3 = tmp0;
  ...
}

bb2: {
  resume;
}

You are suggesting we should optimize to:

bb0: {
  tmp3 = 5; // tmp0 optimized out
  tmp1 = 42;
  tmp2 = foo(tmp1) -> [return: bb1, unwind: bb2] 
}

bb1: {
  // tmp3 = tmp0 optimized out
  ...
}

bb2: {
  resume;
}

... because "tmp3 = tmp0" is on all paths from "tmp0 = 5" to some "exit" that do not end in a "resume;"?
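To make the effect of the unwind edge concrete, here is a small sketch over the bb0/bb1/bb2 example, assuming a toy adjacency-list CFG with a synthetic exit node (index 3) and assuming bb1 eventually returns. The helper `on_all_paths` is hypothetical. It uses the fact that a node lies on every path from a start node to the exit exactly when the exit becomes unreachable once that node is removed.

```rust
/// Returns true if every path from `from` to `exit` passes through `via`,
/// i.e. `exit` cannot be reached from `from` once `via` is removed.
/// If `exit` is not reachable from `from` at all, this is vacuously true.
fn on_all_paths(succ: &[Vec<usize>], from: usize, via: usize, exit: usize) -> bool {
    if from == via {
        return true;
    }
    let mut seen = vec![false; succ.len()];
    let mut stack = vec![from];
    seen[from] = true;
    while let Some(u) = stack.pop() {
        if u == exit {
            return false; // found a path that avoids `via`
        }
        for &v in &succ[u] {
            if v != via && !seen[v] {
                seen[v] = true;
                stack.push(v);
            }
        }
    }
    true
}

fn main() {
    // Node indices: 0 = bb0, 1 = bb1, 2 = bb2, 3 = synthetic exit.
    // With the unwind edge bb0 -> bb2, and `resume` treated as leaving the function:
    let with_unwind = vec![vec![1, 2], vec![3], vec![3], vec![]];
    // The same CFG with the unwind edge out of bb0 ignored:
    let without_unwind = vec![vec![1], vec![3], vec![3], vec![]];

    // Does bb1 (the use `tmp3 = tmp0`) post-dominate bb0 (the definition `tmp0 = 5`)?
    println!("with unwind edge: {}", on_all_paths(&with_unwind, 0, 1, 3)); // false
    println!("ignoring unwind edge: {}", on_all_paths(&without_unwind, 0, 1, 3)); // true
}
```

With the unwind edge included, the path bb0 -> bb2 -> exit avoids bb1, so the use does not post-dominate the definition; it does only when resume paths are excluded.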

scottcarr · 2016-06-30T22:24:10Z

FWIW, move up optimization does fire a non-zero number of times when building the compiler. But it may be that all the statement pairs it optimizes are pretty local to each other.

#34585

nikomatsakis · 2016-07-01T18:46:35Z

So I chatted a bit with @scottcarr on IRC. I don't think that panic edges are actually particularly relevant. I think that what it comes down to is that if you are going to move the write B so that it occurs at the point A:

B0: {
   TMP = ... // Point A
   ...
}

Bn: {
    X = TMP // Point B
    ...
}

then basically anything reachable from A without passing through B must not be able to observe the fact that X has changed before it was supposed to have changed. So, if you trace paths from A and you encounter a RETURN or UNWIND terminator (whatever we call those now), then you could conclude that because the local variable X is being popped, you can safely perform the optimization. But if you encounter some node that may observe the value of X (which might include calls, depending on whether the address of X has been taken and what kind of conservative rules we are using) then you can't safely move it backward.
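The rule described above can be sketched as a graph walk. This is only an illustration under simplifying assumptions: program points are modelled as nodes of a toy graph, and each node is pre-classified as plain, as possibly observing X, or as a return/unwind exit. The names `Kind`, `Point`, and `can_move_write_up` are hypothetical and are not rustc types.

```rust
/// Hypothetical classification of a program point with respect to the local X.
#[derive(Clone, Copy, Debug, PartialEq)]
enum Kind {
    /// Neither reads nor exposes X.
    Plain,
    /// May observe the value of X (for example a read, or a call when X's
    /// address has been taken).
    ObservesX,
    /// A return or unwind terminator: X is popped, so nothing can observe it.
    Exit,
}

struct Point {
    kind: Kind,
    succ: Vec<usize>,
}

/// Walks every path that starts at point `a` and stops at `b`, at exits, or at
/// already-visited points. Returns false as soon as a point that may observe X
/// is reachable from `a` without passing through `b`.
fn can_move_write_up(points: &[Point], a: usize, b: usize) -> bool {
    let mut seen = vec![false; points.len()];
    let mut stack: Vec<usize> = points[a].succ.clone();
    while let Some(p) = stack.pop() {
        if p == b || seen[p] {
            continue; // paths through B are fine; skip repeats
        }
        seen[p] = true;
        match points[p].kind {
            Kind::ObservesX => return false,
            Kind::Exit => continue,
            Kind::Plain => stack.extend(points[p].succ.iter().copied()),
        }
    }
    true
}

fn main() {
    // 0 = Point A (TMP = ...), 2 = Point B (X = TMP),
    // 3 = unwind exit, 4 = return exit.
    let mut points = vec![
        Point { kind: Kind::Plain, succ: vec![1, 3] },
        Point { kind: Kind::Plain, succ: vec![2] },
        Point { kind: Kind::Plain, succ: vec![4] },
        Point { kind: Kind::Exit, succ: vec![] },
        Point { kind: Kind::Exit, succ: vec![] },
    ];
    println!("safe: {}", can_move_write_up(&points, 0, 2)); // true

    // If the point between A and B may observe X, the move is rejected.
    points[1].kind = Kind::ObservesX;
    println!("safe: {}", can_move_write_up(&points, 0, 2)); // false
}
```

Note that the unwind exit reachable from A does not block the move here, which matches the point that panic edges by themselves are not the deciding factor.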

In other words, I think @arielb1 is right that it might be better not to consider post-doms, but I think the focus on panics etc isn't that important.

nikomatsakis · 2016-07-01T18:47:45Z

(To be clear, I didn't read all the comments on this PR in depth.)

arielb1 · 2016-07-01T21:25:41Z

I think we got this discussion totally wrong anyway. Here's my model of the optimization:

The optimization is to transform

S1:
    tmp = SRC
    ...
S2:
    DEST = tmp

to

S1:
    DEST = SRC

I think this is best split into 4 steps. I don't think we ever want to do the steps separately, but this clarifies the analysis rules.

Step 0 (original)

S1:
    tmp = SRC
    ...
S2:
    DEST = tmp

Step 1 - add additional dead write

S1:
    tmp = SRC
    DEST = tmp
    ...
S2:
    DEST = tmp

This requires that DEST is dead at S1, and that it can be evaluated there (e.g. it is not a dereference of a pointer that is uninitialized there). Nothing else matters.

Step 2: common subexpression introduction

S1:
    tmp = SRC
    DEST = tmp
    ...
S2:
    tmp2 = DEST
    DEST = tmp2

This requires that the newly-added read links with the write of tmp - basically a "memdep" analysis.

This also requires that the address of DEST does not change, which is non-trivial because DEST can be the dereference of a pointer.

Step 3: remove write-of-read

S1:
    tmp = SRC
    DEST = tmp
    ...
S2:
    NOP

After step 2, S2 is obviously a NOP and can be removed.

Step 4: remove `tmp`

S1:
    DEST = SRC
    ...
S2:
    NOP

This is purely a local operation, but may require some sophistication if e.g. SRC is a function call. I think we are always justified doing it, but we should make sure it is OK.
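The combined effect of steps 1 through 4 can be sketched on straight-line code. This is a deliberately narrow model and not the MIR pass itself: statements are assignments between plain local names (no pointer dereferences, no calls, no control flow), and the `Stmt` type and `move_up` function are hypothetical. The checks are conservative stand-ins for the conditions in the steps: DEST must not be mentioned between S1 and S2, and `tmp` must have no other use.

```rust
#[derive(Clone, Debug, PartialEq)]
enum Stmt {
    /// `dest = src`, where both sides are plain local names.
    Assign { dest: &'static str, src: &'static str },
    Nop,
}

/// True if the statement reads or writes the local called `name`.
fn mentions(stmt: &Stmt, name: &str) -> bool {
    match stmt {
        Stmt::Assign { dest, src } => *dest == name || *src == name,
        Stmt::Nop => false,
    }
}

/// Rewrites `tmp = SRC; ...; DEST = tmp` into `DEST = SRC; ...; NOP`.
/// Returns false and leaves `body` untouched if the conservative checks fail.
fn move_up(body: &mut [Stmt], s1: usize, s2: usize) -> bool {
    if s1 >= s2 || s2 >= body.len() {
        return false;
    }
    // S1 must be `tmp = SRC`.
    let (tmp, src) = match &body[s1] {
        Stmt::Assign { dest, src } => (*dest, *src),
        Stmt::Nop => return false,
    };
    // S2 must be `DEST = tmp`.
    let dest = match &body[s2] {
        Stmt::Assign { dest, src } if *src == tmp => *dest,
        _ => return false,
    };
    if dest == tmp || src == tmp {
        return false;
    }
    for (i, stmt) in body.iter().enumerate() {
        if i == s1 || i == s2 {
            continue;
        }
        // Step 4: `tmp` may only be removed if nothing else uses it.
        if mentions(stmt, tmp) {
            return false;
        }
        // Steps 1 and 2: DEST must be neither read nor written between S1 and S2.
        if i > s1 && i < s2 && mentions(stmt, dest) {
            return false;
        }
    }
    body[s1] = Stmt::Assign { dest, src };
    body[s2] = Stmt::Nop;
    true
}

fn main() {
    let mut body = vec![
        Stmt::Assign { dest: "tmp", src: "a" },
        Stmt::Assign { dest: "u", src: "v" },
        Stmt::Assign { dest: "d", src: "tmp" },
    ];
    let applied = move_up(&mut body, 0, 2);
    println!("applied: {applied}, body: {body:?}");
}
```

Handling the harder cases named in the steps, such as DEST being a dereference whose address may change or SRC being a function call, would need real alias and liveness information that this sketch does not attempt.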

scottcarr · 2016-07-06T21:09:00Z

Since we're not planning to use post-dominators for move-up propagation, should we close this PR and move the discussion to #34693?

nikomatsakis · 2016-07-07T17:47:56Z

@scottcarr I think we should.

scottcarr added 4 commits June 27, 2016 13:51

add MirWithExit

3deddc2

refactor dominators to fail when there are unreachable nodes in the g…

dac4bc3

…raph

always assume 0..N nodes, check rpo.len == N

0360166

reenable dominators

c943ba0

rust-highfive assigned Aatch Jun 29, 2016

rust-highfive assigned nikomatsakis and unassigned Aatch Jun 29, 2016

scottcarr added 3 commits June 29, 2016 10:37

we dont use max anymore

ad669f1

remove accidentially checked in file

dc241a4

remove trailing whitespace

da0034b

scottcarr added 3 commits June 29, 2016 15:14

fix tidy and add infinite loop test

7a3ac76

fix more whitespace

d79dcc9

remove whitespace

1db087d

scottcarr mentioned this pull request Jul 1, 2016

[WIP] MIR Move up propagation #34585

Closed

nikomatsakis closed this Jul 7, 2016
