Use a more efficient iteration order for forward dataflow #62062

ecstatic-morse · 2019-06-22T20:15:35Z

Currently, dataflow begins by visiting each block in order of ID (BasicBlock(0), BasicBlock(1), etc.). This PR changes that initial iteration to reverse post-order (see this blog post for more info). This ensures that the effects of all predecessors will be applied before a basic block is visited if the CFG has no back-edges, and should result in less total iterations even when back-edges exist. This should not change the results of dataflow analysis.

The current ordering for basic blocks may be pretty close to RPO already--BasicBlock(0) is already the start block--in which case the cost of doing the traversal up front will outweigh the efficiency gains.
A perf run is needed to check this.

r? @pnkfelix (I think).

ecstatic-morse · 2019-06-22T20:54:06Z

@nagisa. Do you think the assertion about unreachable basic blocks is correct? I added it because there could be two successive basic blocks which are unreachable from the start block, in which case the entry set for the second one would be wrong.

nagisa · 2019-06-22T23:24:19Z

@bors try

bors · 2019-06-22T23:24:28Z

⌛ Trying commit b03ebd53e5b427b95fa36b0ec295b72125d7e371 with merge 0905d6a630cb4afc3f894f9e91c1a7a20c32416b...

bors · 2019-06-23T01:53:58Z

☀️ Try build successful - checks-travis
Build commit: 0905d6a630cb4afc3f894f9e91c1a7a20c32416b

Centril · 2019-06-23T11:44:18Z

@rust-timer build 0905d6a630cb4afc3f894f9e91c1a7a20c32416b

rust-timer · 2019-06-23T11:44:19Z

Success: Queued 0905d6a630cb4afc3f894f9e91c1a7a20c32416b with parent d6884ae, comparison URL.

rust-timer · 2019-06-23T17:14:00Z

Finished benchmarking try commit 0905d6a630cb4afc3f894f9e91c1a7a20c32416b, comparison URL.

pnkfelix · 2019-06-27T08:00:36Z

This seems ... fine? It doesn't seems to hurt anything; it also doesn't improve things terribly much, probably because the block numbering was close to RPO already, as hypothesized.

ts probably not a good idea to implicitly rely on the block numbering always remaining close to RPO, right? And therefore we should land this?

What do you think, @ecstatic-morse ? (@nagisa and @arielb1 may also have thoughts on this, since I know they've each also spent time thinking about or working on dataflow analyses.)

ecstatic-morse · 2019-06-27T17:38:29Z

I think this should probably land (totally not biased or anything :). It's a sensible default, the overhead of a single extra RPO traversal per dataflow analysis is pretty small, and it removes the implicit dependency on a certain basic block ordering, which may change as more MIR transformations are added. With the addition of more expensive dataflow passes (i.e. reaching definitions), the naive ordering could eventually have an observable performance impact.

The alternative is to wait until a MIR transformation is actually added which renumbers basic blocks, but this PR will probably be long forgotten by then 😄.

ecstatic-morse · 2019-06-27T18:34:18Z

Also, this does reduce the number of dataflow iterations, just not enough to matter. I observed a reduction of 9% across all dataflow analyses which took more than 5 iterations when running the tests in src/test/run-pass/array-slice-vec/.

Currently, dataflow begins by visiting each block in order of ID (`BasicBlock(0)`, `BasicBlock(1)`, etc.). This PR changes that initial iteration to reverse post-order. This ensures that the effects of all predecessors will be applied before a basic block is visited if the CFG has no back-edges, and should result in less total iterations even when back-edges exist. This should not change the results of dataflow analysis. The current ordering for basic blocks is pretty close to RPO already--`BasicBlock(0)` is already the start block, so the gains from this are pretty small, especially since we need to do an extra traversal up front. Note that some basic blocks are unreachable from the `START_BLOCK` during dataflow. We add these blocks to the work queue as well to preserve the original behavior.

This applies the same basic principle as rust-lang#62062 to the reverse dataflow analysis used to compute liveness information. It is functionally equivalent, except that post-order is used instead of reverse post-order. Some `mir::Body`s contain basic blocks which are not reachable from the `START_BLOCK`. We need to add them to the work queue as well to preserve the original semantics.

arielb1 · 2019-06-28T16:14:56Z

@pnkfelix

I think it's OK to land this to avoid the dependence on basic block ordering.

nagisa · 2019-06-29T22:24:15Z

@bors r+

bors · 2019-06-29T22:24:16Z

📌 Commit 07c5e2b has been approved by nagisa

@pnkfelix

…gisa Use a more efficient iteration order for forward dataflow Currently, dataflow begins by visiting each block in order of ID (`BasicBlock(0)`, `BasicBlock(1)`, etc.). This PR changes that initial iteration to reverse post-order (see [this blog post](https://eli.thegreenplace.net/2015/directed-graph-traversal-orderings-and-applications-to-data-flow-analysis/#data-flow-analysis) for more info). This ensures that the effects of all predecessors will be applied before a basic block is visited if the CFG has no back-edges, and should result in less total iterations even when back-edges exist. This should not change the results of dataflow analysis. The current ordering for basic blocks may be pretty close to RPO already--`BasicBlock(0)` is already the start block--in which case the cost of doing the traversal up front will outweigh the efficiency gains. A perf run is needed to check this. r? @pnkfelix (I think).

…der, r=nagisa Use a more efficient iteration order for backward dataflow This applies the same basic principle as rust-lang#62062 to the reverse dataflow analysis used to compute liveness information. It is functionally equivalent, except that post-order is used instead of reverse post-order. In the long-term, `BitDenotation` should probably be extended to support both forward and backward dataflow, but there's some more work needed to get to that point.

@ghost

Rollup of 8 pull requests Successful merges: - #62062 (Use a more efficient iteration order for forward dataflow) - #62063 (Use a more efficient iteration order for backward dataflow) - #62224 (rustdoc: remove unused derives and variants) - #62228 (Extend the #[must_use] lint to boxed types) - #62235 (Extend the `#[must_use]` lint to arrays) - #62239 (Fix a typo) - #62241 (Always parse 'async unsafe fn' + properly ban in 2015) - #62248 (before_exec actually will only get deprecated with 1.37) Failed merges: r? @ghost

rust-highfive assigned pnkfelix Jun 22, 2019

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jun 22, 2019

ecstatic-morse mentioned this pull request Jun 22, 2019

Use a more efficient iteration order for backward dataflow #62063

Merged

This comment has been minimized.

Sign in to view

bors added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jun 22, 2019

This comment has been minimized.

Sign in to view

ecstatic-morse force-pushed the dataflow-order branch from 1d26fb0 to b03ebd5 Compare June 22, 2019 22:01

This comment has been minimized.

Sign in to view

ecstatic-morse force-pushed the dataflow-order branch from b03ebd5 to 07c5e2b Compare June 27, 2019 18:39

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Jun 29, 2019

Centril mentioned this pull request Jun 30, 2019

Rollup of 8 pull requests #62253

Merged

bors merged commit 07c5e2b into rust-lang:master Jul 1, 2019

ecstatic-morse deleted the dataflow-order branch October 6, 2020 01:42

Use a more efficient iteration order for forward dataflow #62062

Use a more efficient iteration order for forward dataflow #62062

Uh oh!

Conversation

ecstatic-morse commented Jun 22, 2019

Uh oh!

This comment has been minimized.

This comment has been minimized.

ecstatic-morse commented Jun 22, 2019

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

nagisa commented Jun 22, 2019

Uh oh!

bors commented Jun 22, 2019

Uh oh!

bors commented Jun 23, 2019

Uh oh!

Centril commented Jun 23, 2019

Uh oh!

rust-timer commented Jun 23, 2019

Uh oh!

rust-timer commented Jun 23, 2019

Uh oh!

pnkfelix commented Jun 27, 2019

Uh oh!

ecstatic-morse commented Jun 27, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ecstatic-morse commented Jun 27, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arielb1 commented Jun 28, 2019

Uh oh!

nagisa commented Jun 29, 2019

Uh oh!

bors commented Jun 29, 2019

Uh oh!

Uh oh!

ecstatic-morse commented Jun 27, 2019 •

edited

Loading

ecstatic-morse commented Jun 27, 2019 •

edited

Loading