[REFACTOR][BOYC] Non recursive partitioning #5493

zhiics · 2020-04-30T21:38:40Z

This PR refactors the partitioning pass by using non-recursive mutator. It also removes the unnecessary mutators as we only need to look at begin/end annotations which are definitely wrapped in call nodes. In addition, a metadata struct is used to maintain the intermediate data needed for partitioning.

zhiics · 2020-04-30T21:40:43Z

@mbrookhart please take a look for the mixed mutator pattern. BTW, we will still need to refactor the infertype pass as it is the most frequently used pass.

@comaniac @masahi @mbaret @manupa-arm @trevor-m please take a look.

comaniac

We were trying rewriter but it is not applicable to this pass due to https://github.com/apache/incubator-tvm/pull/5493/files#diff-8d2cdf6314f73e4b32892679ad4dc44aR280, which traverses other nodes out of order. As a result, the current solution seems the most suitable.

mbrookhart

Use of the iterative traversals looks great. Since you originally authored the class, I'll believe you about the necessary mutators.

Why so much auto-formatting noise? As a larger conversation, we might want to build a style checker into CI that enforces a particular auto-format implementaiton...

comaniac · 2020-04-30T22:01:47Z

Ah I think that's because I manually ran clang-format for the file. We should definitely build style checker in CI.

src/relay/transforms/partition_graph.cc

masahi · 2020-05-01T11:55:02Z

src/relay/transforms/partition_graph.cc

-  bool found_start_{false};
-  bool found_end_{false};
+  /*! \brief Map from each region output expr node to its output index and TupleGetItem node. */
+  std::unordered_map<Expr, std::pair<int, TupleGetItem>, ObjectHash, ObjectEqual> out_expr_indices;


Since both arguments of TupleGetItem (func_call and the index) are in this struct, this TupleGetItem seems redundant to me. Also see my comment at L282.

If this TupleGetItem is intended to be cached, please come up with a better name than out_expr_indices, since it is not just indices.

Rename to region_func_out and only cache the function output expressions (Call or TupleGetItem).

src/relay/transforms/partition_graph.cc

comaniac · 2020-05-01T18:12:11Z

@masahi thanks for valuable suggestions. We've refactored CreateFunction to create all TupleGetItem nodes so that GetFunctionOutput can be safely removed and the metadata could be more concise. PTAL.

masahi · 2020-05-01T20:28:48Z

Thanks @zhiics @comaniac @mbrookhart @manupa-arm

* non recursive partitioning * refactor maps * rebase upstream * refactor shared output * address comments Co-authored-by: Cody Yu <comaniac0422@gmail.com>

zhiics and others added 4 commits April 30, 2020 21:22

non recursive partitioning

ed684e5

refactor maps

b8bfc59

rebase upstream

aaed2b4

refactor shared output

e8ffa89

comaniac approved these changes Apr 30, 2020

View reviewed changes

mbrookhart approved these changes Apr 30, 2020

View reviewed changes

manupak reviewed May 1, 2020

View reviewed changes

src/relay/transforms/partition_graph.cc Show resolved Hide resolved

masahi self-assigned this May 1, 2020