
[Relay][Pass] Support combine multiple dense op just into dense #6062

Conversation

wrongtest-intellif (Contributor)

Hi there, this PR is a minor modification to the CombineParallelDense pass; see https://discuss.tvm.ai/t/yet-another-dense-op-combine-strategy/7126. The changes are:

  • Add an option "to_batch" (defaulting to True) to control whether dense ops are combined into batch_matmul or into dense.

  • Add an implementation that combines dense ops into one large dense instead of batch_matmul, following almost the same logic as the CombineParallelConv2D pass.

  • Add test cases for combining dense ops followed by element-wise ops of various shapes.

  • The new strategy can combine ops even when their output dims differ, and it may perform better in circumstances where a flat matmul is faster than the equivalent batch_matmul. (A usage sketch follows this list.)
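For illustration, here is a minimal sketch (not part of the PR) of running the pass on two parallel dense ops through the Python API mentioned later in this thread; the shapes and variable names are made up:

```python
import tvm
from tvm import relay

# Two dense ops sharing input x. With to_batch=False the pass concatenates
# the weights along the output dimension, emits one flat dense, and slices
# the combined result back into the original branches.
x = relay.var("x", shape=(1, 16))
w1 = relay.var("w1", shape=(32, 16))
w2 = relay.var("w2", shape=(48, 16))  # output dims may differ under the new strategy
out = relay.Tuple([relay.nn.dense(x, w1), relay.nn.dense(x, w2)])
mod = tvm.IRModule.from_expr(relay.Function(relay.analysis.free_vars(out), out))

mod = relay.transform.InferType()(mod)
# Positional args: min_num_branches=2, to_batch=False (the new flat strategy)
mod = relay.transform.CombineParallelDense(2, False)(mod)
print(mod)
```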

wrongtest-intellif force-pushed the feat/SupportAnotherDenseCombineStrategy branch from 022bd68 to 2170bed on July 15, 2020 08:28
wrongtest-intellif (Contributor, PR author)

cc @jroesch @icemelon9 @comaniac

comaniac (Contributor) left a comment


Thanks for the PR! I have two questions in addition to the inline comments:

  1. How does an end user trigger this change (i.e., to_batch=false)? It seems to me that it can only be configured by manually modifying the build_module or VM compiler and rebuilding TVM.

  2. IIUC, the reason ParallelDenseFlatCombiner derives from ParallelOpCombiner instead of ParallelOpBatchCombiner is that it requires special handling in almost every function, so there is no benefit to deriving from ParallelOpBatchCombiner. Now the class hierarchy becomes:

    • ParallelDenseBatchCombiner <- ParallelOpBatchCombiner <- ParallelOpCombiner
    • ParallelDenseFlatCombiner <----------------------------------|

    Since I didn't find any other class derived from ParallelOpBatchCombiner, should we simplify the ParallelOpBatchCombiner class if we cannot make both ParallelDense*Combiner classes derive from it?

Review comments (outdated, resolved) on:
  • src/relay/backend/build_module.cc
  • src/relay/backend/vm/compiler.cc
  • src/relay/transforms/combine_parallel_dense.cc
  • include/tvm/relay/transform.h
wrongtest-intellif (Contributor, PR author) commented Jul 16, 2020

Thanks for your comments!

  • In our practice we just manually call the Python API mod = relay.transform.CombineParallelDense(3, False)(mod).
    Because this pass changes tensor shapes, we currently have to call it manually (or call relay.optimize(mod) for the default optimization) before any auto-tuning step, so that tuning sees the combined kernel shape, and then build; see the sketch after this list.

  • How about exposing ParallelOpBatchCombiner as an optional pass (maybe in another PR)? It could be used like mod = CombineParallelOpToBatch("op_name", "batch_op_name", 3). This would serve the original idea of the class and let users flexibly combine various kinds of ops. Of course, the use case may be rare in common network structures.
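As a rough sketch of that tuning workflow (not from the PR; it assumes mod, params, and target are already defined):

```python
import tvm
from tvm import relay, autotvm

# Combine parallel dense ops first, so auto-tuning sees the combined shape.
mod = relay.transform.CombineParallelDense(3, False)(mod)
tasks = autotvm.task.extract_from_program(mod["main"], target=target, params=params)
# ... tune the extracted tasks and log the best schedules ...
with tvm.transform.PassContext(opt_level=3):
    lib = relay.build(mod, target=target, params=params)
```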

comaniac (Contributor) left a comment


LGTM. Thanks.
Would like to get comments from @jroesch @icemelon9 @jonso4 as well.

MarisaKirisame (Contributor) left a comment


IMO there should be a CombineParallelOp pass that just calls CombineParallelDense/Conv2D/etc. in sequence. Can you add that?
Also, why CombineParallel instead of Batch? People use static batching/dynamic batching to describe this process.

Review comment (outdated, resolved) on: include/tvm/relay/transform.h
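Such a pass could presumably be a thin Sequential over the existing combine passes. A hypothetical sketch (the wrapper name is illustrative, not from the PR):

```python
import tvm
from tvm import relay

# Hypothetical wrapper: run the parallel-op combine passes one after another.
combine_parallel_ops = tvm.transform.Sequential([
    relay.transform.CombineParallelConv2D(min_num_branches=2),
    relay.transform.CombineParallelDense(min_num_branches=2),
    relay.transform.CombineParallelBatchMatmul(min_num_branches=2),
])
mod = combine_parallel_ops(mod)  # mod is assumed to be an existing IRModule
```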
tqchen (Member) commented Jul 25, 2020

@MarisaKirisame please manage the PR and merge after everyone approves.

MarisaKirisame (Contributor) commented Jul 25, 2020

@tqchen got it.

tqchen (Member) commented Aug 6, 2020

@MarisaKirisame please followup

MarisaKirisame (Contributor)

@wrongtest can you just make a pass that is nothing but a sequential composition of the 3 combine passes? That's all I think should be changed.

wrongtest-intellif (Contributor, PR author)

Sorry for the late reply; a wrapped function BatchingOps() has been added.
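Usage of the wrapper is then a one-liner (a sketch, assuming an existing module mod):

```python
from tvm import relay

mod = relay.transform.BatchingOps()(mod)  # runs the parallel-op combine passes in sequence
```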

MarisaKirisame merged commit b3c42f9 into apache:master on Aug 7, 2020
wjliu1998 pushed a commit to wjliu1998/incubator-tvm that referenced this pull request Aug 13, 2020
…he#6062)

* feat: Support combine multiple matmuls to flat matmul

* fix: Change to_batch -> to_batch_matmul and enrich docstring

* feat: Add wrapped batching ops pass for python
trevor-m pushed commits with the same message to trevor-m/tvm (Aug 26 and Sep 2, 2020) and to neo-ai/tvm (Sep 3, 2020) that referenced this pull request.
zhanghaohit deleted the feat/SupportAnotherDenseCombineStrategy branch on December 17, 2020.