
[RELAY][PASS] FoldScaleAxis Backward #2024

Merged: 2 commits, Oct 30, 2018
Conversation

tqchen (Member) commented on Oct 29, 2018

This is a followup of #2020. This PR implements the infrastructure to do backward folding of a scale along an axis.

Goal

Fold the scaling of an axis (usually caused by BatchNorm) backward into the weight of the preceding conv2d. For example:

Old:

%1 = conv2d(%x, %w, data_layout="NHWC")
%2 = multiply(%1, %scale)

Transformed:

# scale weight's output channel
%1 = multiply(%w, expand_dims(%scale, axis=1, num_newaxis=3))
%2 = conv2d(%x, %1, data_layout="NHWC")

Further constant folding can then fold the multiplication into the weight, removing the scaling from the network.
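As a usage illustration, here is a minimal Python sketch of driving the new pass end to end. The entry points (relay.ir_pass.infer_type, relay.ir_pass.backward_fold_scale_axis, relay.ir_pass.fold_constant) and the conv2d keyword names (e.g. weight_layout) are assumptions based on the Relay Python API around the time of this PR and may differ from the final code:

import numpy as np
from tvm import relay

# Build the "Old" program above: conv2d followed by a per-channel scale.
x = relay.var("x", shape=(1, 32, 32, 16))
w = relay.const(np.random.uniform(size=(3, 3, 16, 16)).astype("float32"))
scale = relay.const(np.random.uniform(size=(16,)).astype("float32"))
y = relay.nn.conv2d(x, w, kernel_size=(3, 3), channels=16,
                    data_layout="NHWC", weight_layout="HWIO")
y = relay.multiply(y, scale)
func = relay.Function([x], y)

# Fold the scale backward into the conv2d weight, then constant-fold so
# the standalone multiply disappears.
func = relay.ir_pass.infer_type(func)
func = relay.ir_pass.backward_fold_scale_axis(func)
func = relay.ir_pass.fold_constant(func)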

The Algorithm

The general idea is similar to the forward algorithm. When doing the transformation, we pass an additional argument (axes, scale) to request that the result satisfy:

result = value
for i, k in enumerate(axes):
   k-th dimension of result *= i-th dimension of scale

The problem is, again, that we don't want to blindly propagate the scale backward if it won't get consumed. So we run a forward "preparation phase", which propagates the demand for the potential axis scaling.
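To make the requested invariant concrete, here is a small NumPy sketch (illustration only, not code from this PR) of the scaling the transformer is asked to produce:

import numpy as np

def apply_axes_scale(value, axes, scale):
    # For each (i, k) in enumerate(axes), scale the k-th dimension of `value`
    # by the i-th dimension of `scale`.
    shape = [1] * value.ndim
    for i, k in enumerate(axes):
        shape[k] = scale.shape[i]
    return value * scale.reshape(shape)

out = np.random.randn(1, 8, 8, 16)      # e.g. an NHWC conv2d output
s = np.random.randn(16)                 # per-channel scale on axis 3
expected = apply_axes_scale(out, axes=[3], scale=s)
assert np.allclose(expected, out * s)   # same as broadcasting over the channel axis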

The new pass is more general than the FoldScaleAxis pass in NNVM:

  • The new pass supports arbitrary scaling of multiple axes (although further operator implementations are necessary), which could be helpful in the NCHWc case.
  • The new pass supports folding a scale backward into a sum of two conv2d results, which was not possible in NNVM (see the sketch below).
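For the second point, here is a sketch in the same notation as the example above (illustration only, not taken from the PR's test cases): the scale after the add is pushed into both weights, and constant folding then removes the extra multiplies.

Old:

%1 = conv2d(%x1, %w1, data_layout="NHWC")
%2 = conv2d(%x2, %w2, data_layout="NHWC")
%3 = add(%1, %2)
%4 = multiply(%3, %scale)

Transformed:

# scale both weights' output channels
%1 = multiply(%w1, expand_dims(%scale, axis=1, num_newaxis=3))
%2 = multiply(%w2, expand_dims(%scale, axis=1, num_newaxis=3))
%3 = conv2d(%x1, %1, data_layout="NHWC")
%4 = conv2d(%x2, %2, data_layout="NHWC")
%5 = add(%3, %4)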

tqchen (Member Author) commented on Oct 29, 2018

call->args[1]->type_as<TensorTypeNode>()->shape,
weight_layout, kOIHW);
return is_const_int(wshape[0], param->groups) &&
is_const_int(wshape[1], 1);
Member

Could we use just is_const_int(wshape[0], param->groups) to check for depthwise convolution? In theory, wshape[1] may not be equal to 1, although frontends such as TF / MXNet / Caffe will make it 1 even when the depth channel multiplier is not 1 (such as 0.25).

Member Author

The current code makes the assumption that the input and output groups are even, and the special-case logic relies on this.

A better approach would be to enhance the conv2d code itself to handle general grouped convolution; that should not be too hard and would be a good followup contribution from anyone in the community :)

return CallNode::make(call->op, {input}, call->attrs, call->type_args);
}

RELAY_REGISTER_OP("nn.relu")
Member

How about adding the clip operator, like relu? The TensorFlow and Keras frontends use clip to implement the relu6 operator.

Member Author

clip can be supported. This PR mainly introduces the necessary scaffolding; with the new folding infrastructure, we can support clip as well as other operators (flatten, transpose) through additional registrations.

// - Prepare phase: backward propagation of demand.
// - Transform phase: forward transformation,
//
// Similarly, borward folding process is done in two steps:
Member

Typo: "borward" should be "backward".

class BackwardTransformer;

/*!
* \brief Preparation function for for pass scale backward.
Member

Typo: duplicated "for" ("for for").

tqchen (Member Author) commented on Oct 29, 2018

Thanks @masahi @FrozenGene, I have updated the PR according to your comments.

tqchen merged commit d5103bb into apache:master on Oct 30, 2018
tqchen (Member Author) commented on Oct 30, 2018

Thanks @FrozenGene @masahi, this is merged. Followup PRs to support more ops (clip) and grouped conv are more than welcome!

FrozenGene pushed a commit to FrozenGene/tvm that referenced this pull request Dec 27, 2018
wweic pushed a commit to neo-ai/tvm that referenced this pull request Feb 20, 2019