[ConvertLayout] Support QNN ops. #5066
Conversation
235c079 to a4c5092
Overall looks good to me.
src/relay/qnn/op/convolution.cc
Outdated
// Fill the layouts of remaining input tensors - scales and zero points. The layouts of these
// tensors can be ignored as they don't go through any transformation.
Layout ignore_layout = Layout("I");
Are they always the input channel?
They can be scalar or output channel. I initially thought of putting them as "C", but chose "I" to be more specific. I am open to discussion.
maybe "C" is better. I don't have strong opinion though
LGTM
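For reference, a minimal sketch of how the resolution might look (per the "Changing layouts to C." commit below): the scale and zero point inputs are scalar or per-output-channel, so they receive a plain channel layout. The variable names data_layout and kernel_layout are illustrative, not taken from the actual patch.

// Sketch (hedged): scale and zero point inputs get a channel layout "C".
// data_layout and kernel_layout are hypothetical names for the layouts
// already inferred for the data and kernel tensors.
Layout channel_layout = Layout("C");
Array<Layout> input_layouts = {data_layout, kernel_layout, channel_layout,
                               channel_layout, channel_layout, channel_layout};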
* [ConvertLayout] Support QNN ops.
* Changing layouts to C.
* Fixing dilation.
* Empty commit.
Co-authored-by: Ubuntu <ubuntu@ip-172-31-53-55.us-west-2.compute.internal>
The recently introduced op strategy has disabled conversion from NHWC to NCHW in AlterOpLayout (which is the correct thing to do). We can solve this problem by calling ConvertLayout in the parser if needed. However, this only works for FP32 graphs.
For quantized models, the parsers produce a QNN graph, and this QNN graph goes to relay.build, which internally calls the QNN Legalize passes to convert it to Relay-only ops. The problem is that ConvertLayout does not work on QNN ops, so even if we call ConvertLayout after the parser, the layouts will not change. That is the gap this PR closes; a sketch of the intended usage follows.
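As a hedged sketch of the intended usage once QNN ops are supported: the ConvertLayout C++ entry point is assumed here to take a single desired-layout string, as in TVM around the time of this PR (later versions take a per-op layout map instead).

// Sketch: run ConvertLayout on a parsed QNN module before relay.build.
// Assumption: ConvertLayout("NCHW") is the pass entry point of this era.
#include <tvm/ir/module.h>
#include <tvm/relay/transform.h>

tvm::IRModule ConvertToNCHW(tvm::IRModule mod) {
  tvm::relay::transform::Pass pass = tvm::relay::transform::ConvertLayout("NCHW");
  // With this PR, QNN ops such as qnn.conv2d also get their layouts rewritten.
  return pass(mod);
}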
This PR implements ConvertLayout for QNN ops. In addition, I have changed the interface of FInferCorrectLayout to ingest an array of Relay types instead of shapes. This is helpful in operators like Concatenate, where we need to know the number of input data tensors.
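A hedged sketch of the revised interface: the last parameter now carries Relay types rather than raw shapes, so an op can count its inputs. The function name and body here are illustrative, not copied from the patch.

// Sketch of the new FInferCorrectLayout shape (names illustrative).
#include <tvm/relay/op_attr_types.h>
#include <tvm/tir/data_layout.h>

using namespace tvm;
using tir::Layout;

Array<Array<Layout>> QnnConvInferCorrectLayout(const Attrs& attrs,
                                               const Array<Layout>& new_in_layouts,
                                               const Array<Layout>& old_in_layouts,
                                               const Array<relay::Type>& old_in_types) {
  // old_in_types replaces the previous array of shapes; its size tells ops
  // like Concatenate how many data inputs they have.
  // ... infer data/kernel layouts, append channel layouts for scales and
  // zero points (see the review thread above) ...
  return {};  // placeholder body
}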
@icemelon9 @zhiics @yzhliu