[QNN] Register a bunch of unary elementwise ops #10086

AndrewZhaoLuo · 2022-01-27T20:51:08Z

Adds a way to quickly add unary operators to QNN with default canonicalizations.

Also adds following:

qnn.exp
qnn.erf
qnn.tanh
qnn.sigmoid
qnn.sqrt

AndrewZhaoLuo · 2022-01-28T02:19:54Z

This is now ready for review, cc @anwang2009 @masahi @mbrookhart @anijain2305

masahi

Very nice, I've been thinking that we should make "all" ops qnn-aware by auto-generating default lowering like this, so that we don't have to decide which op can run on int8.

anwang2009 · 2022-02-01T06:13:55Z

src/relay/transforms/pattern_utils.h

@@ -520,8 +525,8 @@ inline Expr FastSoftmax(Expr e, tvm::Attrs attr) {
  return Call(op, {e}, attr);
 }

-inline Expr Log(Expr e) {
-  static const Op& op = Op::Get("log");


why remove log?

Oops, didn't know it was originally there.

At first, I added like 20 unary funcs but decided to scale it back to make review easier (and most of the unary funcs probably have little use). Didn't realize Log was already in pattern_utils so probably deleted it on accident

anwang2009 · 2022-02-01T06:29:39Z

src/relay/qnn/op/op_common.h

+ *
+ * FloatingPointFunc is usually a handle from "src/relay/transforms/pattern_utils.h"
+ *
+ * \param OpName the name of registry.


nit: update this to "FloatingPointFunc" description

anwang2009 · 2022-02-01T06:52:29Z

tests/python/relay/test_op_qnn_unary_elementwise.py

+            relay.qnn.op.sqrt,
+            np.sqrt,
+            input_dtype="int8",
+            x_data=np.arange(1, 128, dtype="int8"),


any reason this test has x_data specified but the other int8 tests don't?

Yeah, it's sqrt so we want to keep things in domain. Outside of the domain the function can return really anything (probably either 0 or max/min value of int8)

anwang2009 · 2022-02-01T18:29:34Z

src/relay/qnn/op/op_common.h

+    for (size_t i = 1; i < 5; ++i) {                                                              \
+      types.push_back(arg_types[i]);                                                              \
+    }                                                                                             \
+    auto dequantized_arg = Dequantize(args.x, args.scale, args.zero_point, types, -1);            \


Looks like Dequantize -> DequantizeLower expects types to start with the input data type, but in this code types starts with the scale type.

https://github.com/apache/tvm/blob/main/src/relay/qnn/op/dequantize.cc#L99-L105

Good catch, you appear to be right. Moving to using the MakeDequantize and MakeQuantize to avoid issues with getting type right

AndrewZhaoLuo · 2022-02-03T01:24:17Z

@anwang2009 PTAL

anwang2009

LGTM. thanks!

AndrewZhaoLuo · 2022-02-15T22:23:56Z

cc @masahi we cool with merging this?

* 0;276;0cinitial commit * register a bunch of ops * unary ops * add a bunch of tests * 0;276;0crefactor tests * add tests to qnn * comments on macros * add back in log to pattern utils * update floating point func description * proper creating of calls to quantize and dequantize * fix lowering process for using dequantize and quantize ops

AndrewZhaoLuo requested review from anijain2305, jwfromm, ZihengJiang, icemelon, jroesch, junrushao, MarisaKirisame, mbrookhart, slyubomirsky, vinx13, wweic, yzhliu, zhiics, areusch, comaniac, merrymercy and tqchen as code owners January 27, 2022 20:51

AndrewZhaoLuo changed the title ~~[WIP][QNN] Register a bunch of unary elementwise ops~~ [QNN] Register a bunch of unary elementwise ops Jan 28, 2022

masahi approved these changes Feb 1, 2022

View reviewed changes

anwang2009 reviewed Feb 1, 2022

View reviewed changes

anwang2009 approved these changes Feb 9, 2022

View reviewed changes

AndrewZhaoLuo added 7 commits February 15, 2022 09:33

0;276;0cinitial commit

c8515c0

register a bunch of ops

eaaa69f

unary ops

e93d6fa

add a bunch of tests

e3d4c29

0;276;0crefactor tests

69792b2

add tests to qnn

6e86ed3

comments on macros

ee8fa06

AndrewZhaoLuo added 4 commits February 15, 2022 09:33

add back in log to pattern utils

c2f047d

update floating point func description

6af5dee

proper creating of calls to quantize and dequantize

bdfe5d1

fix lowering process for using dequantize and quantize ops

5456a51

AndrewZhaoLuo force-pushed the aluo/qnn/elementwise-unary-ops branch from a473635 to 5456a51 Compare February 15, 2022 17:33

masahi merged commit 64e94ab into apache:main Feb 15, 2022

driazati mentioned this pull request Jul 14, 2022

TVM v0.9.0.rc0 Release Candidate Notes #12102

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QNN] Register a bunch of unary elementwise ops #10086

[QNN] Register a bunch of unary elementwise ops #10086

AndrewZhaoLuo commented Jan 27, 2022 •

edited

Loading

AndrewZhaoLuo commented Jan 28, 2022

masahi left a comment

anwang2009 Feb 1, 2022

AndrewZhaoLuo Feb 3, 2022

anwang2009 Feb 1, 2022

AndrewZhaoLuo Feb 3, 2022

anwang2009 Feb 1, 2022

AndrewZhaoLuo Feb 3, 2022

anwang2009 Feb 1, 2022

AndrewZhaoLuo Feb 3, 2022

AndrewZhaoLuo commented Feb 3, 2022

anwang2009 left a comment

AndrewZhaoLuo commented Feb 15, 2022

[QNN] Register a bunch of unary elementwise ops #10086

[QNN] Register a bunch of unary elementwise ops #10086

Conversation

AndrewZhaoLuo commented Jan 27, 2022 • edited Loading

AndrewZhaoLuo commented Jan 28, 2022

masahi left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

AndrewZhaoLuo commented Feb 3, 2022

anwang2009 left a comment

Choose a reason for hiding this comment

AndrewZhaoLuo commented Feb 15, 2022

AndrewZhaoLuo commented Jan 27, 2022 •

edited

Loading