[QNN] Optimize requantize for power of 2 and fix dequantize for per-channel quantized input #6675
Conversation
@ZihengJiang Can you please review?
Force-pushed from ec78ab9 to 3e41e7d.
src/relay/qnn/op/requantize.cc (outdated)
(is_upward_rounding
    ? FixedPointMultiply(scaled_int32_t, fixed_point_multiplier, shift)
    : FixedPointMultiplyToNearest(scaled_int32_t, double_multiplier, input_shape));
if (is_upward_rounding && fixed_point_multiplier == (1 << 30)) {
Why must the fixed_point_multiplier be (1 << 30)?
So, we use frexp to represent a floating point number. It gives a float significand that lies in [0.5, 1). For a power of 2, the significand is always 0.5. We convert the float significand into a fixed-point 32-bit integer, where the decimal point sits between the first and second bit. In this representation, 0.5 = 1 << 30.
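A minimal standalone sketch of that conversion (plain C++, not TVM code; the scale ratio value is an illustrative assumption):

#include <cmath>
#include <cstdint>
#include <cstdio>

int main() {
  double scale_ratio = 0.25;  // hypothetical input_scale / output_scale, an exact power of 2
  int shift = 0;
  double significand = std::frexp(scale_ratio, &shift);  // significand in [0.5, 1); 0.5 for a power of 2
  // Encode the significand as a signed Q1.31 fixed-point integer:
  // 0.5 * 2^31 == 1 << 30, which is exactly the marker checked in the diff above.
  auto fixed_point_multiplier = static_cast<int32_t>(std::llround(significand * (1LL << 31)));
  std::printf("significand=%f shift=%d multiplier=%d (1 << 30 = %d)\n",
              significand, shift, fixed_point_multiplier, 1 << 30);
  return 0;
}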
@anijain2305, can you add a small one-line comment regarding (1 << 30)? These days, aside from float32, many other float types are floating around.
@cbalint13 Added a comment, can you PTAL?
@anijain2305, thank you!
Force-pushed from 5825c48 to da11098.
Can you describe in the commit message what bug you are fixing for dequantize, or link to an actual bug report? Ramana
Dequantize in the main branch fails when the input is per-channel quantized. There is no bug report; I found this while working on quantizing models in Relay.
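(For context, per-channel quantization gives each channel along the quantization axis its own scale and zero point. A minimal scalar sketch of what dequantize must do in that case, written as plain C++ rather than the Relay implementation, with all names illustrative:)

#include <cstdint>
#include <vector>

// Dequantize data laid out as [channels, elems_per_channel], where channel c
// carries its own scale and zero point.
std::vector<float> dequantize_per_channel(const std::vector<int8_t>& data,
                                          const std::vector<float>& scales,
                                          const std::vector<int32_t>& zero_points,
                                          int channels, int elems_per_channel) {
  std::vector<float> out(data.size());
  for (int c = 0; c < channels; ++c) {
    for (int i = 0; i < elems_per_channel; ++i) {
      int idx = c * elems_per_channel + i;
      out[idx] = scales[c] * (static_cast<int32_t>(data[idx]) - zero_points[c]);
    }
  }
  return out;
}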
Cool, could the description above say something like "Fix dequantize for per-channel quantized input" instead of just fixing a bug?
A few comments from me as well before this goes in!
src/relay/qnn/op/requantize.cc (outdated)
// Power of 2 is determined by the fixed_point_multiplier == 1 << 30. In case of power of 2,
// fixed point multiplier will represent a float value of 0.5. In fixed point, this is
// represented by 1 << 30.
scaled_int32_t = PowerOfTwoMultiply(scaled_int32_t, shift - 1);
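For readers skimming the diff, a minimal sketch (plain C++ with illustrative values, not TVM code) of why a 1 << 30 multiplier collapses to a plain shift by shift - 1:

#include <cassert>
#include <cstdint>

int main() {
  // A Q1.31 multiplier of 1 << 30 encodes 0.5, so for a positive shift:
  //   x * 0.5 * 2^shift == x << (shift - 1)
  int64_t x = 96;                 // illustrative input value
  int shift = 3;                  // illustrative positive shift
  int64_t multiplier = 1LL << 30;
  int64_t generic = ((x * multiplier) >> 31) << shift;  // widen, multiply, renormalize, shift
  int64_t fast = x << (shift - 1);                      // power-of-2 fast path
  assert(generic == fast);
  return 0;
}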
Does it make sense for this to go into FixedPointMultiply? That would let everybody using FixedPointMultiply benefit from this fix.
src/relay/qnn/utils.cc (outdated)
auto rounding_factor = 1 << (exp - 1);
auto rounded_t = Add(tensor, MakeConstantScalar(DataType::Int(32), rounding_factor));
out = RightShift(rounded_t, MakeConstantScalar(DataType::Int(32), exp));
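For context, this snippet applies a rounding right shift to the tensor: add half of 2^exp, then shift. A minimal scalar sketch of the same arithmetic (plain C++, not Relay expressions):

#include <cstdint>
#include <cstdio>

// Rounds to nearest, with ties going toward +infinity (the upward-rounding behaviour).
int32_t rounding_right_shift(int32_t x, int exp) {
  int32_t rounding_factor = 1 << (exp - 1);  // half of 2^exp
  return (x + rounding_factor) >> exp;
}

int main() {
  std::printf("%d %d\n",
              rounding_right_shift(13, 2),   // 13 / 4 = 3.25 -> 3
              rounding_right_shift(14, 2));  // 14 / 4 = 3.50 -> 4
  return 0;
}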
Are you sure you don't need to convert to int64 upfront and then cast back to int32?
@giuseros Can you PTAL? I put all the stuff in fixed_point_multiply. Regarding your comment on upcasting to int64, I don't think it's necessary. But please let me know if I am missing a corner case.
Hi @anijain2305,
@ZihengJiang @u99127 Can you please take a look again? The PR changed after I incorporated @giuseros's comments.
LGTM! Thanks! @anijain2305
…hannel quantized input (apache#6675) * [QNN] Optimize requantize for power of 2 and bug in dequantize * Comments * Docs * Comments * Ethos
Optimize the lowering of the requantize op for a power-of-2 scale. Additionally, fix dequantize for per-channel quantized input.