[Requantize] Cleanup and Optimize Lowering #5286

anijain2305 · 2020-04-09T01:45:11Z

This PR does minor cleanup and optimizes the lowering. FixedPointMultiply now takes int32 inputs and produces int32 outputs. Internally it still upcasts the integers to int64, as before.
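For readers unfamiliar with the pattern, here is a minimal Python sketch of a fixed-point multiply with an int32 interface and a wide intermediate. It is not the code in `src/relay/qnn/op/requantize.cc`. The names `to_fixed_point` and `fixed_point_multiply` are hypothetical, and the rounding mode (round half up), the scalar interface, and the folding of any left shift into a single right shift are simplifying assumptions.

```python
import math

INT32_MIN, INT32_MAX = -(1 << 31), (1 << 31) - 1


def to_fixed_point(multiplier):
    """Split a positive float into (Q31 significand, power-of-two exponent)."""
    # multiplier == significand * 2**exponent, with 0.5 <= significand < 1
    significand, exponent = math.frexp(multiplier)
    q31 = round(significand * (1 << 31))
    if q31 == (1 << 31):  # rounding pushed the significand up to 1.0
        q31 //= 2
        exponent += 1
    return q31, exponent


def fixed_point_multiply(value, multiplier):
    """Approximate round(value * multiplier) using integer arithmetic only."""
    assert INT32_MIN <= value <= INT32_MAX, "input must fit in int32"
    q31, exponent = to_fixed_point(multiplier)
    total_right_shift = 31 - exponent
    assert total_right_shift > 0, "this sketch assumes multiplier < 2**30"

    # The product of two 32-bit values needs up to 64 bits. Python ints are
    # unbounded, so this line stands in for the int64 upcast in compiled code.
    wide = value * q31
    wide += 1 << (total_right_shift - 1)  # add half an output step: round half up
    result = wide >> total_right_shift    # arithmetic shift, floors toward -inf

    # Only the narrowed int32 result leaves the function.
    assert INT32_MIN <= result <= INT32_MAX, "result must fit in int32"
    return result


print(fixed_point_multiply(100, 0.5))
print(fixed_point_multiply(1000, 0.25))
```

The point of the sketch is the shape of the computation: only the multiply, the rounding add, and the shift touch the wide type, and everything upstream and downstream can stay in 32 bits.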

I checked the accuracy of all MXNet pre-quantized models, and it is unchanged. The numbers are recorded here for completeness.

Accuracy is the same before and after this PR (measured on an Intel machine).

| Model Name | Top-1 (%) | Top-5 (%) |
| --- | --- | --- |
| Resnet18_v1 | 69.86 | 89.05 |
| Resnet50_v1 | 76.16 | 92.73 |
| Resnet50_v1b | 76.56 | 92.6 |
| Resnet101_v1 | 76.97 | 93.09 |
| Resnet152_v2 | 75.75 | 92.12 |
| Inception-V3 | 77.28 | 93.32 |
| Inception-BN | 71.79 | 90.25 |
| MobileNetV1 | 71.13 | 90.16 |
| MobileNetV2 | 70.14 | 89.52 |

This PR substantially improves performance on Raspberry Pi 4 (rasp4): quantized TFLite MobileNet latency improved from 56 ms to 46 ms. The gain comes mainly from keeping as little of the calculation in int64 as possible.

src/relay/qnn/op/requantize.cc

anijain2305 · 2020-04-10T16:23:20Z

@yzhliu @vinx13 Please review.

yzhliu · 2020-04-12T06:10:48Z

Thanks @anijain2305

* Adding Cast back to Int32 in FixedPointMultiply.
* Removing extra clip.
* Fix space.
* Retrigger.
* Retrigger.


Adding Cast back to Int32 in FixedPointMultiply.

1dce423

anijain2305 commented Apr 9, 2020


anijain2305 added 2 commits April 9, 2020 20:05

Removing extra clip.

8a0f8dc

Fix space.

8535134

anijain2305 force-pushed the arm_int64_mult branch from 5f2744e to 8535134 Compare April 9, 2020 20:57

anijain2305 added 2 commits April 9, 2020 22:03

Retrigger.

0bd377a

Retrigger.

d78195f

anijain2305 marked this pull request as ready for review April 10, 2020 16:22

yzhliu approved these changes Apr 12, 2020


yzhliu merged commit 92d0ec1 into apache:master Apr 12, 2020

yzhliu added the status: accepted label Apr 12, 2020

masahi pushed a commit to masahi/tvm that referenced this pull request Apr 12, 2020

[Requantize] Cleanup and Optimize Lowering (apache#5286)

65c0db9


trevor-m pushed a commit to trevor-m/tvm that referenced this pull request Apr 16, 2020

[Requantize] Cleanup and Optimize Lowering (apache#5286)

0a5980a


zhiics pushed a commit to neo-ai/tvm that referenced this pull request Apr 17, 2020

[Requantize] Cleanup and Optimize Lowering (apache#5286)

9938a66


dpankratz pushed a commit to dpankratz/incubator-tvm that referenced this pull request Apr 24, 2020

[Requantize] Cleanup and Optimize Lowering (apache#5286)

e05a336


shoubhik added a commit to shoubhik/incubator-tvm that referenced this pull request May 12, 2020

[Requantize] Cleanup and Optimize Lowering apache#5286

66b8d7b

apache#5286

ZihengJiang mentioned this pull request Sep 25, 2020

TVM v0.7 Release Note Candidate #6486

Closed
