[QUANTIZE] Refactor quantization codebase and fix model accuracy #3543

ZihengJiang · 2019-07-14T00:10:17Z

Separate quantization code base into different files: partition.cc, annotate.cc, realize.cc
Change rewrite_for_vta to extra partition pass and enable it by default
Change annotation.force_cast(x) to annotation.cast_hint(x, dtype)
Remove qconfig.store_lowbit_output and enable it by default
Fixed accuracy of models like mobilenet：
- resnet18_v1(8-16bit): 69.29%
- resnet18_v1(8-32bit): 69.29%
- resnet34_v1: 73.33%
- resnet50_v1: 74.78%
- resnet101_v1: 75.66%
- mobilenetv2_1.0: 66.64%

cc @tqchen @eqy @vinx13 @tmoreau89

ZihengJiang · 2019-07-19T04:32:35Z

After discussing offline with Tianqi, we decide to build the nightly regression tests in another repo.

ZihengJiang · 2019-07-19T04:41:49Z

@tqchen @eqy @vinx13 @tmoreau89 Could you please help to review this change?

python/tvm/relay/quantize/quantize.py

python/tvm/relay/quantize/_annotate.py

python/tvm/relay/quantize/_partition.py

eqy · 2019-07-19T16:35:23Z

@ZihengJiang Do you think we we could try to get the calibration PR first? I have to port it over to the new pass infra and I think this is likely more easy to replay on top of calibration than vice-versa.

ZihengJiang · 2019-07-19T17:07:57Z

@eqy Do you mean this one? #3294
Sure if you have time recently

tmoreau89

LGTM

src/relay/pass/quantize/quantize.h

tqchen · 2019-08-02T15:53:33Z

@ZihengJiang please followup now that #3538 is merged

tmoreau89 · 2019-08-07T01:41:36Z

What is the status on this PR? @tqchen @ZihengJiang

ZihengJiang · 2019-08-12T23:42:38Z

python/tvm/relay/quantize/_partition.py

+def add_partition_function(ref_call, new_args, ctx):
+    """Rewrite function for ewise add for partition"""
+    if 'cuda' in _target.current_target().keys:
+        #TODO(wuwei/ziheng) cuda specific rules


@vinx13 Since general devices and VTA are okay/required to insert stop_fusion in both side, let's use different rewrite rules for specific target here,

tests/python/nightly/quantization/test_quantization_accuracy.py

ZihengJiang · 2019-08-15T09:31:23Z

@mingwayzhang those links should be helpful:

…che#3543) * Refactor. * update * update * update * update * update * update

ZihengJiang changed the title ~~[WIP] Add nightly quantization regression tests~~ [QUANTIZE] Refactor codebase, fix accuracy, add nightly regression tests Jul 19, 2019

ZihengJiang changed the title ~~[QUANTIZE] Refactor codebase, fix accuracy, add nightly regression tests~~ [QUANTIZE] Refactor codebase and fix accuracy Jul 19, 2019

ZihengJiang changed the title ~~[QUANTIZE] Refactor codebase and fix accuracy~~ [QUANTIZE] Refactor quantization codebase and fix model accuracy Jul 19, 2019

ZihengJiang added the status: need review label Jul 19, 2019

vinx13 requested changes Jul 19, 2019

View reviewed changes

vinx13 mentioned this pull request Jul 22, 2019

[Relay][Quantization] KL-divergence-based per-layer calibration #3538

Merged

tmoreau89 approved these changes Jul 23, 2019

View reviewed changes

src/relay/pass/quantize/quantize.h Outdated Show resolved Hide resolved

tmoreau89 mentioned this pull request Aug 9, 2019

[VTA][Relay] Extending Vision model coverage compilation for VTA #3740

Merged

ZihengJiang force-pushed the quantize-lr branch from c01618d to c685007 Compare August 12, 2019 21:36

Refactor.

46c9667

ZihengJiang force-pushed the quantize-lr branch from 481c4fa to 46c9667 Compare August 12, 2019 21:43

update

c68f087

ZihengJiang commented Aug 12, 2019

View reviewed changes

ZihengJiang mentioned this pull request Aug 13, 2019

[QUANTIZE] Refactor quantization codebase and fix model accuracy #3762

Closed

ZihengJiang added 5 commits August 12, 2019 19:10

update

0c53a16

update

6dc23aa

update

15ee364

update

8bcfc40

Merge branch 'master' of github.com:dmlc/tvm into dev

d6b9381

vinx13 approved these changes Aug 15, 2019

View reviewed changes

tests/python/nightly/quantization/test_quantization_accuracy.py Outdated Show resolved Hide resolved

update

4cac1f9

ZihengJiang merged commit 7eb1f35 into apache:master Aug 15, 2019

ZihengJiang deleted the quantize-lr branch August 15, 2019 09:31

vinx13 mentioned this pull request Aug 16, 2019

[Relay][Quantization] Fix out-of-date realize #3790

Merged

wweic pushed a commit to neo-ai/tvm that referenced this pull request Aug 16, 2019

[QUANTIZE] Refactor quantization codebase and fix model accuracy (apa…

d537774

…che#3543) * Refactor. * update * update * update * update * update * update

anijain2305 pushed a commit to anijain2305/tvm that referenced this pull request Aug 22, 2019

[QUANTIZE] Refactor quantization codebase and fix model accuracy (apa…

addf4b4

…che#3543) * Refactor. * update * update * update * update * update * update

wweic pushed a commit to neo-ai/tvm that referenced this pull request Sep 6, 2019

[QUANTIZE] Refactor quantization codebase and fix model accuracy (apa…

c4b9fbc

…che#3543) * Refactor. * update * update * update * update * update * update

tqchen mentioned this pull request Nov 8, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QUANTIZE] Refactor quantization codebase and fix model accuracy #3543

[QUANTIZE] Refactor quantization codebase and fix model accuracy #3543

ZihengJiang commented Jul 14, 2019 •

edited

Loading

ZihengJiang commented Jul 19, 2019

ZihengJiang commented Jul 19, 2019

eqy commented Jul 19, 2019

ZihengJiang commented Jul 19, 2019 •

edited

Loading

tmoreau89 left a comment

tqchen commented Aug 2, 2019

tmoreau89 commented Aug 7, 2019

ZihengJiang Aug 12, 2019 •

edited

Loading

ZihengJiang commented Aug 15, 2019 •

edited

Loading

[QUANTIZE] Refactor quantization codebase and fix model accuracy #3543

[QUANTIZE] Refactor quantization codebase and fix model accuracy #3543

Conversation

ZihengJiang commented Jul 14, 2019 • edited Loading

ZihengJiang commented Jul 19, 2019

ZihengJiang commented Jul 19, 2019

eqy commented Jul 19, 2019

ZihengJiang commented Jul 19, 2019 • edited Loading

tmoreau89 left a comment

Choose a reason for hiding this comment

tqchen commented Aug 2, 2019

tmoreau89 commented Aug 7, 2019

ZihengJiang Aug 12, 2019 • edited Loading

Choose a reason for hiding this comment

ZihengJiang commented Aug 15, 2019 • edited Loading

ZihengJiang commented Jul 14, 2019 •

edited

Loading

ZihengJiang commented Jul 19, 2019 •

edited

Loading

ZihengJiang Aug 12, 2019 •

edited

Loading

ZihengJiang commented Aug 15, 2019 •

edited

Loading