[Relay][Quantize] Integrate data-aware calibration into quantization #4295
Conversation
possible to have a test case?
It's difficult to write unit tests for this. The refactor is covered by nightly tests; I plan to add more nightly tests for the new calibration mode later because there is some more work to do.
@yzhliu comments addressed
LGTM, added a couple of nits that can be addressed optionally.
@@ -54,6 +54,8 @@ def kl_divergence_scale(arr, quantized_dtype='int8', num_bins=8001, num_quantize
    http://on-demand.gputechconf.com/gtc/2017/presentation/s7310-8-bit-inference-with-tensorrt.pdf
    """
    assert isinstance(arr, np.ndarray)
    assert stats is not None, "scipy need to be installed for \
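For context on what `kl_divergence_scale` computes, here is a minimal, self-contained sketch of TensorRT-style KL calibration: histogram the absolute activations, try progressively larger clipping thresholds, and keep the one whose quantized histogram has the lowest KL divergence from the original. This is a simplified illustration, not the exact TVM implementation (which uses scipy and different bin counts).

```python
import numpy as np

def kl_scale(arr, num_bins=2048, num_quantized_bins=255):
    """Pick a clipping threshold for |arr| minimizing the KL divergence
    between the original and the quantized distribution (sketch only)."""
    hist, edges = np.histogram(np.abs(arr), bins=num_bins)
    best_kl, best_threshold = np.inf, edges[-1]
    # try progressively larger clipping points
    for i in range(num_quantized_bins, num_bins + 1, 64):
        p = hist[:i].astype(np.float64).copy()
        p[-1] += hist[i:].sum()            # fold the clipped tail into the last bin
        # merge the i reference bins down to num_quantized_bins quantized bins
        factor = i / num_quantized_bins
        q = np.zeros(i)
        for j in range(num_quantized_bins):
            lo, hi = int(j * factor), int((j + 1) * factor)
            chunk = p[lo:hi]
            nonzero = int((chunk > 0).sum())
            if nonzero:
                # spread each merged bin's mass over its nonzero members
                q[lo:hi] = np.where(chunk > 0, chunk.sum() / nonzero, 0.0)
        p_n = p / p.sum()
        q_n = q / max(q.sum(), 1e-12)
        mask = p_n > 0
        kl = np.sum(p_n[mask] * np.log(p_n[mask] / np.maximum(q_n[mask], 1e-12)))
        if kl < best_kl:
            best_kl, best_threshold = kl, edges[i]
    return best_threshold
```

The returned threshold then determines the quantization scale, e.g. `scale = threshold / 127` for int8.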
needs
@@ -143,9 +141,20 @@ def qconfig(**kwargs):
    nbit_dict: dict of QAnnotateKind -> int
        Number of bit for every kind of annotate field.

    calibrate_mode: str
        The calibration mode. 'global_scale' or 'kl'.
can we spell it out so it's more self-explanatory (e.g. kullback_leibler)?
@vinx13 I realize that this PR might have broken the E2E VTA tutorial flow (that includes quantization for VTA hardware pipeline). I'll need to investigate, but what puzzles me more is why that wasn't caught by the CI, which should be building the sphinx galleries successfully, which should include the e2e VTA tutorial.
The script in question is
…pache#4295) * [Relay][Quantize] Integrate data-aware calibration into quantization * Update _calibrate.py * trigger ci * Address comments * address comments
Fix for the VTA bug introduced is in #4433.
This PR did some refactoring work for the calibration part: the evaluation script for KL has been integrated, and the stats-collection logic has been moved into the internal `collect_stats`. New config options `calibrate_mode` and `weight_scale` have been added. Removed `opt_level=3` for `prerequisite_optimize`, as it caused some accuracy issues when `FoldScaleAxis` is invoked before calibration. Part of this PR is based on #3828.
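To illustrate what a `weight_scale` option like this can choose between, here is a hedged sketch (the option name comes from this PR, but the function below and its exact semantics are illustrative assumptions, not TVM's implementation): `'max'` uses the largest absolute weight directly as the scale, while `'power2'` rounds that maximum up to a power of two so dequantization can be done with a bit shift.

```python
import numpy as np

def weight_scale(w, mode="power2"):
    """Illustrative sketch of two weight-scale policies (not TVM's code).
    'max'    -> scale = max(|w|)
    'power2' -> scale = max(|w|) rounded up to the nearest power of two
    """
    amax = float(np.abs(w).max())
    if mode == "max":
        return amax
    # round up to the nearest power of two for shift-friendly scaling
    return float(2.0 ** np.ceil(np.log2(amax)))
```

The power-of-two variant trades a little precision (the scale can be up to 2x larger than needed) for cheaper integer arithmetic on hardware backends.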
@ZihengJiang @anijain2305 @tmoreau89