Add batch normalization folding to QAT quantizer #3911
Conversation
@@ -125,7 +125,7 @@ class QAT_Quantizer(Quantizer):
     http://openaccess.thecvf.com/content_cvpr_2018/papers/Jacob_Quantization_and_Training_CVPR_2018_paper.pdf
     """

-    def __init__(self, model, config_list, optimizer=None):
+    def __init__(self, model, config_list, optimizer=None, model_inputs=None):
Is model_inputs the same concept as dummy_input in pruning speedup and quantization speedup? If so, I recommend using dummy_input instead of model_inputs to keep them aligned.
done
Looks good. I only have one question right now. Are there any problems if we want to export the simulated model with the new BN folding feature to a backend execution engine such as TensorRT? For instance, during inference, conv+bn+relu will be fused into a single op by updating the conv's weight/bias parameters with the bn parameters. However, currently our conv's weights are already equal to the fused weights while the bn layer still exists. If the problem actually exists, maybe we can discuss an appropriate method to resolve it.
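For reference, the fusion arithmetic described above (scaling the conv weight per output channel by the BN statistics and shifting the bias accordingly) could be sketched roughly as follows; `fold_conv_bn_weights` is an illustrative helper, not the exact code in this PR:

```python
import torch

def fold_conv_bn_weights(conv_w, conv_b, bn_mean, bn_var, bn_eps, bn_gamma, bn_beta):
    # Fold BatchNorm statistics into the preceding conv's weight and bias,
    # which is what a backend does when it fuses conv+bn into a single op.
    if conv_b is None:
        conv_b = torch.zeros_like(bn_mean)
    scale = bn_gamma / torch.sqrt(bn_var + bn_eps)      # per-output-channel scale
    folded_w = conv_w * scale.reshape(-1, 1, 1, 1)      # scale each output channel
    folded_b = (conv_b - bn_mean) * scale + bn_beta     # shift the bias accordingly
    return folded_w, folded_b
```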
You are right. I have added some code logic to restore the folded weight/bias in …
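A rough sketch of what such restore logic could look like, assuming the unfolded conv parameters were cached on the wrapper before folding (the attribute names below are hypothetical, not the PR's actual code):

```python
def restore_unfolded_parameters(wrapper):
    # Hypothetical helper: before export, put back the conv parameters that
    # were cached prior to BN folding, so the exported model keeps separate
    # conv and bn layers that a backend such as TensorRT can fuse itself.
    module = wrapper.module
    module.weight.data.copy_(wrapper.original_weight)
    if getattr(wrapper, 'original_bias', None) is not None:
        module.bias.data.copy_(wrapper.original_bias)
```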
Please update the content about bn folding in the doc Supported Quantization Algorithms on NNI.
The content about bn folding has been added.
-    def fold_bn(self, config, **kwargs):
-        # TODO simulate folded weight
-        pass
+    def fold_bn(self, *inputs, wrapper):
Is this function QAT_Quantizer specific? Might other quantizers need a different fold_bn function?
This function should also work well for other quantizers (at least for the LSQ quantizer, I think :) ). I will make it a common utility function in the PR that enables batch normalization folding for other quantizers.
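As a rough illustration, a quantizer-agnostic folding hook with the signature from the diff above might look like the following; attribute names such as `wrapper.module` and `wrapper.bn_module` are assumptions for this sketch, not NNI's actual API:

```python
import torch
import torch.nn.functional as F

def fold_bn(*inputs, wrapper):
    # Sketch of a quantizer-agnostic folding hook.
    conv = wrapper.module          # the wrapped conv layer (assumed attribute)
    bn = wrapper.bn_module         # the BatchNorm layer that follows it (assumed attribute)
    # Run the un-folded conv and feed it through BN so that BN's running
    # statistics keep being updated during QAT training.
    conv_out = F.conv2d(inputs[0], conv.weight, conv.bias,
                        conv.stride, conv.padding, conv.dilation, conv.groups)
    _ = bn(conv_out)
    # Fold the BN parameters into the conv weight/bias; these folded tensors are
    # what the quantizer fake-quantizes, mirroring inference-time fusion.
    scale = bn.weight / torch.sqrt(bn.running_var + bn.eps)
    folded_weight = conv.weight * scale.reshape(-1, 1, 1, 1)
    bias = conv.bias if conv.bias is not None else torch.zeros_like(bn.running_mean)
    folded_bias = (bias - bn.running_mean) * scale + bn.bias
    return folded_weight, folded_bias
```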
This PR adds batch normalization folding to the QAT quantizer; the core ideas are described in #3890.
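For context, a minimal usage sketch after this change might look like the following; the import path assumes NNI 2.x, the toy model and config values are illustrative, and the keyword follows the `dummy_input` rename suggested in the review above:

```python
import torch
import torch.nn as nn
from nni.algorithms.compression.pytorch.quantization import QAT_Quantizer

# A toy model with a conv followed by batch normalization, the pattern BN folding targets.
model = nn.Sequential(nn.Conv2d(3, 16, 3), nn.BatchNorm2d(16), nn.ReLU())
config_list = [{
    'quant_types': ['weight', 'output'],
    'quant_bits': {'weight': 8, 'output': 8},
    'op_types': ['Conv2d']
}]
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

# The dummy input lets the quantizer trace the model and locate conv/bn pairs to fold.
dummy_input = torch.randn(1, 3, 32, 32)
quantizer = QAT_Quantizer(model, config_list, optimizer, dummy_input=dummy_input)
quantizer.compress()
```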