Support user-defined activation/weight quantize and preprocess. #28570

baiyfbupt · 2020-11-12T04:02:41Z

PR types

New features

PR changes

APIs

Describe

Support user-defined quantization and preprocessing methods, such as PACT.

QAT model without PACT:

-----------------------------------------------------------------------------------------------
           Layer (type)                 Input Shape          Output Shape         Param #
===============================================================================================
     FakeQuantMovingAverage-1         [[1, 1, 28, 28]]      [1, 1, 28, 28]           3
FakeChannelWiseQuantDequantAbsMax-1    [[6, 1, 3, 3]]        [6, 1, 3, 3]            6
         QuantizedConv2D-1            [[1, 1, 28, 28]]      [1, 6, 28, 28]          60
             Pool2D-1                 [[1, 6, 28, 28]]      [1, 6, 14, 14]           0
     FakeQuantMovingAverage-2         [[1, 6, 14, 14]]      [1, 6, 14, 14]           3
FakeChannelWiseQuantDequantAbsMax-2   [[16, 6, 5, 5]]       [16, 6, 5, 5]           16
         QuantizedConv2D-2            [[1, 6, 14, 14]]     [1, 16, 10, 10]         2,416
             Pool2D-2                [[1, 16, 10, 10]]      [1, 16, 5, 5]            0
     FakeQuantMovingAverage-3            [[1, 400]]            [1, 400]              3
FakeChannelWiseQuantDequantAbsMax-3     [[400, 120]]          [400, 120]            120
         QuantizedLinear-1               [[1, 400]]            [1, 120]           48,120
     FakeQuantMovingAverage-4            [[1, 120]]            [1, 120]              3
FakeChannelWiseQuantDequantAbsMax-4     [[120, 84]]           [120, 84]             84
         QuantizedLinear-2               [[1, 120]]            [1, 84]            10,164
     FakeQuantMovingAverage-5            [[1, 84]]             [1, 84]               3
FakeChannelWiseQuantDequantAbsMax-5      [[84, 10]]            [84, 10]             10
         QuantizedLinear-3               [[1, 84]]             [1, 10]              850
===============================================================================================
Total params: 61,861
Trainable params: 61,610
Non-trainable params: 251
-----------------------------------------------------------------------------------------------
Input size (MB): 0.00
Forward/backward pass size (MB): 0.55
Params size (MB): 0.24
Estimated Total Size (MB): 0.79
-----------------------------------------------------------------------------------------------

PACT QAT model:

-----------------------------------------------------------------------------------------------
           Layer (type)                 Input Shape          Output Shape         Param #
===============================================================================================
              PACT-1                  [[1, 1, 28, 28]]      [1, 1, 28, 28]           1
     FakeQuantMovingAverage-1         [[1, 1, 28, 28]]      [1, 1, 28, 28]           3
FakeChannelWiseQuantDequantAbsMax-1    [[6, 1, 3, 3]]        [6, 1, 3, 3]            6
         QuantizedConv2D-1            [[1, 1, 28, 28]]      [1, 6, 28, 28]          60
             Pool2D-1                 [[1, 6, 28, 28]]      [1, 6, 14, 14]           0
              PACT-2                  [[1, 6, 14, 14]]      [1, 6, 14, 14]           1
     FakeQuantMovingAverage-2         [[1, 6, 14, 14]]      [1, 6, 14, 14]           3
FakeChannelWiseQuantDequantAbsMax-2   [[16, 6, 5, 5]]       [16, 6, 5, 5]           16
         QuantizedConv2D-2            [[1, 6, 14, 14]]     [1, 16, 10, 10]         2,416
             Pool2D-2                [[1, 16, 10, 10]]      [1, 16, 5, 5]            0
              PACT-3                     [[1, 400]]            [1, 400]              1
     FakeQuantMovingAverage-3            [[1, 400]]            [1, 400]              3
FakeChannelWiseQuantDequantAbsMax-3     [[400, 120]]          [400, 120]            120
         QuantizedLinear-1               [[1, 400]]            [1, 120]           48,120
              PACT-4                     [[1, 120]]            [1, 120]              1
     FakeQuantMovingAverage-4            [[1, 120]]            [1, 120]              3
FakeChannelWiseQuantDequantAbsMax-4     [[120, 84]]           [120, 84]             84
         QuantizedLinear-2               [[1, 120]]            [1, 84]            10,164
              PACT-5                     [[1, 84]]             [1, 84]               1
     FakeQuantMovingAverage-5            [[1, 84]]             [1, 84]               3
FakeChannelWiseQuantDequantAbsMax-5      [[84, 10]]            [84, 10]             10
         QuantizedLinear-3               [[1, 84]]             [1, 10]              850
===============================================================================================
Total params: 61,866
Trainable params: 61,615
Non-trainable params: 251
-----------------------------------------------------------------------------------------------
Input size (MB): 0.00
Forward/backward pass size (MB): 0.57
Params size (MB): 0.24
Estimated Total Size (MB): 0.81
-----------------------------------------------------------------------------------------------
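For readers comparing the two summaries above: the PACT model adds one single-parameter `PACT-n` row ahead of each activation quantizer, which accounts for the five extra parameters (61,861 to 61,866). The plain-Python sketch below illustrates what such a preprocessing step could compute before fake quantization. It is not the code from this PR: the names `pact_clip` and `fake_quant_dequant_abs_max`, the symmetric clipping range, and the 8-bit abs-max scheme are assumptions chosen for illustration.

```python
# Illustrative sketch only; names and numeric choices are assumptions,
# not the layers implemented in this PR.

def pact_clip(values, alpha):
    """Clip each value to [-alpha, alpha]; alpha plays the role of the
    single learnable threshold shown as 'PACT-n' in the summary."""
    return [max(-alpha, min(alpha, v)) for v in values]


def fake_quant_dequant_abs_max(values, bits=8):
    """Simulate quantization: scale by the largest magnitude, round to a
    signed integer grid, then map back to floats."""
    qmax = 2 ** (bits - 1) - 1  # 127 for 8 bits
    scale = max(abs(v) for v in values)
    if scale == 0:
        return list(values)
    return [round(v / scale * qmax) * scale / qmax for v in values]


# Preprocess first, then fake-quantize, mirroring the row order in the summary.
activations = [-3.0, -0.5, 0.0, 0.25, 1.0, 10.0]
clipped = pact_clip(activations, alpha=2.0)
simulated = fake_quant_dequant_abs_max(clipped)

# Parameter bookkeeping taken from the two summaries.
base_total = 61_861
pact_layers = 5                      # PACT-1 .. PACT-5, one parameter each
pact_total = base_total + pact_layers
```

In this toy input the outlier 10.0 is clipped to 2.0 before the abs-max scale is computed, so the quantization step becomes 2/127 instead of 10/127. That is the general motivation for clipping activations before quantizing them.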

paddle-bot-old · 2020-11-12T04:02:46Z

Thanks for your contribution!
Please wait for the CI results first. See Paddle CI Manual for details.

juncaipeng

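weight_preprocess and act_preprocess will be saved into the inference model. I suggest trying to do the preprocessing by registering a forward hook on the quantized layers instead.

User-defined weight_quantize and act_quantize cannot be supported in Paddle Lite or Paddle Inference, so it is debatable whether these two parameters are needed and whether they should be exposed to users.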

python/paddle/fluid/contrib/slim/quantization/imperative/qat.py

python/paddle/fluid/contrib/slim/tests/test_imperative_qat_user_defined.py

baiyfbupt · 2020-11-17T11:02:24Z

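> weight_preprocess and act_preprocess will be saved into the inference model. I suggest trying to do the preprocessing by registering a forward hook on the quantized layers instead.
>
> User-defined weight_quantize and act_quantize cannot be supported in Paddle Lite or Paddle Inference, so it is debatable whether these two parameters are needed and whether they should be exposed to users.

Preprocessing in a forward_pre_hook cannot update learnable parameters during backpropagation, so it does not work for methods with learnable parameters, such as PACT.
The intent here is to leave an interface for quickly checking whether a new quantization method works: users can write a quantization method and validate it with a quantization-aware training experiment, and a new op is considered only if the method proves effective. Inference library support is not needed at this stage.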
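To illustrate the point about learnable parameters: a PACT-style clip has a nonzero gradient with respect to its threshold whenever an input is clipped, so the threshold can only improve if it takes part in backpropagation. The plain-Python sketch below computes that gradient by hand and applies one gradient-descent step. The names `pact_forward`, `pact_alpha_grad`, and `sgd_step_on_alpha`, the squared-error loss, and the learning rate are assumptions for illustration, not code from this PR.

```python
# Illustrative sketch only; hand-written gradients stand in for autograd.

def pact_forward(x, alpha):
    """Clip x to [-alpha, alpha]."""
    return max(-alpha, min(alpha, x))


def pact_alpha_grad(x, alpha):
    """Derivative of pact_forward with respect to alpha:
    +1 where x is clipped from above, -1 where clipped from below, else 0."""
    if x > alpha:
        return 1.0
    if x < -alpha:
        return -1.0
    return 0.0


def sgd_step_on_alpha(inputs, targets, alpha, lr=0.1):
    """One gradient-descent step on alpha for a mean squared error loss."""
    grad = 0.0
    for x, t in zip(inputs, targets):
        y = pact_forward(x, alpha)
        grad += 2.0 * (y - t) * pact_alpha_grad(x, alpha)  # chain rule
    grad /= len(inputs)
    return alpha - lr * grad


inputs = [-4.0, 0.5, 3.0]
targets = [-4.0, 0.5, 3.0]  # identity targets: any clipping counts as error
alpha = 2.0
new_alpha = sgd_step_on_alpha(inputs, targets, alpha)
```

Here two of the three inputs are clipped, so the step moves `alpha` from 2.0 to roughly 2.2, widening the clipping range. If the preprocessing ran outside the training graph, the threshold would stay at its initial value.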

juncaipeng

LGTM

baiyfbupt added 2 commits November 12, 2020 11:08

support user-defined quant and preprocess

ce748b0

code clean

55266ec

baiyfbupt changed the title ~~Dy qat~~ Support user-defined activation/weight quantize and preprocess. Nov 12, 2020

code clean

a813288

PaddlePaddle locked and limited conversation to collaborators Nov 13, 2020

PaddlePaddle unlocked this conversation Nov 13, 2020

juncaipeng reviewed Nov 17, 2020

python/paddle/fluid/contrib/slim/quantization/imperative/qat.py (outdated)

python/paddle/fluid/contrib/slim/tests/test_imperative_qat_user_defined.py

baiyfbupt added 2 commits November 17, 2020 17:07

code clean

cf2741e

code clean

609026b

juncaipeng approved these changes Nov 18, 2020

baiyfbupt merged commit 5050e76 into PaddlePaddle:develop Nov 18, 2020
