support fuse layers for ptq #35015
Conversation
Thanks for your contribution!
quant_model = ptq.quantize(model, fuse=True, fuse_list=f_l)
quant_h = ptq.quantize(model_h, fuse=True, fuse_list=f_l)
for name, layer in quant_model.named_sublayers():
    print(name, layer)
We need to check that quant_model matches expectations after the fuse, e.g. check the layer types, instead of only printing them.
Added an assert to check whether any BN layers remain.
fuse(bool): Whether fuse layers.
    Default: False.
fuse_list(list): The layers to fuse.
    Default: None.
- What happens if fuse=True is set but fuse_list is None?
- If fuse_list is set, what is its expected format?
The docstring here is incomplete.
done
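For reference, the fuse_list format used elsewhere in this PR is a list of groups, where each group lists sublayer names (as produced by named_sublayers) in fusion order, e.g. a conv followed by its bn. A minimal sketch of validating that assumed format (the helper name is illustrative, not part of the PR):

```python
# Assumed format: fuse_list is a list of groups; each group lists
# sublayer names in the order they should be fused.
fuse_list = [['features.0', 'features.1'], ['features.4', 'features.5']]

def validate_fuse_list(fuse_list):
    """Basic sanity checks on the assumed fuse_list structure."""
    assert isinstance(fuse_list, list)
    for group in fuse_list:
        # Each group must name at least two layers to fuse together.
        assert isinstance(group, list) and len(group) >= 2
        assert all(isinstance(name, str) for name in group)
    return True

print(validate_fuse_list(fuse_list))  # True
```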
""" | ||
Add quant config and hook to the target layer. | ||
|
||
Args: | ||
model(paddle.nn.Layer): The model to be quantized. | ||
inplace(bool): Whether apply quantization to the input model. | ||
Default: False. | ||
fuse(bool): Whether fuse layers. |
Whether to fuse
done
f_l = [['features.0', 'features.1'], ['features.4', 'features.5']]
quant_model = self.ptq.quantize(model, fuse=True, fuse_list=f_l)
for name, layer in quant_model.named_sublayers():
    print(name, layer)
Same as above.
done
    (after_acc_top1, self.eval_acc_top1))
self.assertTrue(
    infer_acc_top1 >= after_acc_top1,
    msg='The acc is lower after converting model.')
Please add a comment explaining why after_acc_top1, eval_acc_top1, and infer_acc_top1 satisfy this relationship, and how they relate to before_acc_top1.
done. before and after compare accuracy before and after convert; eval_acc_top1 is 0.95 and is used to check that the accuracy is correct; infer is the accuracy of the model after being saved and loaded back.
def fuse_layers(model, layers_to_fuse, inplace=False):
    '''fuse layers in layers_to_fuse'''
Please add parameter descriptions.
done
def _fuse_layers(model, layers_list):
    '''fuse all the layers in layers_list'''
    lay_list = []
Rename it to layer_list?
done
setattr(parent_layer, sub_name, new_layers[i])


def fuse_func(lay_list):
Please keep the function naming convention consistent; if this is a module-internal function, prefix it with an underscore.
done
def fuse_func(lay_list):
    '''choose the fuser method and fuse layers'''
    types = tuple(type(m) for m in lay_list)
    fuser_method = layer_list_to_fuse_method.get(types, None)
Strictly speaking, this is not a layer-list-to-fuse-method mapping; it is types_to_fusion_method.
Changed.
for i in range(1, len(lay_list)):
    identity = Identity()
    identity.training = lay_list[0].training
    new_layers[i] = identity
Why add an Identity here? Can't the BN layer simply be removed?
This makes it easy to move the BN layer's post hooks onto the Identity.
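The reasoning above (keeping an Identity so the removed BN's post hooks still have a layer to attach to) can be illustrated with a minimal pure-Python sketch; the hook mechanics here only mimic Paddle's forward post hooks and are not the real API:

```python
class Identity:
    """Pass-through layer: replaces a fused-away BN so that any post hooks
    previously registered on the BN still have a layer to live on."""
    def __init__(self):
        self._post_hooks = []

    def register_post_hook(self, hook):
        self._post_hooks.append(hook)

    def __call__(self, x):
        out = x  # identity: output equals input
        for hook in self._post_hooks:
            hook(self, x, out)
        return out

calls = []
identity = Identity()
# e.g. a quantization observer hook moved over from the removed BN layer.
identity.register_post_hook(lambda layer, inp, out: calls.append(out))
assert identity(3.14) == 3.14   # forward is a no-op
assert calls == [3.14]          # but the hook still fires
```

If the BN were deleted outright instead, its registered hooks would have no layer to be re-attached to.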
types = tuple(type(m) for m in lay_list)
fuser_method = layer_list_to_fuse_method.get(types, None)
new_layers = [None] * len(lay_list)
fused = fuser_method(*lay_list)
Rename fused to fused_layer?
done
def fuse_func(lay_list):
    '''choose the fuser method and fuse layers'''
    types = tuple(type(m) for m in lay_list)
    fuser_method = layer_list_to_fuse_method.get(types, None)
Please reconsider whether the name "fuser_method" is appropriate.
Changed to fusion_method.
fused_conv.weight.set_value(fused_weight)
if fused_conv.bias is None:
    fused_conv.bias = paddle.create_parameter(
        shape=[fused_conv._out_channels], is_bias=True, dtype='float32')
Set dtype to bn.bias.dtype?
done
fused_linear.weight.set_value(fused_weight)
if fused_linear.bias is None:
    fused_linear.bias = paddle.create_parameter(
        shape=[fused_linear.weight.shape[1]], is_bias=True, dtype='float32')
Same dtype comment as above.
done
_logger = get_logger(
    __name__, logging.INFO, fmt='%(asctime)s-%(levelname)s: %(message)s')


class TestFuseLinearBn(unittest.TestCase):
    """
Please add a docstring.
done
LGTM
PR types
New features
PR changes
Others
Describe
Adds layer fusion for post-training quantization (PTQ). A convolution or fully-connected layer and the BN layer that follows it can be fused into a single convolution layer; the computation is equivalent at inference time. During PTQ, users can specify which layers to fuse. Layer fusion further accelerates inference and adapts the model to hardware that does not support BN.
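As a sanity check of the inference-time equivalence claimed above, here is a small NumPy sketch that folds an inference-mode BN into a linear layer's weight and bias and verifies the outputs match; all names and shapes are illustrative, not taken from the PR:

```python
import numpy as np

rng = np.random.default_rng(0)
in_f, out_f, batch = 4, 3, 5

# A linear layer followed by an inference-mode BN over its outputs.
W = rng.standard_normal((in_f, out_f))
b = rng.standard_normal(out_f)
gamma = rng.standard_normal(out_f)      # BN scale
beta = rng.standard_normal(out_f)       # BN shift
mean = rng.standard_normal(out_f)       # BN running mean
var = rng.random(out_f) + 0.5           # BN running variance
eps = 1e-5

x = rng.standard_normal((batch, in_f))
# Unfused: y = BN(linear(x))
y_ref = gamma * ((x @ W + b) - mean) / np.sqrt(var + eps) + beta

# Folded: scale each output column of W and adjust the bias.
scale = gamma / np.sqrt(var + eps)
W_fused = W * scale                     # broadcasts over the out_features axis
b_fused = (b - mean) * scale + beta
y_fused = x @ W_fused + b_fused

assert np.allclose(y_ref, y_fused)      # fused linear matches linear + BN
```

The conv+BN case follows the same algebra, with the scale applied per output channel of the convolution kernel.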