Add fusion_gru and multi_gru to PTQ (Post-Training Quantization) #33749

wozna · 2021-06-23T14:35:17Z

PR types

New features

PR changes

OPs

Describe

This PR:

adds support for fusion_gru and multi_gru to Post-Training Quantization process
adds gathering scales based on the max absolute value per output channel for fusion_gru/multi_gru operators
adds GRU model test for PTQ for with multi_gru_fuse_pass and without it
fix in the requantize operator so that there is no hard-coded data format but that it is set based on input

paddle-bot-old · 2021-06-23T14:35:21Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

paddle-bot-old · 2021-07-09T02:35:28Z

Sorry to inform you that 3b67f94's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.

wozna · 2021-07-20T08:00:05Z

I have a problem that in PR-CI-Coverage for file paddle/fluid/inference/api/mkldnn_quantizer.cc coverage is 0% even if all UT added in this PR run this code multiple times. I added the same changes in PR #33912 where I added a few prints and made the test not pass, then everything is written in the log and as you can see in the logs in PR-CI-Coverage the tests go through these parts of the code. I still don't know why only this file has line coverage equal to 0.

lidanqing-intel

LGTM

wozna · 2021-08-27T08:36:28Z

The recent changes are related to cmake files. It turned out that mkldnn_quantizer.cc was duplicated. It was added to the analysis_predictor library and then it was separately added to the paddle_inference_shared library where the analysis_predictor library was added as a dependency. That's why I left this file only in analysis_predictor lib.

wozna added int8 Intel labels Jun 23, 2021

wozna mentioned this pull request Jun 29, 2021

Add QuantizeFusionLSTM pass and collect lstm scales #33797

Closed

wozna force-pushed the gru_ptq branch from 1f18236 to 6be5684 Compare June 30, 2021 15:08

wozna force-pushed the gru_ptq branch from 3b67f94 to 5492219 Compare July 12, 2021 15:10

wozna force-pushed the gru_ptq branch from 5492219 to 4e81863 Compare August 9, 2021 12:02

lidanqing-intel added this to the v2.2 milestone Aug 25, 2021

wozna force-pushed the gru_ptq branch from 4e81863 to 6e4c175 Compare August 25, 2021 12:11

wozna added 9 commits August 25, 2021 14:24

Add calculation for gru op

154241d

Correct the types

ce4bd5d

Remove mkldnn only

e60e0ac

Correct mkldnn ifdef

24fdf47

Remove mkldnn ifdef

6c14a76

Separate mkldnn quantizer test

50141de

Correct Windows test

54138e6

Check different cmake fix

6e4c175

Revert cmake change

fcc67e7

wozna added 2 commits August 26, 2021 13:15

Cmake change 2

111ac3f

Cmake change 3

10500e1

lidanqing-intel self-requested a review August 27, 2021 07:44

lidanqing-intel approved these changes Aug 27, 2021

View reviewed changes

jczaja merged commit 7debae3 into PaddlePaddle:develop Aug 27, 2021

wozna deleted the gru_ptq branch February 24, 2023 16:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add fusion_gru and multi_gru to PTQ (Post-Training Quantization) #33749

Add fusion_gru and multi_gru to PTQ (Post-Training Quantization) #33749

wozna commented Jun 23, 2021

paddle-bot-old bot commented Jun 23, 2021

paddle-bot-old bot commented Jul 9, 2021

wozna commented Jul 20, 2021 •

edited

Loading

lidanqing-intel left a comment

wozna commented Aug 27, 2021

Add fusion_gru and multi_gru to PTQ (Post-Training Quantization) #33749

Add fusion_gru and multi_gru to PTQ (Post-Training Quantization) #33749

Conversation

wozna commented Jun 23, 2021

PR types

PR changes

Describe

paddle-bot-old bot commented Jun 23, 2021

paddle-bot-old bot commented Jul 9, 2021

wozna commented Jul 20, 2021 • edited Loading

lidanqing-intel left a comment

Choose a reason for hiding this comment

wozna commented Aug 27, 2021

wozna commented Jul 20, 2021 •

edited

Loading