New format quant model support for MKLDNN #45416

yeliang2258 · 2022-08-25T07:59:49Z

PR types

New features

PR changes

Others

Describe

New format quant model support for MKLDNN

CPU：Intel(R) Xeon(R) Gold 6271C CPU @ 2.60GHz
Thread nums：1
Test model：https://github.com/PaddlePaddle/PaddleSlim/tree/develop/example/auto_compression/pytorch_yolo_series

yolov5

model	Latency	mAP
FP32	212.75 ms	37.4
INT8	149.77 ms	36.8

yolov6

model	Latency	mAP
FP32	355.43 ms	42.4
INT8	126.86 ms	41.1

yolov7

model	Latency	mAP
FP32	999.75 ms	51.1
INT8	545.14 ms	50.9

…t_quant_model_dev

jiangjiajun

这里的代码应该需要加上单测，解决代码测试覆盖率的问题

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.cc

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.h

yaomichael · 2022-08-29T06:50:36Z

@wozna @jczaja can you help review this PR?

…t_quant_model_dev

wozna · 2022-08-30T12:29:57Z

@yaomichael @yeliang2258 I can see there are only changes in C++ passes. Are you planning to add this ONNX format to python/paddle/fluid/contrib/slim/quantization/quant2_int8_mkldnn_pass.py that serializes models?

yeliang2258 · 2022-08-30T12:57:21Z

@wozna According to the latest version, we only need to enable config.enable_mkldnn_int8(), no longer need to convert through save_quant_model.py script. So we no longer need to add codes to python/paddle/fluid/contrib/slim/quantization/quant2_int8_mkldnn_pass.py.

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.cc

…t_quant_model_dev

wozna

Thank you for adapting to the review. LGTM

jiangjiajun

LGTM

paddle/fluid/inference/api/paddle_analysis_config.h

python/paddle/fluid/tests/unittests/mkldnn/test_onnx_format_quantization_mobilenetv1.py

sfraczek

LGTM!

wozna

LGTM

yeliang2258 · 2022-09-05T03:08:23Z

PaddlePaddle/Paddle-Inference-Demo#357

paddle/fluid/inference/analysis/passes/ir_analysis_pass.cc

qingqing01

Approve for API

qingqing01

Approve for API

* support onnx format quantized model * update code * add test * add test * fix * fix test * fix cmake * update code * change scale file path to calibration file path * update code * update code * fix build bug * fix build bugs * fix * fix

yeliang2258 added 2 commits August 25, 2022 07:51

support onnx format quantized model

c9f0a45

Merge remote-tracking branch 'upstream/develop' into mkldnn_new_forma…

049af1d

…t_quant_model_dev

yeliang2258 requested review from jiangjiajun and yaomichael August 25, 2022 08:01

jiangjiajun reviewed Aug 25, 2022

View reviewed changes

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.cc Outdated Show resolved Hide resolved

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.cc Outdated Show resolved Hide resolved

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.h Outdated Show resolved Hide resolved

yeliang2258 added 3 commits August 25, 2022 10:03

update code

a31f713

add test

b81c97f

add test

99a6822

yeliang2258 requested a review from jiangjiajun August 26, 2022 06:10

yeliang2258 added 3 commits August 26, 2022 06:42

fix

a1467e9

fix test

7fbedf4

fix cmake

f27907c

yghstill previously approved these changes Aug 29, 2022

View reviewed changes

Merge remote-tracking branch 'upstream/develop' into mkldnn_new_forma…

d84f969

…t_quant_model_dev

jczaja requested a review from sfraczek August 30, 2022 08:07

wozna reviewed Aug 31, 2022

View reviewed changes

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.cc Outdated Show resolved Hide resolved

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.cc Show resolved Hide resolved

update code

6d85e3b

yeliang2258 dismissed yghstill’s stale review via 6d85e3b September 1, 2022 08:00

Merge remote-tracking branch 'upstream/develop' into mkldnn_new_forma…

6ea79ed

…t_quant_model_dev

yeliang2258 requested a review from wozna September 1, 2022 08:03

wozna previously approved these changes Sep 2, 2022

View reviewed changes

jiangjiajun previously approved these changes Sep 2, 2022

View reviewed changes

wozna mentioned this pull request Sep 2, 2022

[Feature] Adaptation of the new quantization method for mkldnn. #37422

Closed

yghstill reviewed Sep 2, 2022

View reviewed changes

paddle/fluid/inference/api/paddle_analysis_config.h Outdated Show resolved Hide resolved

change scale file path to calibration file path

79394a3

yeliang2258 dismissed stale reviews from jiangjiajun and wozna via 79394a3 September 2, 2022 10:01

$sfraczek$

sfraczek reviewed Sep 2, 2022

View reviewed changes

python/paddle/fluid/tests/unittests/mkldnn/test_onnx_format_quantization_mobilenetv1.py Outdated Show resolved Hide resolved

update code

8924f9b

yeliang2258 dismissed stale reviews from wozna and yghstill via 8924f9b September 2, 2022 13:16

yeliang2258 requested a review from sfraczek September 2, 2022 13:17

$sfraczek$

sfraczek previously approved these changes Sep 2, 2022

View reviewed changes

wozna previously approved these changes Sep 2, 2022

View reviewed changes

yeliang2258 requested a review from yghstill September 5, 2022 01:53

leiqing1 previously approved these changes Sep 5, 2022

View reviewed changes

qingqing01 reviewed Sep 5, 2022

View reviewed changes

paddle/fluid/inference/analysis/passes/ir_analysis_pass.cc Show resolved Hide resolved

paddle/fluid/inference/analysis/passes/ir_analysis_pass.cc Outdated Show resolved Hide resolved

update code

76ebfd9

yaomichael added Intel int8 labels Sep 5, 2022

yeliang2258 dismissed stale reviews from leiqing1, wozna, and sfraczek via 76ebfd9 September 5, 2022 03:44

yeliang2258 requested review from qingqing01 and XieYunshen September 5, 2022 03:44

qingqing01 previously approved these changes Sep 5, 2022

View reviewed changes

fix build bug

64de33a

yeliang2258 dismissed qingqing01’s stale review via 64de33a September 5, 2022 06:13

yeliang2258 added 3 commits September 5, 2022 06:31

fix build bugs

3726d55

fix

1d1255f

fix

3c29573

XieYunshen approved these changes Sep 5, 2022

View reviewed changes

leiqing1 approved these changes Sep 5, 2022

View reviewed changes

qingqing01 approved these changes Sep 5, 2022

View reviewed changes

jiangjiajun merged commit 4e4f458 into PaddlePaddle:develop Sep 5, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New format quant model support for MKLDNN #45416

New format quant model support for MKLDNN #45416

yeliang2258 commented Aug 25, 2022 •

edited

Loading

jiangjiajun left a comment

yaomichael commented Aug 29, 2022

wozna commented Aug 30, 2022

yeliang2258 commented Aug 30, 2022

wozna left a comment

jiangjiajun left a comment

$@sfraczek$ sfraczek left a comment

wozna left a comment

yeliang2258 commented Sep 5, 2022

qingqing01 left a comment

qingqing01 left a comment

New format quant model support for MKLDNN #45416

New format quant model support for MKLDNN #45416

Conversation

yeliang2258 commented Aug 25, 2022 • edited Loading

PR types

PR changes

Describe

jiangjiajun left a comment

Choose a reason for hiding this comment

yaomichael commented Aug 29, 2022

wozna commented Aug 30, 2022

yeliang2258 commented Aug 30, 2022

wozna left a comment

Choose a reason for hiding this comment

jiangjiajun left a comment

Choose a reason for hiding this comment

sfraczek left a comment

Choose a reason for hiding this comment

wozna left a comment

Choose a reason for hiding this comment

yeliang2258 commented Sep 5, 2022

qingqing01 left a comment

Choose a reason for hiding this comment

qingqing01 left a comment

Choose a reason for hiding this comment

yeliang2258 commented Aug 25, 2022 •

edited

Loading

$@sfraczek$ sfraczek left a comment