[Bug fix] Do not quantize weights Y when matmul X and Y both other ops outputs #43297

lidanqing-intel · 2022-06-07T14:21:48Z

PR types

Bug fixes

PR changes

Others

Describe

Fix bug for ernie3.0 enable_mkldnn_int8, that do not quantize weights Y when matmul X and Y both other ops outputs

…e the Y.

paddle-bot-old · 2022-06-07T14:21:53Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

lidanqing-intel · 2022-06-07T14:23:53Z

This change is aligned with python save_quant_model.py (save_quant_model.py could quantize Ernie3.0 but enable_mkldnn_int8 failed and this PR fixed it). Now Ernie3.0 enable_mkldnn_int8 could work successfully.

lidanqing-intel · 2022-06-07T14:24:36Z

@sfraczek @wozna Please review this PR, Thanks.

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.cc

sfraczek

LGTM

lidanqing-intel · 2022-06-09T01:43:26Z

@jczaja Hi could you please review this PR, we agreed to have at least two people to review the PR.

jczaja

LGTM

paddle-bot-old · 2022-06-09T09:19:15Z

你的PR已合入Paddle库，请关注后续测试结果。
Your PR has been merged into the repository. An official integration test will be conducted later. Stay tuned.

…s outputs (#43297) * fix some matmul that X and Y both other ops outputs, do not dequantize the Y. * fix CI format * fix according to review

…s outputs (PaddlePaddle#43297) * fix some matmul that X and Y both other ops outputs, do not dequantize the Y. * fix CI format * fix according to review

* Correct elementwise quantization (#43693) * [Bug fix] Do not quantize weights Y when matmul X and Y both other ops outputs (#43297) * fix some matmul that X and Y both other ops outputs, do not dequantize the Y. * fix CI format * fix according to review Co-authored-by: joanna.wozna.intel <joanna.wozna@intel.com>

fix some matmul that X and Y both other ops outputs, do not dequantiz…

2e586d4

…e the Y.

paddle-bot-old bot added contributor External developers status: proposed labels Jun 7, 2022

lidanqing-intel requested review from sfraczek and wozna June 7, 2022 14:22

lidanqing-intel changed the title ~~[Bug fix] Do not quantize weights when matmul X and Y both other ops outputs~~ [Bug fix] Do not quantize weights Y when matmul X and Y both other ops outputs Jun 7, 2022

lidanqing-intel mentioned this pull request Jun 7, 2022

Ernie enable_mkldnn_int8 crash on matmul #43238

Closed

lidanqing-intel added the Intel label Jun 7, 2022

paddle-bot-old bot removed the status: proposed label Jun 7, 2022

fix CI format

6e72ec7

lidanqing-intel force-pushed the develop-fix-ernie3.0 branch from 10bf1c2 to 6e72ec7 Compare June 8, 2022 05:56

fix according to review

68c79b3

$sfraczek$

sfraczek reviewed Jun 8, 2022

View reviewed changes

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.cc Outdated Show resolved Hide resolved

paddle/fluid/framework/ir/mkldnn/quant_dequant_mkldnn_pass.cc Outdated Show resolved Hide resolved

$sfraczek$

sfraczek approved these changes Jun 8, 2022

View reviewed changes

lidanqing-intel requested a review from jczaja June 8, 2022 20:41

jczaja approved these changes Jun 9, 2022

View reviewed changes

lidanqing-intel merged commit 06d999f into PaddlePaddle:develop Jun 9, 2022

paddle-bot-old bot added the status: accepted label Jun 9, 2022

lidanqing-intel mentioned this pull request Jun 9, 2022

Add ernie-3.0 mkldnn fp32 and int8 support PaddlePaddle/PaddleNLP#2468

Merged

lidanqing-intel deleted the develop-fix-ernie3.0 branch June 9, 2022 17:12

paddle-bot-old bot removed the contributor External developers label Oct 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug fix] Do not quantize weights Y when matmul X and Y both other ops outputs #43297

[Bug fix] Do not quantize weights Y when matmul X and Y both other ops outputs #43297

lidanqing-intel commented Jun 7, 2022

paddle-bot-old bot commented Jun 7, 2022

lidanqing-intel commented Jun 7, 2022

lidanqing-intel commented Jun 7, 2022 •

edited

Loading

$@sfraczek$ sfraczek left a comment

lidanqing-intel commented Jun 9, 2022

jczaja left a comment

paddle-bot-old bot commented Jun 9, 2022

[Bug fix] Do not quantize weights Y when matmul X and Y both other ops outputs #43297

[Bug fix] Do not quantize weights Y when matmul X and Y both other ops outputs #43297

Conversation

lidanqing-intel commented Jun 7, 2022

PR types

PR changes

Describe

paddle-bot-old bot commented Jun 7, 2022

lidanqing-intel commented Jun 7, 2022

lidanqing-intel commented Jun 7, 2022 • edited Loading

sfraczek left a comment

Choose a reason for hiding this comment

lidanqing-intel commented Jun 9, 2022

jczaja left a comment

Choose a reason for hiding this comment

paddle-bot-old bot commented Jun 9, 2022

lidanqing-intel commented Jun 7, 2022 •

edited

Loading

$@sfraczek$ sfraczek left a comment