dequantize matmul and matmul_v2 Y weights in quant2_int8 #37618
Conversation
Thanks for your contribution!
@@ -336,6 +338,9 @@ def _is_int8_weights(op_node, weight_name):
             self._dequantize_op_weights(graph, op, "Filter", "Output")
         elif op.name() in self._mul_ops and _is_int8_weights(op, "Y"):
             self._dequantize_op_weights(graph, op, "Y", "Out")
+        elif op.name() in self._matmul_ops and _is_int8_weights(op, "Y"):
It looks like you can combine this part into one and just check `op.name() in [self._mul_ops, self._matmul_ops]`. What do you think?
I can apply your idea, but I will concatenate those lists before the for loop.
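
A minimal sketch of that idea, based on the diff above; the surrounding loop, `self._conv_ops`, and the use of `graph.all_op_nodes()` are assumptions about the pass's context, not code from this PR:

```python
# Sketch only: concatenate the two op-name lists once, before the
# loop, so a single elif branch covers both mul and matmul ops.
mul_and_matmul_ops = self._mul_ops + self._matmul_ops

for op in graph.all_op_nodes():
    # Conv weights live in the "Filter" input; mul/matmul weights in "Y".
    if op.name() in self._conv_ops and _is_int8_weights(op, "Filter"):
        self._dequantize_op_weights(graph, op, "Filter", "Output")
    elif op.name() in mul_and_matmul_ops and _is_int8_weights(op, "Y"):
        self._dequantize_op_weights(graph, op, "Y", "Out")
```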
I wonder if we even need a `matmul_ops` variable separate from `mul_ops`.
You're right, it looks like we are doing exactly the same thing for both, so as far as I'm concerned you can combine them.
@baoachun We have a big concern about matmul_v2 and the related passes. Let's discuss it when you have time.
I'm sorry, I don't quite understand what you mean. What is the problem now?
LGTM
LGTM
LGTM
…e#37618)

* dequantize matmul and matmul_v2 Y weights in qat2_int8
* review fix
* split conv and mul tests, add matmul test
* fixup
* fix ci build
* remove unused variables
* formatting fix
* remove extra newline at end of file
PR types
Bug fixes
PR changes
Others
Describe
Fix for the problem reported in issue #36962: matmul_v2 has its weights quantized, but they are not dequantized during conversion of the QAT model to an FP32 model, so the FP32 matmul_v2 is later run with the still-quantized weights.
I also split the conv2d and matmul unit tests for quant2_int8_mkldnn_pass into separate tests.
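
For context, here is a minimal sketch of what the missing dequantization step does; the helper name and the single per-tensor scale are illustrative assumptions, not the actual `quant2_int8_mkldnn_pass` code:

```python
import numpy as np

def dequantize_weights(weights_int8: np.ndarray, scale: float) -> np.ndarray:
    """Convert an int8-quantized weight tensor back to float32.

    Illustrative only: assumes one per-tensor scale such that
    w_fp32 ~= w_int8 * scale, which is what an FP32 matmul_v2
    kernel expects instead of the raw int8 values.
    """
    # Cast first so the multiply happens in float32, not int8.
    return weights_int8.astype(np.float32) * scale

# Example: a 2x2 int8 weight matrix with a hypothetical scale of 0.02.
w_int8 = np.array([[-128, 64], [32, 127]], dtype=np.int8)
w_fp32 = dequantize_weights(w_int8, scale=0.02)
print(w_fp32)  # [[-2.56  1.28]
               #  [ 0.64  2.54]]
```

Without this step, the FP32 kernel would multiply activations by the raw int8 values, producing results off by the quantization scale, which is the behavior described in #36962.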