Added caching of scales for bias in conv2d int8 #36980
Conversation
Thanks for your contribution!
const auto& scale_in_data = ctx.Attr<float>("Scale_in");

bool is_multi_channel = scale_weights_data.size() > 1;
int mask_reorder = is_multi_channel ? 1 << 0 : 1;
What is the reason for bit-shifting "1" by zero bits? Isn't "1 << 0" equal to 1? As it stands the ternary operation is not needed, since no matter what value "is_multi_channel" has, the result will be 1.
@wozna Could you please take this question?
From what I understand, "mask_reorder" is used as a set of flags, but even with that knowledge it isn't easy to understand at first glance.
I discussed this with @wozna and there is probably some problem here that requires separate investigation (testing accuracy etc.). It will not be solved in this PR.
int mask_reorder = is_multi_channel ? 1 << 0 : 1;
const int count =
    is_multi_channel
        ? (groups > 1 ? (weights_tz)[1] * (weights_tz)[0] : (weights_tz)[0])
Nested ternary operators produce code that is not easy to understand at a glance. Could you please change this to a normal if statement to avoid nesting ternary ops?
ok. Will do that
done
bias_scale_tuple =
    std::make_shared<std::tuple<float, std::vector<float>>>(std::make_tuple(
        static_cast<float>(mask_reorder), std::vector<float>(count)));
for (int i = 0; i < count; i++) {
Some time ago I would have complained about "i++", but I have since read that the compiler will optimize it anyway (this knowledge comes from the book about modern CPUs that you recommended, so thank you!).
        ? (groups > 1 ? (weights_tz)[1] * (weights_tz)[0] : (weights_tz)[0])
        : 1;

int count;
Maybe if you initialize with:
int count = 1;
then
else {
  count = 1;
}
won't be necessary. What is your opinion?
Ok, I incorporated your suggestion while keeping the if-else blocks as requested by @jakpiase.
LGTM
LGTM
LGTM
PR types
Performance optimization
PR changes
OPs
Describe
Currently, scaling of the bias data for conv2d int8 happens in every iteration. This is not needed: the scaling can be done once and then stored. This PR caches the once-computed scaled bias.
Perf improvement: Mobilenet_v1 int8, ~2% improvement on the whole model.
processor: Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz