
More precise mkldnn kernel rules in GetExpectedKernelType #29840

Merged

Conversation


@arlesniak arlesniak commented Dec 22, 2020

PR types

Others

PR changes

Others

Describe

More precise mkldnn kernel choice in GetExpectedKernelType, based also on the kernel's registered data type.

@paddle-bot-old

Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@jczaja jczaja added the Intel label Dec 22, 2020
@arlesniak arlesniak changed the title More precise mkldnn kernel choice in GetExpectedKernelType More precise mkldnn kernel rules in GetExpectedKernelType Dec 23, 2020
Two review threads on paddle/fluid/framework/operator.cc (outdated, resolved)
@arlesniak arlesniak force-pushed the arlesniak/more_precise_kernel_choice branch from 5c15712 to 235afda Compare January 11, 2021 20:50
@arlesniak arlesniak force-pushed the arlesniak/more_precise_kernel_choice branch from a912b3e to 3889476 Compare January 14, 2021 08:59
@arlesniak
Contributor Author

arlesniak commented Jan 14, 2021

@luotao1 PR-CI-OP-benchmark fails with this PR, although it contains no performance-related changes. I restarted it several times, but each run produces a different list of errors, e.g.:
previous:

```
2021-01-13 02:00:19 [check_op_benchmark_result.py:150] [ERROR] Check speed result with case "minimum_2(backward)" failed.
2021-01-13 02:00:19 [check_op_benchmark_result.py:154] [ERROR] Check accuracy result with case "multiply_7(backward)" failed.
2021-01-13 02:00:19 [check_op_benchmark_result.py:154] [ERROR] Check accuracy result with case "multiply_2(backward)" failed.
2021-01-13 02:00:19 [check_op_benchmark_result.py:154] [ERROR] Check accuracy result with case "divide_4(backward)" failed.
```

latest:

```
2021-01-14 22:19:03 [check_op_benchmark_result.py:150] [ERROR] Check speed result with case "subtract_1(forward)" failed.
2021-01-14 22:19:03 [check_op_benchmark_result.py:150] [ERROR] Check speed result with case "subtract_7(forward)" failed.
2021-01-14 22:19:03 [check_op_benchmark_result.py:150] [ERROR] Check speed result with case "pow_4(forward)" failed.
2021-01-14 22:19:03 [check_op_benchmark_result.py:150] [ERROR] Check speed result with case "pow_2(backward)" failed.
2021-01-14 22:19:03 [check_op_benchmark_result.py:154] [ERROR] Check accuracy result with case "multiply_7(backward)" failed.
2021-01-14 22:19:03 [check_op_benchmark_result.py:154] [ERROR] Check accuracy result with case "multiply_4(backward)" failed.
2021-01-14 22:19:03 [check_op_benchmark_result.py:154] [ERROR] Check accuracy result with case "multiply_2(backward)" failed.
```

AFAIK the log from the benchmark machine contains checks for GPU ops only, and there is no oneDNN verbose output in it, so it looks like the oneDNN kernels are not being run at all; only their behavior could be correlated with this PR.

Could you advise on that, please?

@arlesniak
Contributor Author

After 10 restarts CI passed :)
@luotao1 Could you start your review, please?

@arlesniak
Contributor Author

@luotao1 could you please start your review?

@arlesniak
Contributor Author

@luotao1 Could you please start your review? PR-CI-Approval will not pass because many files were modified, but most of them contain only a single-line change.


@wojtuss wojtuss left a comment


In a class derived from OperatorWithKernel you do not need to prefix calls to the base class' public methods with OperatorWithKernel:: or this-> (this applies to CanMKLDNNBeUsed(...) and IndicateVarDataType(...) in particular). Although there are plenty of such calls in the original code, I would stick to the cleaner approach and skip the redundant prefixes.
If you do not agree, that's fine. LGTM then :-)

```diff
@@ -93,6 +93,7 @@ framework::OpKernelType GetKernelType(const framework::ExecutionContext& ctx,
                                       const std::string& name) {
   framework::LibraryType library{framework::LibraryType::kPlain};
   framework::DataLayout layout = framework::DataLayout::kAnyLayout;
+  auto data_type = oper.IndicateVarDataType(ctx, name);
```
@luotao1
Contributor


> AFAIK the resulting log from the benchmark machine has checks about GPU ops, in the log there is no oneDNN verbose info, so it looks that oneDNN kernels are not run, which would eventually be correlated with the PR.
> Could you advise on that, please?

Although op-benchmark-ci checks GPU ops, does line 96 add any extra time cost? How about moving it into line 109?

```cpp
if (library == framework::LibraryType::kPlain && it != oper.Attrs().end()) {
  auto data_type = oper.IndicateVarDataType(ctx, name);
  if (oper.CanMKLDNNBeUsed(ctx, data_type)) {
    xxxx
  }
}
```

@arlesniak
Contributor Author


@luotao1 Thank you for the comment. The data_type variable is also used in line 119:

```cpp
return framework::OpKernelType(data_type, ctx.GetPlace(), layout, library);
```

So it is needed, and thus computed, at the end of the function regardless of whether the condition you mentioned is true (that is, whether or not mkldnn is used).

As implemented in the PR, the variable's value is computed only once per function, exactly as it was before my changes, so there is no additional time cost.
The same applies to every occurrence in the other op files.

@arlesniak
Contributor Author


@wojtuss Thank you for the comment. Having babysat PR-CI-OP-benchmark for more than a week on the same code, I would prefer not to refactor the code in this PR, if that is OK with you; I do respect your opinion.

@luotao1
Contributor


> In the way it's implemented in PR, the variable value is calculated only once per function as it was prior to my changes, without additional time cost.

Got it.

> I prefer to not refactor the code in the PR. Of course if it's OK with you because I respect you opinion.

@wojtuss What's your opinion?

@wojtuss


Totally understandable.

@luotao1
Contributor

luotao1 commented Jan 25, 2021

@Avin0323 @GaoWei8 Please look into making op-benchmark-ci more stable. The same commit id has produced two different results.
(screenshot of the two differing CI results)

@GaoWei8
Contributor

GaoWei8 commented Jan 25, 2021

> @Avin0323 @GaoWei8 Please look into making op-benchmark-ci more stable. The same commit id has produced two different results.

The accuracy issue in the multiply op was exposed before, and the threshold has since been adjusted, so those errors will no longer be reported.
As for the speed issue: if a rerun passes, it is not considered a real performance problem.


@wojtuss wojtuss left a comment


LGTM

@luotao1 luotao1 merged commit 5bf25d1 into PaddlePaddle:develop Jan 25, 2021