Add mkldnn related unit-tests #15615

luotao1 · 2019-01-31T04:44:11Z

To improve the code (line+function) coverage, please add mkldnn related unit-tests:
http://ce.paddlepaddle.org:8080/viewLog.html?buildId=21110&buildTypeId=PaddlePaddleFramework_TestCoverage_TestCoverage&tab=report_project15_C___Coverage

framework/ir/mkldnn
operators/elementwise/mkldnn

kbinias · 2019-02-07T10:17:00Z

We reproduced your results with coverage testing tool.

cmake .. -DWITH_GPU=OFF -DWITH_PROFILER=ON -DWITH_STYLE_CHECK=OFF -DWITH_MKLDNN=ON -DWITH_TESTING=ON -DWITH_COVERAGE=ON
lcov --capture --directory . --output-file coverage.info
genhtml coverage.info --output-directory out

We will add missing tests.

kbinias · 2019-02-15T06:29:38Z

@luotao1 Whether it is possible to run more than 1 iteration in a single unit test for forward and backward phases ? We need to cover the part of the code responsible for reuse MKL-DNN primitives in backward (e.g. activation_mkldnn_op.cc::eltwise_grad function line 276-287). We have to run more than one iteration to enter this section of code.

luotao1 · 2019-02-15T07:09:26Z

Whether it is possible to run more than 1 iteration in a single unit test for forward and backward phases?

I think it's OK to run more than 1 iterations in elementwise_xxx_mkldnn unit-tests. However, if you add many more iterations, does the elasped time of unit-tests become larger?

kbinias · 2019-02-15T07:55:11Z

Not many, 2 iterations will be enough. Is there any unit test in PaddlePaddle which more than 1 iteration is started ?

luotao1 · 2019-02-18T03:43:39Z

@kbinias There is no 2 iterations unitest using OpTest. Maybe you should implement yourselves, and refer to test_batch_norm_op.py.

kbinias · 2019-02-18T16:25:40Z

@luotao1 Thanks for you answer. What do you think about creating new test based on test_imperative_resnet.py with use_mkldnn equals to True ? It will cover batch_norm, conv, pool and activations.

luotao1 · 2019-02-19T03:21:35Z

What do you think about creating a new test based on test_imperative_resnet.py with use_mkldnn equals to True?

@kbinias I think it's OK.

kbinias · 2019-03-07T10:02:23Z

@luotao1 What do you think about closing this PR ? The problem with function coverage is IMO related to lcov tool.

luotao1 · 2019-03-07T11:31:30Z

The Functions coverage is less than 80% in operators/mkldnn. You can add the unit-tests for activation_mkldnn_op.cc, conv_mkldnn_op.cc and transpose_mkldnn_op.cc when you are convenient. @kbinias

kbinias · 2019-03-13T12:55:57Z

@luotao1 Coverage feature in GCC doesn't support virtual destructors correctly. There is no way to cover tests in 100% if test uses Inheritance. You can find out more details below with good explanation:
https://stackoverflow.com/questions/25662174/is-there-a-way-to-call-the-deleting-destructor-of-a-pure-virtual-class.

One of potential workaround might be post-processing and remove all destructors from tests, we could consider that it’s a noise. The destructors should not be a part of final calculation.

luotao1 · 2019-03-13T13:55:12Z

Got it. Both @tensor-tang and I think destructors are normal.
However, the current coverage by May 12th is http://ce.paddlepaddle.org:8080/viewLog.html?buildId=31464&buildTypeId=PaddlePaddleFramework_TestCoverage_TestCoverage&tab=report_project15_C___Coverage

Why the coverage of batch_norm_mkldnn_op and lrn_mkldnn_op cut down that before?

lidanqing-intel · 2019-03-14T11:33:46Z

我们的结果完全不一样，lrn应该是100%的，您可以用下面的编译指令重新生成一下吗 @luotao1 谢谢
⦁        cmake .. -DWITH_GPU=OFF -DWITH_PROFILER=ON -DWITH_STYLE_CHECK=OFF -DWITH_MKLDNN=ON -DWITH_TESTING=ON -DWITH_COVERAGE=ON
⦁        make -j 12
⦁        ctest -R mkldnn
⦁        lcov --capture --directory . --output-file coverage.info
⦁        genhtml --demangle-cpp coverage.info --output-directory out

luotao1 · 2019-03-14T11:40:04Z

我是CI上的结果。

lidanqing-intel · 2019-03-14T11:43:01Z

Could you please tell how to reproduce it with commands ? It seems we have different procedure... 请问怎么reproduce呢，可以有什么指令吗？非常感谢！

lidanqing-intel · 2019-03-14T11:46:18Z

We are checking build log, hope we could find something. Thanks. If you have any further information, please inform us. Thanks

pawelpiotrowicz · 2019-03-19T10:50:37Z

Macro REGISTER_OP_KERNEL is not able to set correct handle for all grad functions such TouchOpKernelRegistrar_transpose_grad_MKLDNN_DEFAULT_TYPE().
Due to missed registration, we can’t cover all functions in tests. The issue is globally, moreover practical working example how to register kernel and enforce touchopkernelregistrar_grad correctly doesn’t exist. The open question is - do we really should care about it?

kbinias · 2019-03-19T21:26:27Z

@luotao1 What do you think about above problem ?

panyx0718 · 2019-03-20T03:18:43Z

do you mean paddle can't register some kernel or paddle can't select some registered kernels?

can you search REGISTER_OP_KERNEL_WITH_CUSTOM_TYPE and see if it works for you?

luotao1 · 2019-03-20T03:33:32Z

@shanyi15 Could you help to update How to add new op to introduce so many different REGISTER MACRO?

DaisyXten · 2019-03-20T09:13:52Z

@luotao1 About batch_norm_mkldnn and lrn_mkldnn line coverages, I have some idea and made some fix locally. I could build and make with docker[latest-dev], but I don't know what is the building and testing script. Something like ./paddle_build_cov.sh. I also don't know what main cov means in the build log on TeamCity. Could you share build script with me? This will speed up the work. Thank you!

luotao1 · 2019-03-20T09:34:05Z

@lidanqing-intel The different between paddle_build_cov.sh adn paddle_build.sh is only in function main()

cov)
        cmake_gen ${PYTHON_ABI:-""}
        build
        run_test
        ;;

@kolinwei will update paddle_build.sh in next PR.

lidanqing-intel · 2019-03-20T09:37:37Z

@lidanqing-intel The different between paddle_build_cov.sh adn paddle_build.sh is only in function main()
cov)
        cmake_gen ${PYTHON_ABI:-""}
        build
        run_test
        ;;
@kolinwei will update paddle_build.sh in next PR.

Thank you!

luotao1 · 2019-03-20T09:42:17Z

please see #16324

pawelpiotrowicz · 2019-03-20T12:53:06Z

@panyx0718 @luotao1 @kbinias

The point it that Register_Op_kernel consist of Register_op_Kernel_with_custom_type.

#define REGISTER_OP_KERNEL(op_type, library_type, place_class, __VA_ARGS__)
REGISTER_OP_KERNEL_WITH_CUSTOM_TYPE( op_type, library_type, place_class, DEFAULT_TYPE, ::paddle::framework::OpKernelType::kDefaultCustomizedTypeValue, __VA_ARGS__)

To better understand what I’m trying to tell is a simple example below.
We have two execution on macro ( for instance transpose_mkldnn_op.cc) .

REGISTER_OP_KERNEL(transpose, MKLDNN, ::paddle::platform::CPUPlace,
                   ops::TransposeMKLDNNOpKernel<float>);
REGISTER_OP_KERNEL(transpose_grad, MKLDNN, ::paddle::platform::CPUPlace,
                   ops::TransposeMKLDNNGradOpKernel<float>);

So we got two functions, only first one will be executed – I expect both of them.

int TouchOpKernelRegistrar_transpose_MKLDNN_DEFAULT_TYPE() { __op_kernel_registrar_transpose_MKLDNN_DEFAULT_TYPE__ .Touch(); return 0; }
int TouchOpKernelRegistrar_transpose_grad_MKLDNN_DEFAULT_TYPE() { __op_kernel_registrar_transpose_grad_MKLDNN_DEFAULT_TYPE__ .Touch(); return 0; }

lidanqing-intel · 2019-03-20T14:23:12Z

Hi, @luotao1 Both batch_norm_mkldnn and lrn_mkldnn failed because of timeout and it passed with good line coverage in our machines, without docker and tests are run in serial.

I just update the progress with you. And similar problems happened before so I put comment here.

I consider to set in cmake/generic.cmake, in line 394 and line 721
set_tests_properties(${TARGET_NAME} PROPERTIES TIMEOUT 1000)

We also found the unit tests are run in parallel, while timeout limit is set 600 strictly. We consider setting mkldnn unit test to run in the serial way as in PR #16233 .

luotao1 · 2019-03-20T14:33:15Z

@lidanqing-intel GOT it, Thanks very much for your analysis!

it passed with good line coverage in our machines

Could you give the line coverage here?

lidanqing-intel · 2019-03-20T14:44:00Z

As in krzysztof's comment.
line cov of batch_norm_mkldnn_op.cc 91%
line cov of lrn_mkldnn_op.cc 100%

luotao1 · 2019-03-20T15:18:41Z

@kbinias I close this issue due to #15615 (comment)
@pawelpiotrowicz For #15615 (comment), how about creating a new issue to discuss it?

luotao1 added the Intel label Jan 31, 2019

luotao1 assigned Sand3r-, jczaja, wojtuss and kbinias Jan 31, 2019

kbinias mentioned this issue Feb 21, 2019

MKL-DNN: Add UT to check whether primitives already exist in backward #15856

Merged

Sand3r- mentioned this issue Feb 21, 2019

MKL-DNN: Add test for conv bias fuse pass #15824

Merged

This was referenced Feb 23, 2019

MKL-DNN: Add Softmax UT to check whether primitives already exist in backward #15894

Closed

MKL-DNN: Add Activations and Softmax UTs to check if primitives already exist in backward #15922

Merged

lidanqing-intel mentioned this issue Feb 26, 2019

MKLDNN: Add UT for conv2d_mkldnn_op with fuse_bias and fuse_residual #15936

Closed

This was referenced Feb 26, 2019

MKL-DNN: Add placement pass tester #15943

Merged

MKL-DNN: Add a unit test for pooling when ceil mode is enabled #16013

Merged

This was referenced Mar 1, 2019

MKLDNN: Add UT for conv2d_mkldnn_op with fuse_bias and fuse_residual #16016

Merged

MKLDNN: Add UT for conv_transpose_mkldnn op. #16030

Merged

lidanqing-intel mentioned this issue Mar 15, 2019

Cannot build Debug build with WITH_GPU=ON due to file truncation #14775

Closed

kolinwei mentioned this issue Mar 20, 2019

add coverage option in paddle_build.sh #16324

Closed

luotao1 closed this as completed Mar 20, 2019

lidanqing-intel mentioned this issue Jul 24, 2019

remove unused TransposeINT8MKLDNNOpKernel for higher UT coverage #18791

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add mkldnn related unit-tests #15615

Add mkldnn related unit-tests #15615

luotao1 commented Jan 31, 2019 •

edited

Loading

kbinias commented Feb 7, 2019 •

edited

Loading

kbinias commented Feb 15, 2019

luotao1 commented Feb 15, 2019

kbinias commented Feb 15, 2019

luotao1 commented Feb 18, 2019

kbinias commented Feb 18, 2019 •

edited

Loading

luotao1 commented Feb 19, 2019

kbinias commented Mar 7, 2019

luotao1 commented Mar 7, 2019

kbinias commented Mar 13, 2019

luotao1 commented Mar 13, 2019

lidanqing-intel commented Mar 14, 2019 •

edited

Loading

luotao1 commented Mar 14, 2019

lidanqing-intel commented Mar 14, 2019

lidanqing-intel commented Mar 14, 2019

pawelpiotrowicz commented Mar 19, 2019 •

edited

Loading

kbinias commented Mar 19, 2019

panyx0718 commented Mar 20, 2019

luotao1 commented Mar 20, 2019

DaisyXten commented Mar 20, 2019

luotao1 commented Mar 20, 2019

lidanqing-intel commented Mar 20, 2019

luotao1 commented Mar 20, 2019

pawelpiotrowicz commented Mar 20, 2019 •

edited

Loading

lidanqing-intel commented Mar 20, 2019

luotao1 commented Mar 20, 2019

lidanqing-intel commented Mar 20, 2019

luotao1 commented Mar 20, 2019

Add mkldnn related unit-tests #15615

Add mkldnn related unit-tests #15615

Comments

luotao1 commented Jan 31, 2019 • edited Loading

kbinias commented Feb 7, 2019 • edited Loading

kbinias commented Feb 15, 2019

luotao1 commented Feb 15, 2019

kbinias commented Feb 15, 2019

luotao1 commented Feb 18, 2019

kbinias commented Feb 18, 2019 • edited Loading

luotao1 commented Feb 19, 2019

kbinias commented Mar 7, 2019

luotao1 commented Mar 7, 2019

kbinias commented Mar 13, 2019

luotao1 commented Mar 13, 2019

lidanqing-intel commented Mar 14, 2019 • edited Loading

luotao1 commented Mar 14, 2019

lidanqing-intel commented Mar 14, 2019

lidanqing-intel commented Mar 14, 2019

pawelpiotrowicz commented Mar 19, 2019 • edited Loading

kbinias commented Mar 19, 2019

panyx0718 commented Mar 20, 2019

luotao1 commented Mar 20, 2019

DaisyXten commented Mar 20, 2019

luotao1 commented Mar 20, 2019

lidanqing-intel commented Mar 20, 2019

luotao1 commented Mar 20, 2019

pawelpiotrowicz commented Mar 20, 2019 • edited Loading

lidanqing-intel commented Mar 20, 2019

luotao1 commented Mar 20, 2019

lidanqing-intel commented Mar 20, 2019

luotao1 commented Mar 20, 2019

luotao1 commented Jan 31, 2019 •

edited

Loading

kbinias commented Feb 7, 2019 •

edited

Loading

kbinias commented Feb 18, 2019 •

edited

Loading

lidanqing-intel commented Mar 14, 2019 •

edited

Loading

pawelpiotrowicz commented Mar 19, 2019 •

edited

Loading

pawelpiotrowicz commented Mar 20, 2019 •

edited

Loading