[MXNET-500]Test cases improvement for MKLDNN on Gluon #10921

juliusshufan · 2018-05-13T07:31:33Z

Description

This PR is a follow-up to the previously merged #10764.
In this PR, the following is covered:

Refine the cases on nn.Conv2D and change the input shape to hit the MKLDNN code path;
Add more test cases covering other Gluon layers, such as BN, Dense/FC, Pooling, and Deconv, from the "MKLDNN-specialty" perspective;
Add data coverage cases for some Gluon layers, such as Conv2D, BN, and Concat.

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

All the changes are in tests/python/mkl/test_mkldnn.py

Comments

The correctness check on Gluon computation follows the design used by tests/python/unittest/test_gluon.py, and therefore the helper functions defined in tests/python/unittest/common.py are also used.

marcoabreu · 2018-05-13T11:08:56Z

Hi, thanks for adding these tests! Could you elaborate on why we need backend-specific tests in a front-end language for operators? Please excuse me if I'm making a wrong assumption here, but the implementation should be transparent and always act the same.

This means we should not need any MKL specific tests for operators, considering their behaviour should be identical. From my point of view, there should rather be general tests for all operators (in the same style you just wrote them), and they should just succeed if we switch the backend to MKL.

I'm afraid that we will run into inconsistencies if we start writing custom tests for each backend. Ideally, all backends should produce the exact same output and exhibit the same behaviour. This would then be verified with general operator tests and thus make custom backend tests obsolete.
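To make the idea of a backend-agnostic check concrete, here is a minimal sketch in plain Python. It is only an illustration: `assert_backends_agree`, `naive_backend` and `blocked_backend` are hypothetical names, the "backends" are stand-in callables rather than real MXNet backends, and a floating-point tolerance is assumed in place of bit-exact equality.

```python
import math


def assert_backends_agree(op, backends, inputs, rel_tol=1e-5, abs_tol=1e-8):
    """Run `op` through every backend and compare each result with the first one.

    `backends` maps a name to a callable `run(op, inputs) -> list of floats`.
    All names here are hypothetical stand-ins, not MXNet APIs.
    """
    results = {name: run(op, inputs) for name, run in backends.items()}
    ref_name, ref = next(iter(results.items()))
    for name, out in results.items():
        assert len(out) == len(ref), f"{name} and {ref_name} differ in length"
        for a, b in zip(out, ref):
            assert math.isclose(a, b, rel_tol=rel_tol, abs_tol=abs_tol), (
                f"{name} disagrees with {ref_name}: {a} vs {b}"
            )
    return results


def naive_backend(op, xs):
    # Stand-in for a default implementation: one element at a time.
    return [op(x) for x in xs]


def blocked_backend(op, xs, block=16):
    # Stand-in for an optimized implementation that works on blocks of 16.
    out = []
    for start in range(0, len(xs), block):
        out.extend(op(x) for x in xs[start:start + block])
    return out


results = assert_backends_agree(
    lambda x: x * x + 1.0,
    {"naive": naive_backend, "blocked": blocked_backend},
    [float(i) for i in range(40)],
)
```

The same test body runs unchanged for every backend, so a new backend only needs to be added to the `backends` mapping.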

@szha @piiswrong @zheng-da am I right with my assumption?

juliusshufan · 2018-05-13T12:18:29Z

@marcoabreu Thanks for your comments and questions. I fully understand your concerns about consistency. Let me try to explain:
The main purpose of these newly added cases is to cover the "specialty" of integrating mkldnn with MXNet as a back end, and to improve data coverage. That includes:

1. Input shape: because mkldnn uses different memory layouts for different primitives, the input shape has to be chosen selectively to hit the mkldnn path. This is reflected by the test cases "test_mkldnn_conv2d" and "test_mkldnn_batchnorm", where the channel/filter numbers are all multiples of 16, which effectively covers the mkldnn code path. For the test_mkldnn_batchnorm case, feeding the BN layer a Conv2D output computed by the mkldnn path makes BN hit the mkldnn code path as well, so I use the Conv2D output as the input of the BN layer;
2. Memory layout: because (some of) the computations via mkldnn involve memory layout records, I added a series of test cases that combine an ndarray call involving a potential memory operation (such as slice or reshape) with a real computation (such as conv, bn, pooling, dense/fc etc.). These cases check that the memory layout is properly handled before and after a computation via mkldnn.
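As a rough illustration of point 1, the input shapes for such cases could be generated as below. This is a sketch only: `conv2d_cases` is a hypothetical helper, not part of test_mkldnn.py, and the batch sizes and spatial sizes are arbitrary placeholders. Only the block size of 16 for channel/filter numbers comes from the description above.

```python
from itertools import product


def conv2d_cases(batch_sizes=(1, 4), channel_blocks=(1, 2, 4),
                 spatial=(8, 32), block=16):
    """Yield (input_shape, out_channels) pairs for Conv2D test cases.

    Input shapes are in NCHW order. Both the input channel count and the
    filter count are multiples of `block`. Hypothetical helper for illustration.
    """
    for n, in_blocks, out_blocks, hw in product(
            batch_sizes, channel_blocks, channel_blocks, spatial):
        yield (n, in_blocks * block, hw, hw), out_blocks * block


cases = list(conv2d_cases())
```

Each pair can then be fed to a layer test, with the first element as the data shape and the second as the number of filters.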

As the test cases described in 1 and 2 do not focus only on computation correctness, is it possible to have the Jenkins script execute these cases only for MKLDNN-enabled builds?
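One possible way to restrict such cases to MKLDNN-enabled builds, sketched with the standard unittest module only. The environment variable `MXNET_TEST_MKLDNN` is a hypothetical flag assumed to be exported by the CI job for an MKLDNN build; it is not an existing MXNet or Jenkins setting, and the test body is a placeholder.

```python
import os
import unittest

# Hypothetical flag: assumed to be set to "1" by the CI job of an MKLDNN build.
MKLDNN_BUILD = os.environ.get("MXNET_TEST_MKLDNN", "0") == "1"


@unittest.skipUnless(MKLDNN_BUILD, "requires an MKLDNN-enabled build")
class MKLDNNOnlyCases(unittest.TestCase):
    def test_placeholder(self):
        # A real case would run a Gluon layer here; this body is a stand-in.
        self.assertTrue(MKLDNN_BUILD)


def run_suite():
    """Run the cases and return the unittest.TestResult for inspection."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(MKLDNNOnlyCases)
    result = unittest.TestResult()
    suite.run(result)
    return result
```

On a build without the flag, the whole class is reported as skipped rather than failed, so the same test file can stay in the common test run.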

@marcoabreu @szha @piiswrong @zheng-da @pengzhao-intel @TaoLv May I have all your comments?

Thanks.

marcoabreu · 2018-05-13T16:02:24Z

I see, thanks a lot for elaborating! Does this mean that MKLDNN is only being used if the input is in a certain shape? What happens in unusual shapes? The problem with this test is that you are not able to verify whether MKLDNN has actually been hit or if another implementation was used, right?

We might have to start looking into CPP tests for MKLDNN specific tasks, considering these backends are designed to be transparent and a lot of information is abstracted away due to the hourglass C-API. It might be easier to validate your assumptions in CPP. What do you think? I don't feel strongly about it, but this is something we should look into in future.

juliusshufan · 2018-05-13T16:38:53Z

@marcoabreu Thanks for your prompt reply.

For the input shape, given my limited knowledge of mkldnn, I think it is more accurate to call it the "preferred" shape. Taking conv2d as an example, computation with channel/filter numbers that are multiples of 16 can benefit fully from mkldnn, and the channel/filter numbers commonly used in neural networks are multiples of 16 (such as 64, 128, 256, 512 etc.). As for the behavior of mkldnn with unusual shapes, I think @zheng-da @pengzhao-intel @TaoLv have more expertise in this area and can give a better explanation than I can.

For CPP cases, I fully agree with you that they are helpful for mkldnn-specific tasks, especially for boundary/corner cases; in such situations, the input might be invalidated by the framework and mkldnn can't be hit. The Python cases, though, are relatively fast to construct and can focus on the specialty of the mkldnn integration, especially data coverage and cases that combine different layers, involving both a computation via mkldnn and the native CPU implementation path. I think these Python cases can be further elaborated and executed according to the existing pre-CI scheme of MXNet.

Meanwhile, I am actually planning CPP cases as well, and I am very willing to work with you all on test improvement in this area.

What do you think?

Thanks.

zheng-da · 2018-05-13T16:59:00Z

@marcoabreu I agree with you that we need cpp tests for MKLDNN to explicitly cover different input and output NDArrays. I'm writing such unit tests. These tests will cover many more cases, many of which are not covered by the Python unit tests. However, writing such cpp unit tests is much more difficult, and each can probably only be used for testing a single operator. We'll need more people to write C++ unit tests.
The tests are in this PR: #10651
I'll probably split the PR and submit another PR for adding cpp unit tests.

That's why the Python tests can be useful. They can easily cover many different operator combinations, which may cause problems in MKLDNN and were never tested in our unit tests.

zheng-da · 2018-05-13T17:00:13Z