merge upstream to resolve use_mkldnn issue #557

ashokei · 2019-02-11T19:17:57Z

Description

(Brief description on what this PR is about)

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

Feature1, tests, (and when applicable, API doc)
Feature2, tests, (and when applicable, API doc)

Comments

If this change is a backward incompatible change, why must this change be made.
Interesting edge cases to note here

* Fix exception handling api doc * Update waitall api doc Co-Authored-By: anirudh2290 <anirudh2290@apache.org>

* add inception test * fix max iter for mlp * rename and add comment * rename epoch num

* Add imresize API to docs * address comments * copyMakeBorder

* fix control_flow_op * change type for M * add test for sparse where op

* add mkl blas to Jenkins * add mkl install script * fix bug in mkl script * remove python2 ut and add cpu-mkl node

Merge this PR for 1.4.x

* [MXNET-1155] Add scala packageTest utility * Clean up utility * Safe change directory in Makefile for scala * mvn install file instructions with details

Major JNI feature changes. Please find more info here: https://cwiki.apache.org/confluence/display/MXNET/Scala+maven+build+improvement

* Always use config.mk in make install instructions * Specify Cuda 0 for ubuntu with mkldnn * Scala install doc avoid build_from_source Minor doc fixes * Fix build_from_source CMake usage * CPP Install Instruction with CMake * Use cmake out of source build

* Optimize C++ API Pass parameter with reference instead of value. Add const as well as it is not changed. * fix docs/architecture/overview.md Fix BinaryShapeFunction typedef Add a right brace for SmoothL1Shape_

…ed in the model (#13596)

* openblas script * ps-lite dependencies * USE_S3 dependencies * image libraries * license

* add batch norm test * fix formatting * use out_arr as input * fix typo * remove const * use ptr * eval ptr

* Fix the bug of BidirectionalCell I did hybridize( ) and pass "valid_length" to the unroll( ) function of BidirectionalCell, then returned AssertionError in line 79. Because symbol.split( ) return a symbol but not a symbol list. Result in the length of inputs dont equal parameter "length" when call unroll( ) to compute r_outputs and r_states. * add a test for BidirectionalCell * Fix the bug of BidirectionalCell I did hybridize( ) and pass "valid_length" to the unroll( ) function of BidirectionalCell, then returned AssertionError in line 79. Because symbol.split( ) return a symbol but not a symbol list. Result in the length of inputs dont equal parameter "length" when call unroll( ) to compute r_outputs and r_states. * fix test_bidirectional_unroll_valid_length( ) Fix the error of parameter. * Fix the bug of BidirectionalCell I did hybridize( ) and pass "valid_length" to the unroll( ) function of BidirectionalCell, then returned AssertionError in line 79. Because symbol.split( ) return a symbol but not a symbol list. Result in the length of inputs dont equal parameter "length" when call unroll( ) to compute r_outputs and r_states. * fix test_bidirectional_unroll_valid_length( )

* Revert "Revert "Feature/mkldnn static 2 (#13503)" (#13540)" This reverts commit a3eca5f. * include headers on mkldnn lib * retrigger * retrigger

* config for pip * symbol whitelist * maven build config

…13602) * apache/mxnet#12255 doing import mxnet in multiple processes take very long time. Details : #12255 One of the reason we have OMP tuning code which iterates to find OMP tune overhead. We are reducing this iteration count to reduce the overehead of tuning code. Also, We added an environment variable where users can set the number of cores that should be used to determine tuning. * cpplint fix * Adding new environment variable: MXNET_USE_NUM_CORES_OPERATOR_TUNING to doc * fixing formatting in doc

* Add reshape op supported by MKL-DNN * fix build issue * fix lint * fix lint * fix lint * fix lint * fix lint * fix lint * fix white space * add unit test * merge if blocks

* Improve dev_menu, add build command and virtualenv creation with local builds for easy testing * Update dev_menu.py Co-Authored-By: larroy <pedro.larroy.lists@gmail.com> * Cuda off by default, use ccache * address CR

@kohr-h

…e latest maven.org release (#13507) * Correct the versions so they correspond to the latest maven.org release * trigger build * feedback from @kohr-h

* Change argsort to argpartition * Global statistics in metrics * Fix lint * Fixes from review * Trigger * Fixes from review, fix to F1, MCC and perplexity metrics, added test for global stats * Fix lint * Fix compatibility with Python 2

This reverts commit 5bcf2bd.

* support mkl log when dtype is fp32 or fp64 * remove macro * ensure data size less than or equal MKL_INT_MAX * code specification * fix indent * for retrigger

* update the scala installation tutorial on intellij * update the so answer * update the so answer

* Add CPU implementation of ToTensor * Add tests for cpu * Add gpu implementation and tests * Fix lint issues * Cleanup includes * Move back changes to original image operators files * Add 4D example * resolve merge conflicts * Fix failing tests * parallelize on channel in kernel launch

ran 10000 times with no failures

* Fix doc building * Remove deplicate in

@benkamphaus

* Add resource scope to clojure package * add rat * fix integration test * feedback from @benkamphaus - move from defs to atoms to make the tests a bit better * adding alias with-do and with-let more tests * another test * Add examples in docstring * refactor example and test to use resource-scope/with-let * fix tests and problem with laziness now they work as expected! * refactor to be a bit more modular * remove comments

LGTM

* add new cloud providers * fix colon

* cudnn dropout * test dropout as stateful op * add cudnn_off * refactor * fix bug when using inf forward * turn on cudnn in gluon * reuse dropout state space * dropout passthrough * address comments

* keeping same contexts for comparison * enabling test * testing default context * Revert "testing default context" This reverts commit 1f95d0228178debde14680839bb6abab14c6d049. * Disabling test due to CI failure on MKL-DNN

* Expand,tile op export * fix * adding test cases * adding comments

* fix roi align test * retrigger unittest * add more test detail for ROIAlign test * remove url in test_op_roi_align * remove blank line in test_op_roi_align in test_operator * merge master * Update test_operator.py * retrigger CI

* parallelize on channel forward pass * parallelize on channel normalize backward pass * Fix lint issues * Trying to fix CI build failure on GPU * Fix failing GPU test on CI Do not pass normalize param as is to GPU kernel * Fix to_tensor tests * Pass mean and std_dev as native types for kernel * Fix CI failure. Do not pass mean, std as vector to kernel

* Better bulkappend * Fix lint

* updating scala docs * Addressed PR feedback

…(#13680) * [MXNET-1121] Example to demonstrate the inference workflow using RNN * Addressed the review comments. Updated the ReadMe files. * Removed the unnecessary creation of NDArray * Added the unit tests to nightly tests to catch the failure. * Updated the makefiles and unit tests so that the examples are built and tested in nightly * Added the visual representation of the model and fixed the CI failure. * Added the missing pdf file. * Fixing the broken ci_test.sh * Update cpp-package/example/inference/README.md Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/README.md Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/README.md Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/README.md Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/README.md Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/simple_rnn.cpp Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/simple_rnn.cpp Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/simple_rnn.cpp Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/simple_rnn.cpp Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/simple_rnn.cpp Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/README.md Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/README.md Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/README.md Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/simple_rnn.cpp Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Update cpp-package/example/inference/simple_rnn.cpp Co-Authored-By: leleamol <19983848+leleamol@users.noreply.github.com> * Applying unresolved changes to README file. * Fixing the CI build failure. * Updated the RNN example from sequence generation to sentiment analysis * Updated the readme files. Updated the example to use trained model and updated the unit test. * Addressed the review comment to increase the default sequence length. Added the examples with inputs of various lengths. * Updated the example to handle variable length input. Updated the readme and unit test files accordingly. * Updated the example to share the memory between executors by createing shared executors. * Updated the creation of executors from largest to smallest bucket key * Creating the executor for the highest bucket key. * Updated the unit test to check for the results in a range and modified the function name to be consistent with others. * Fixed the logic to find the right bucket.

* hybridize rnn and add model graph * trigger CI * separate mxboard visualization * add options and she-bang * add defaults * trigger CI * rename export-model

* exclude concat for gpu quantization * remove quantized_concat test in non-subgraph flow

* Remove stale check for op req type * Do not register to tensor operator with in place option.

* Enable s8s8 support for MKLDNN convolution. * Fix cpp build * Fix build. * Fix build * Remove openmp min/max reduction for windows build * Add mkldnn_OIhw4i16o4i_s8s8 support * Add all s8s8 weight format * Change ssd quantize script. * Update * Manually cast mshadow shape size to size_t * Fix merge. * Fix perl package. * Retrigger CI * Fix GPU test * Fix GPU test * Rerun CI * Rerun CI * Rerun CI * Rerun CI * Remove weight_channelwise_scale from params. * Fix * Keep API compatible. * Rerun CI * Rerun CI * Rerun CI * Rerun CI * Address comments. * fix. * Address debug build. * Add comment for next_impl * Rerun ci * Add new api MXExecutorSetMonitorCallbackEX * Add default value for monitor_all for cpp header. * Rerun CI * fix * script change for uint8. * trigger ci * trigger ci

@sandeep-krishnamurthy

…815) * Unify the style here Unify the style here and remove the testing 'print' code segment. * Unify the description of comment Change the description of comment from "multi-layer perceptron" to "Get multi-layer perceptron" * Unify the style of comments Unify the style of comments suggested by @sandeep-krishnamurthy * git pull the lastest code from master of incubator-mxnet * Complete rebase * Solve PEP8 [C0304 ] Final newline missing Sovle example/deep-embedded-clustering/solver.py(150): [C0304 ] Final newline missing

srochel and others added 30 commits December 7, 2018 19:41

fix link for gluon model zoo (#13583)

95f1e1c

Fix exception handling api doc (#13519)

7d2b804

* Fix exception handling api doc * Update waitall api doc Co-Authored-By: anirudh2290 <anirudh2290@apache.org>

add cpp example inception to nightly test (#13534)

f2ca66f

* add inception test * fix max iter for mlp * rename and add comment * rename epoch num

Add notes about debug with libstdc++ symbols (#13533)

ba02bf2

Add imresize and copyMakeBorder to mx.image (#13357)

75d1d4f

* Add imresize API to docs * address comments * copyMakeBorder

[MXNET-1253] fix control_flow_op (#13555)

46a2990

* fix control_flow_op * change type for M * add test for sparse where op

Add Intel MKL blas to Jenkins (#13607)

e25e18f

* add mkl blas to Jenkins * add mkl install script * fix bug in mkl script * remove python2 ut and add cpu-mkl node

#13385 [Clojure] - Turn examples into integration tests (#13554)

449e17d

fix the Float not showing correctly problem (#13617)

1f8bb26

Merge this PR for 1.4.x

[MXNET-1155] Add scala packageTest utility (#13046)

a4c97ec

* [MXNET-1155] Add scala packageTest utility * Clean up utility * Safe change directory in Makefile for scala * mvn install file instructions with details

[MXNET-1224]: improve scala maven jni build and packing. (#13493)

b242b0c

Major JNI feature changes. Please find more info here: https://cwiki.apache.org/confluence/display/MXNET/Scala+maven+build+improvement

Fix warning in waitall doc (#13618)

9ce7eab

Optimize C++ API (#13496)

002e0bb

* Optimize C++ API Pass parameter with reference instead of value. Add const as well as it is not changed. * fix docs/architecture/overview.md Fix BinaryShapeFunction typedef Add a right brace for SmoothL1Shape_

Merge branch 'java-api' into master

25be62f

fix quantize pass error when the quantization supported Op are exclud…

e36f888

…ed in the model (#13596)

Merge branch 'revert-and-merge'

d1e7b81

Scripts for building dependency libraries of MXNet (#13282)

439f167

* openblas script * ps-lite dependencies * USE_S3 dependencies * image libraries * license

add batch norm test (#13625)

b45e127

* add batch norm test * fix formatting * use out_arr as input * fix typo * remove const * use ptr * eval ptr

Set install path for libmxnet.so dynamic lib on Mac OS (#13629)

5d2b763

Feature/mkldnn static (#13628)

5bcf2bd

* Revert "Revert "Feature/mkldnn static 2 (#13503)" (#13540)" This reverts commit a3eca5f. * include headers on mkldnn lib * retrigger * retrigger

build config for maven and pip (#13556)

7fe0409

* config for pip * symbol whitelist * maven build config

Add reshape op supported by MKL-DNN (#12980)

c4a619c

* Add reshape op supported by MKL-DNN * fix build issue * fix lint * fix lint * fix lint * fix lint * fix lint * fix lint * fix white space * add unit test * merge if blocks

Improve dev_menu usability, local build and virtualenv (#13529)

a1cc2ba

* Improve dev_menu, add build command and virtualenv creation with local builds for easy testing * Update dev_menu.py Co-Authored-By: larroy <pedro.larroy.lists@gmail.com> * Cuda off by default, use ccache * address CR

[Clojure] Correct the versions in the README so they correspond to th…

aca93d8

…e latest maven.org release (#13507) * Correct the versions so they correspond to the latest maven.org release * trigger build * feedback from @kohr-h

Optimization of metric evaluation (#13471)

98750fe

* Change argsort to argpartition * Global statistics in metrics * Fix lint * Fixes from review * Trigger * Fixes from review, fix to F1, MCC and perplexity metrics, added test for global stats * Fix lint * Fix compatibility with Python 2

Revert "Feature/mkldnn static (#13628)" (#13638)

9206c40

This reverts commit 5bcf2bd.

support mkl log when dtype is fp32 or fp64 (#13150)

30655f9

* support mkl log when dtype is fp32 or fp64 * remove macro * ensure data size less than or equal MKL_INT_MAX * code specification * fix indent * for retrigger

stu1130 and others added 24 commits February 4, 2019 14:13

update the scala installation tutorial on intellij (#14033)

9aba77b

* update the scala installation tutorial on intellij * update the so answer * update the so answer

rewrote the concat test to avoid flaky failures (#14049)

ac962cd

ran 10000 times with no failures

Fix website scala doc (#14065)

8ba6402

* Fix doc building * Remove deplicate in

Update NOTICE (#14043)

41b81c3

modifying SyncBN doc for FP16 use case (#14041)

3f6778b

LGTM

add new cloud providers to install page (#14039)

78870c0

* add new cloud providers * fix colon

CUDNN dropout (#13896)

18b8704

* cudnn dropout * test dropout as stateful op * add cudnn_off * refactor * fix bug when using inf forward * turn on cudnn in gluon * reuse dropout state space * dropout passthrough * address comments

ONNX export: broadcast_to, tile ops (#13981)

ae997eb

* Expand,tile op export * fix * adding test cases * adding comments

[MXNET-1258]fix unittest for ROIAlign Operator (#13609)

7c7af3a

* fix roi align test * retrigger unittest * add more test detail for ROIAlign test * remove url in test_op_roi_align * remove blank line in test_op_roi_align in test_operator * merge master * Update test_operator.py * retrigger CI

Add maven wraper to scala project. (#13702)

9e14f14

Increase perfomance of BulkAppend and BulkFlush (#14067)

149d810

* Better bulkappend * Fix lint

[MXNET-1178] updating scala docs (#14070)

8191585

* updating scala docs * Addressed PR feedback

update the version name (#14076)

d684c59

hybridize rnn and add model graph (#13244)

506a25c

* hybridize rnn and add model graph * trigger CI * separate mxboard visualization * add options and she-bang * add defaults * trigger CI * rename export-model

Exclude concat layer for gpu quantization (#14060)

26ca37c

* exclude concat for gpu quantization * remove quantized_concat test in non-subgraph flow

Remove inplace support for ToTensor operator (#14083)

2ff8ce0

* Remove stale check for op req type * Do not register to tensor operator with in place option.

merge upstream to resolve USE_MKLDNN issue

aef3384

ashokei requested review from mbrookhart and cconvey February 11, 2019 19:17

ashokei added 2 commits February 11, 2019 12:21

fix merge issue

2d80d03

skip output_names unittest for mxnet-ngraph

1b7b635

mbrookhart approved these changes Feb 12, 2019

View reviewed changes

ashokei merged commit 5a444a1 into master Feb 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

merge upstream to resolve use_mkldnn issue #557

merge upstream to resolve use_mkldnn issue #557

ashokei commented Feb 11, 2019

merge upstream to resolve use_mkldnn issue #557

merge upstream to resolve use_mkldnn issue #557

Conversation

ashokei commented Feb 11, 2019

Description

Checklist

Essentials

Changes

Comments