This repository has been archived by the owner on Jan 3, 2023. It is now read-only.

Merge in upstream master through 4fe546 #543

Merged

cconvey merged 437 commits into master on Jan 28, 2019

Conversation

@cconvey (Contributor) commented Jan 17, 2019

Description

(Brief description of what this PR is about)

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
  • Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

  • Feature1, tests, (and when applicable, API doc)
  • Feature2, tests, (and when applicable, API doc)

Comments

  • If this is a backward-incompatible change, why it must be made
  • Interesting edge cases to note here

TaoLv and others added 30 commits November 22, 2018 19:42
* fix quantized pooling and enable it in INT8 SqueezeNet

* add test

* fix test

* address review comments

* refine the test for quantized pooling
* edge_id op csr forward on CPU (#34)

* add node subgraph generator. (#35)

* create DGLSubgraph.

* fix.

* return old eids in node_subgraph.

* accelerate subgraph construction.

* Add neighborhood op (#37)

* add csr_neighborhood op

* update neighborhood sample

* Update csr_neighborhood_sample-inl.h

* Update csr_neighborhood_sample-inl.h

* Update csr_neighborhood_sample.cc

* add graph compact operator.

* fix a bug in dgl_subgraph.

* fix a bug in dgl_graph_compact.

* Update csr sample op (#39)

* add csr_neighborhood op

* update neighborhood sample

* Update csr_neighborhood_sample-inl.h

* Update csr_neighborhood_sample-inl.h

* Update csr_neighborhood_sample.cc

* Update csr_neighborhood_sample-inl.h

* Update csr_neighborhood_sample.cc

* Update csr_neighborhood_sample-inl.h

* remove space.

* move to dgl_graph to contrib.

* move code.

* move edge id.

* fix compilation error.

* add test for subgraph.

* cleanup.

* fix.

* fix.

* fix compile error.

* fix compile error.

* fix compile error.

* fix.

* add operator doc.

* remove graph_compact

* update doc.

* address comments.

* retrigger.

* address comments.

* retrigger

* fix a bug in test.

* retrigger

* add check_format
* Updated the paths for images

* Empty commit

* Empty commit

* Nudge to CI
* add flag to disable mkldnn cache

* update docs

* fix typos

* update var name

* fix ordering

* set cache size

* fix log message

* update docs

* fix lint

* fix lint

* fix comparison

* update method name

* fix missing

* fix logging

* remove random item when cache exceeded

* update helper name

* update hash namespace

* make ophash template

* update function params

* fix return

* fix return

* update return for helper

* change class to typename

* add typename

* fix lint

* update doc

* pass ptr to cache

* retrigger

* retrigger

* retrigger

* change env var name to MXNET_MKLDNN_CACHE_NUM

* fix log env name

* retrigger
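Taken together, the commits above describe a bounded op cache: an MXNET_MKLDNN_CACHE_NUM environment variable caps the number of cached entries, and a random item is evicted when the cap would be exceeded. A minimal Python sketch of that idea (the actual change lives in MXNet's C++ MKL-DNN layer; the class and method names here are illustrative, not MXNet's real API):

```python
import os
import random

class BoundedOpCache:
    """Illustrative bounded cache with random eviction."""

    def __init__(self):
        # MXNET_MKLDNN_CACHE_NUM: -1 (the default) means unbounded;
        # a non-negative value caps the number of cached entries.
        self._limit = int(os.environ.get("MXNET_MKLDNN_CACHE_NUM", "-1"))
        self._cache = {}

    def insert(self, key, op):
        if (self._limit >= 0 and len(self._cache) >= self._limit
                and key not in self._cache):
            # "remove random item when cache exceeded"
            evict = random.choice(list(self._cache))
            del self._cache[evict]
        self._cache[key] = op

    def get(self, key):
        return self._cache.get(key)
```

Random eviction keeps the implementation trivial (no LRU bookkeeping) while still bounding memory, which matches the intent of the commit series.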
* Initial website documentation for Java API

* Changing paths to be relative

* Refactoring Java API website landing page

* Update Java web docs based on feedback

* Minor formatting fixes

* Update maven repo to nightly build so that java will be available prior to 1.4.0 release

* Adding java tutorial index to test_sanity_tutorials whitelist

* Fix link to javadocs

* Fix javadoc for infer package and minor install doc fix

* Minor path fix
…(#13402)

* Replace mxnetci dockcross with public dockcross due to missing image

* Remove source lists change

* Disable Jetson

* Move to mxnetcipinned
* Correct shapes of images in cifar10 and cifar100

cifar10 and cifar100 have 3 channels

* Retrigger build
* initial modification recommender

* Recommender updates

* fix notebooks

* Update README.md

* trigger build

* Update README.md

* Retrigger build
* improving multi-processing reliability for gluon dataloader

I found some multi-processing-related issues in the Gluon DataLoader.

 1) Each time a _MultiWorkerIter shuts down, it can leave dangling processes: the shutdown mechanism does not guarantee that all worker processes terminate. As a result, after running for several epochs, more and more dangling processes accumulate.

  This problem barely happens during training, because there is a decent time interval between the last-batch data prefetching and the _MultiWorkerIter's shutdown.
  But it happens frequently 1) when I stop the iter before the end of an epoch, and 2) when I use the DataLoader for a data-loading service and load data as fast as possible. In both cases, the time interval between the most recent data prefetch and the iter shutdown is short. I suspect the _MultiWorkerIter is unable to shut down properly while data prefetching is still active.

  To fix this, I explicitly terminate the worker processes inside the shutdown function.

  2) When loading data fast (again, mostly during testing and data serving), there is a risk of a data race. The _MultiWorkerIter uses a dict to cache prefetched data, but the dict is not thread-safe for concurrent insertion and deletion, so occasionally data goes missing from it.

  To prevent this, I use a scope lock to guard the dict access.

* do not wait for the workers to join, and kill any alive workers as soon as possible
* Fix ONNX export to support multi-output graphs

* Add ONNX unit-test

* Added multi-output shape inference.

- Removed unnecessary forward_pass() call
- Modified infer_output_shape to return multiple shapes for multiple outputs as well as output names.

* Fixed pylint
* dynamic omp for dot

update heuristic

* add doc

* Update mxnet_op.h

* Update dot-inl.h
- Update notebook to avoid divide by 0 causing a warning.
- Add MXBoard dependency.
* Minor fixes to documentation

* Updated the Maven Repository URL to point to staging repo
* fix inception-bn and training acc issue

* add parameter initialization, fix lint

* fix comparison

* change optimizer to sgd

* update sgd and update model name

* add inception_bn in jenkins build

* make max epoch an argument

* remove inception_bn test

* trigger ci

* remove ci test

* trigger ci
* add instruction to get the data and fix typo

* fix typo

* update file name

* trigger CI

* add unit_test for unit_test_mlp_csv

* add mlp_csv to jenkinsfile

* revert jenkinsfile to another PR

* trigger CI

* trigger CI
* Fix scaladoc and javadoc errors

* Stop on errors starting on scala 1.3.x build
…420)

* Adding Java to ubuntu setup install page and minor fixes to other java api docs

* Improving javadoc for java-api predictor class

Mostly documentation changes
* randint operator add along with add optional tag to params

* register param

* lint space issue

* randn issue fix

* uniform_int_distribution doesn't support int8, uint8 fix

* dtype ftype

* ftype to dtype - invalid template arg

* fix template arg issue

* test with int dtype for windows

* removed int8,uint8 from test

* gpu implementation

* gpu engine state diff

* removed gpu support

* empty commit

* temporary fix : batchnorm flaky test skip

* removed randn symbol specific code since other PR is on it

* revert ndarray/randn for compatibility

* added unit test for checking extremes and uniform distribution for sufficient samples

* increased the high val

* int32 to int64 support, indentation fix, check for optype correctly based on type of random function

* gpu support, revert finfertype using template specialization, remove defaults, prints, test other low high val

* fix for invalid template arg by checking for int32,int64

* gpu randint in random_generator

* sample_uniform issue and param, removed old flaky test skip line

* replaced discrete_uniform function by rand_int64 for consistency

* formula update and removed itype

* change ctx to include gpu, fix randint sample_op.cu typo

* trigger ci

* doc fix, check fix, whitespace remove

* added the without dtype testcase
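The extremes/uniform-distribution check mentioned in these commits can be sketched like this, using NumPy's randint as a stand-in for the new MXNet random.randint operator (the helper name is illustrative):

```python
import numpy as np

def check_randint(low, high, n=100000, seed=0):
    """Check that integer samples stay in [low, high) and cover every value."""
    rng = np.random.RandomState(seed)
    samples = rng.randint(low, high, size=n)   # high is exclusive
    # extremes: no sample may fall outside the half-open interval
    assert samples.min() >= low and samples.max() < high
    # with sufficiently many samples, every value in [low, high) should
    # appear, and the counts should be roughly uniform
    counts = np.bincount(samples - low, minlength=high - low)
    assert (counts > 0).all()
    return counts
```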
* fix on ubuntu

* add readme instruction

* fix intellij Tutorials

* fix intelliJ tutorial

* fix the document

* update demo

* revert the change on intelliJ tutorial

* fix make process

* fix documentation
* Add quantized concat

* Fix non-mkldnn build

* Add size check for MKLDNNQuantizedConcatForward

* use all capital for constant

* Rename constant with Google C++ style.

* Address apeforest comments

* Address apeforest comments

* fix lint

* Add frontend interface.

* Retrigger CI
* Add ARMv7 builds to dev_menu.py

* Add Python3 CPU Intel MKLDNN unittests to dev_menu
lanking520 and others added 20 commits January 10, 2019 19:56
* fix Makefile for rpkg

* update R and roxygen2 requirements

* add roxygen requirement

* add roxygen requirement
* Prevent timeouts when rebuilding containers with docker.
Increase timeout from 120 to 180 for pipelines

* Increase docker cache timeout

* Increase timeout also for docs

* limit parallel builds to 10
…y example (#12498)

* example testcase modified

* rcnn file add

* license add

* license init

* CI test trigger

* rcnn modify give up

* trigger

* modify for better user experience

* change the default parameter to xpu=None

* Update bdk_demo.py

* Update fcn_xs.py

* Update test.py

* Update train.py

* Update bdk_demo.py

* Update bdk_demo.py

* modify review comments

* refine

* modify Readmes according to the changed code.

* finetune READMEs

* re-trigger ci

* re-trigger ci twice
* add fallback for gpu topology detection using CUDA 9.2

* add fallback for gpu topology detection using CUDA 9.2

* add log

* update 3rdparty to master

* add fallback for gpu topology detection using CUDA 9.2

* add log

* update 3rdparty to master

* bring 3rdparty packages to upstream/master

* rebase to master

* Update gpu_topology.h
* change object detection prediction to be a map

* change predictions to a map for image-classifiers

* change return types of the classifiers to be a map
- add tests for base classifier and with-ndarray as well

* tweak return types and inputs for predict
- add test for plain predict

* updated infer-classify examples

* adjust the infer/object detections tests

* tweak predictor test

* Feedback from @kedarbellare review

* put scaling back in

* put back predict so it can handle multiple inputs

* restore original functions signatures (remove first)
* Modifying clojure CNN text classification example

* Small fixes

* Another minor fix
* adding tolerance

* retrigger ci

* retrigger ci
This is a regression from adding an @rpath name to libmxnet.so on Mac:
the example executable is no longer able to find libmxnet.so.
Add an @rpath search path to fix this issue.
* Fix launch bounds in spatial transformer

* Adding explanation in comment.
* Adding Scala Demo to be run as a part of Nightly CI

* Addressed PR feedback : making a profile to fetch nightly jars only on CI

* Changed name from scalacidemo to scala_ci_demo

* Synchronized the scala-demo and java-demo for nightly CI runs

* Pruned the maven command to simply maven install

* changed running from ./.sh to bash .sh to be consistent
* fix ssd quantization script error

* update readme for ssd

* move quantized SSD instructions from quantization/README.md to ssd/README.md

* update ssd readme and accuracy

* update readme for SSD-vGG16
- update mkldnn and mshadow to version used by upstream master
- update ngraph-mxnet-bridge to current master

Renames nGraph README to follow MXNet conventions.
@cconvey requested a review from mbrookhart January 17, 2019 20:51

@mbrookhart (Contributor) left a comment:

Looks like the cmake build got lost in the merge?

(Comment on CMakeLists.txt; outdated, resolved)
@cconvey cconvey merged commit 8b3f6a6 into master Jan 28, 2019
@cconvey cconvey deleted the cconvey/mfi2 branch January 28, 2019 22:46