Sync with Branch 0.18 #21

daxiongshu · 2020-12-28T02:30:45Z

No description provided.

* splitting `cpp/src/metrics.cu` into separately compiled files * updated CHANGELOG.md * file-naming cleanup from camelCase to under_score * adding related changes from PR rapidsai#3072 that affected this PR

* Speeding up MNMG KNN Cl&Re testing * Update changelog * Testing with extreme values

Fixes rapidsai#3057 Co-authored-by: Corey J. Nolet <cjnolet@users.noreply.github.com>

* Use single random seed in kmeans tests * Prune redundant kmeans parameterization tests * Update changelog * Add extra k-means|| test Co-authored-by: Dante Gama Dessavre <dante.gamadessavre@gmail.com>

* Speed up test_lightgbm * Speed up test_fil_regression * Update changelog * Test FIL predict() with binary classifier * Add a TODO comment * Explicitly indicate skipped tests in test_fil_skl_classification * Test n_classes=25 with n_estimators=1 * Address reviewer's feedback * Fix style

…e to underscore format (rapidsai#3065) * splitting `cpp/src/metrics.cu` into separately compiled files * updated CHANGELOG.md * file-naming cleanup from camelCase to under_score * refactoring randIndex instances to rand_index * refactored `silhouetteScore` instances to `silhouette_score` * refactoring all `adjustedRandIndex` and `adjustedrandindex` to `adjusted_rand_index` * adjusted_rand_index more fixes * refactored `klDivergence` instances to `kl_divergence` * refactoring `mutualInfoScore` instances to `mutual_info_score` * refactoring `homogeneityScore` instances to `homogeneity_score` * refactoring `completenessScore` instances to `completeness_score` * refactoring `vMeasure` instances to `v_measure` * refactoring `pairwiseDistance` and related instances to `pairwise_distance` * preserving camelcase in relevant places * rand_index refactoring further nooks and corners * updating CHANGELOG.md * FIX clang-format fixes * flake8 fix * adding related changes from PR rapidsai#3072 that affected this PR * resolving function name conflicts in the cython layer * adding a `cython_` prefix to cython headers wherever conflicted * updating appropriately in `__init__.py` files
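
The renames listed above follow one mechanical pattern, which can be sketched in a few lines of Python. This is only an illustration of the naming convention, not the tooling used in the PR; the helper name `camel_to_snake` is hypothetical.

```python
import re


def camel_to_snake(name: str) -> str:
    """Insert an underscore before each interior capital letter, then lowercase.

    Hypothetical helper for illustration only. It splits on every capital,
    so a run of capitals (an acronym) would be split letter by letter.
    """
    return re.sub(r"(?<!^)(?=[A-Z])", "_", name).lower()


# Identifiers taken from the commit list above.
for old in ["randIndex", "silhouetteScore", "adjustedRandIndex",
            "klDivergence", "vMeasure", "pairwiseDistance"]:
    print(old, "->", camel_to_snake(old))
```

A purely mechanical rule like this would also rewrite names that the commit list says were deliberately left in camelCase ("preserving camelcase in relevant places"), so any real rename still needs manual review.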

* ENH speed test_array * DOC Added entry to changelog

* Speedup umap MNMG tests by lowering data sizes and removing parameters to test * Removing accidental change * Updating changelog Co-authored-by: Dante Gama Dessavre <dante.gamadessavre@gmail.com>

…not fit with probability=True [skip-ci] (rapidsai#3114) * Fixed typo in the AttributeError message (line 464): `with` at the end of the second line and `probability` at the beginning of the third line did not have a space between them. * Update CHANGELOG.md

* FIX Fix memset args for benchmark * DOC Update changelog

* Adding ability to build with --linetrace=1 to support cython codecov * Adding PR to CHANGELOG * Style cleanup * Converting BUILD_PYTHON_ARGS to be an argument in build.sh

* Update README * UPDATE changelog * Apply suggestions from code review Co-authored-by: Dante Gama Dessavre <dante.gamadessavre@gmail.com> Co-authored-by: Nanthini Balasubramanian <nathanb@nvidia.com>

* Return Python string from dump_as_json() of RF * Add changelog

…3117) * Patch and test for RF crash rapidsai#3107 * Cleanups of RF regression fixes * Add failing tests to RF regression * Expand experimental backend testing and align pointers * Expand python RF regression test * Updates based on review feedback * Update changelog * Add classification tests * Review comments and style fixes for RF

* draft 1 of better test parameter specification * refactor using variadic macros; move fil enums to own namespace * changelog; fixed fil.pyx enum import * simpler FIL_TEST_PARAMS macro, remove the ::enums:: changes * leaner change * renamed struct responsible for non default FIL test parameters * style

…apidsai#2956) * Change get_params and set_params to a property params * Update deprecated docstring * Update changelog * Fix style * Change ARIMA parameters into cuML arrays, write variant of llf to avoid unnecessary memory copies, rename setter/getter, override get_params and set_params with NotImplementedError * Mark get_param_names as not implemented instead of get_params and set_params * Cleanup PR, remove redundancy, more efficient pack/unpack * Fix Python style Co-authored-by: John Zedlewski <904524+JohnZed@users.noreply.github.com>

…ai#3096)

rapidsai#3134) * Improving the deprecation message formatting in pydocs * Adding PR to CHANGELOG

…ators [skip-ci] (rapidsai#3040) * Adding additional checking for incorrect use cases. Added CumlArrayDescriptor * Cleaning up more use cases * Initial commit of CumlArrayDescriptor in PCA * Incrementally updating CumlArray uses * Adding some improvements to decorators to auto detect certain scenarios where a function returns CumlArray * Adding internals.func_utils to test wrapping all functions and checking output types * Commit before merging upstream * Updating naive_bayes * Partial working state * Updating KMeans * Partial pass over all Base subclasses * Mostly complete pass of removing to_output * Completed cleanup of Base method removal * Cleaning up more to_output uses. Fixing test errors * Adding target_arg property and fixing tests that can use it * More cleanup and test fixing * Updating types derived from Base to properly use get_param_names and allow setting Base values in constructor * Fixing import order. Adding support for sparse arrays * Attempting to fix nearest neighbors * Removing commented code * Fixing failing tests * Fixing more tests * Adding PR to CHANGELOG and style fixes * Fixing missing import * Removing protocol interface for python 3.7 * Fixing ARIMA. Required including changes from PR#2956 * Fixing labelbinarizer and KNN failing tests * Removing "invalid syntax" so flake8 can run * Adding more wrappers to ARIMA so tests pass. * Committing CI change to allow tests to run. * Moving memory check to plugin * Adding ability to load SPD environment variables to the logger * Changing pytest import-mode to better support development * Changing relative imports to absolute * Adding first iteration of dev guide to see how it looks * Improving the quick_run plugin * Removing skip_* from cuml decorators * Fixing cuml_decorators test.
* Removing the logger environment addition * Updating non-Base methods to use decorators * Large cleanup of remaining to_output, with_cupy_rmm and input_to_dev_ptr * Style cleanup * Apply John's suggestions from code review on Dev Guide Co-authored-by: John Zedlewski <904524+JohnZed@users.noreply.github.com> * Large update to Estimator Guide incorporating feedback from JohnZ * Removing array tracking and putting in plugin * Removing PR Description file * Removing ArrayOutputable * Removing test plugins * Cleaning up code to remove unnecessary diffs * Style cleanup * Defaulting to cp array instead of np, per feedback * Adding additional tests * Separating func_tools into separate files * Removing extra changes to conftest.py which should not have been committed. * Renaming base.py back to base.pyx * Apply suggestions from code review Co-authored-by: Dante Gama Dessavre <dante.gamadessavre@gmail.com> * Incorporating feedback from Dante's code review * Removing straggling TODO * Applying Dante's Revisions to ESTIMATOR_GUIDE Co-authored-by: Dante Gama Dessavre <dante.gamadessavre@gmail.com> * Updating ESTIMATOR_GUIDE from feedback from Dante * Cleaning up straggling to_output * Another iteration on code review feedback * Style cleanup * More small items from code review * One final change to ESTIMATOR_GUIDE * Updating all *_mg.pyx files to use the new naming conventions and CumlArrayDescriptor Co-authored-by: John Zedlewski <904524+JohnZed@users.noreply.github.com> Co-authored-by: Dante Gama Dessavre <dante.gamadessavre@gmail.com>
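
For readers unfamiliar with the descriptor approach this commit list refers to, the general Python mechanism can be sketched as follows. This is a toy illustration under stated assumptions, not cuML's `CumlArrayDescriptor`: the class names `ConvertingArray` and `ToyEstimator`, the `output_type` values, and the list/tuple conversion are all hypothetical stand-ins for array types.

```python
class ConvertingArray:
    """Toy descriptor: store a value once, convert it on every read.

    Hypothetical illustration of the Python descriptor protocol; it is
    not the cuML implementation.
    """

    def __set_name__(self, owner, name):
        # Private attribute on the instance that holds the raw value.
        self._slot = "_" + name

    def __get__(self, obj, objtype=None):
        if obj is None:
            return self  # class-level access returns the descriptor itself
        raw = getattr(obj, self._slot)
        # Pick the converter from the owner's (assumed) output_type setting.
        converter = {"list": list, "tuple": tuple}[obj.output_type]
        return converter(raw)

    def __set__(self, obj, value):
        # Always store one canonical form, whatever the caller passed in.
        setattr(obj, self._slot, tuple(value))


class ToyEstimator:
    labels_ = ConvertingArray()

    def __init__(self, output_type="list"):
        self.output_type = output_type

    def fit(self, data):
        self.labels_ = [x % 2 for x in data]  # goes through __set__
        return self


est = ToyEstimator(output_type="tuple").fit([1, 2, 3])
print(est.labels_)       # converted on read according to output_type
est.output_type = "list"
print(est.labels_)       # same stored value, different output form
```

The point of the pattern is that conversion happens at attribute access, so individual methods no longer need an explicit conversion call on each returned attribute.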

* Update all DistanceType references * Style fix * Update changelog

…rapidsai#3069) * Maintain dataframe output for single-series frames * Add unit test for single-series input type check * Update changelog * Add test for Series to DataFrame preprocessing * Handle output from preprocessors increasing dims * Allow norms to be returned as Series

* Fix Stochastic Gradient Descent Example: the example currently in the docs does not run because dtype, penalty, lrate, and loss are not defined. This new version sets default values for the parameters of cumlSGD and copies Mini Batch SGD Regression's dtype for pred_data['col1'] and pred_data['col2']. Running the example also produced slightly different output values, so these were updated as well. * Added PR rapidsai#3136 to 0.17 Bug Fixes

sync with upstream

`#include <cuml/manifold/umap.hpp>` works now. Co-authored-by: Corey J. Nolet <cjnolet@users.noreply.github.com>

…3137) * Moving conftest.py files around and adding quick_run plugin * Adding PR to CHANGELOG * Incorporating feedback from code review

* Initial cython test commit * Update changelog * Style fixes Co-authored-by: Nanthini Balasubramanian <nathanb@nvidia.com> Co-authored-by: Dante Gama Dessavre <dante.gamadessavre@gmail.com>

…precation warnings (rapidsai#3155) * Get rid of warnings in random projections test * Update changelog * Fix style * Update other deprecated make_blob imports

* FIX Force local install by specifying exact build string * DOC Update changelog * Update ci/gpu/build.sh Co-authored-by: AJ Schmidt <ajschmidt8@users.noreply.github.com>

* Update flake8 config to join python/cython configuration and improve setup to check __init__.py files * Fixing linting issues in previously ignored __init__.py files * Adding PR to CHANGELOG * Incorporating feedback from code review * Fixing style issues after merge with branch-0.17 Co-authored-by: Corey J. Nolet <cjnolet@users.noreply.github.com> Co-authored-by: Dante Gama Dessavre <dante.gamadessavre@gmail.com>

…kip-ci] (rapidsai#3144) * Adding ability to set arbitrary cmake flags in ./build.sh via the $CUML_ADDL_CMAKE_ARGS variable * Adding PR to CHANGELOG * Adding more help info requested from code review. Co-authored-by: John Zedlewski <904524+JohnZed@users.noreply.github.com>