Issue on sparse dataframe with pandas >= 0.24.0 #4

gfournier · 2019-06-07T18:46:36Z

LightGBM no more accepts Sparse DataFrame.
See discussion in microsoft/LightGBM#2143
Tested with latest LightGBM (2.2.3)

Reverted pandas to version <0.24.0 in #2

* Bump version to 0.1.0 * Change output type vectorizer (#1) * change setup * change default output type of countvectorizer to bet in32 * change dtype to numerical encoder as well + tests * add output type test on NumImputer * fix bug NumericalEncoder when new column (#4) * Block Search + other (#2) * add make_pipeline function (works like sklearn) * fix type "_if_fitted" -> "_already_fitted" * * add handling of columns_to_encode == "--object--" in target encoder * corresponding test * add Numerical encoder test for "columns_to_encode == '--object--' " * expose command argument parser outside, to be able to add new arguments. * change WordVectorizer in char mod distributions + fix bug in HyperRangeBetaInt * change default behavior : encode "columns_to_encode == '--object--' " * remove 'bug' (double return) * allow text preprocessors to concat their inputs * add 'RandomTrainTestCv' and 'IndexTrainCv' cv-like object. * same api as a regular cv object ... * ... but only one split * add 'use_for_block_search' attribute + filter models based on that * * add block search iterator * automl config : models_to_keep_block_search * fix typo in test * ignore Warning in test * Graph pipeline subgraph from dev (#3) * fix casting bug + test on filter/map function on dicos * add function to retrieve 2-uple list of edges from generic tuple edges * fix bug on DebugPassThrough * add 'get_subpipeline' methods to create sub GraphPipeline from a given GraphPipeline * add docstring get_subpipeline * Fix numerical encoder max_cum_proba (#6) * Fix bug automl group (#5) * allow reload of groups * * add average_precision default transformation * go back to default transformation if unknown * return dataframe in command * Fix dataset load from SG premises * Fix dummy encoding type in NumericalEncoder

gfournier · 2019-09-23T21:14:14Z

Fix by #15

gfournier added the bug Something isn't working label Jun 7, 2019

gfournier pushed a commit that referenced this issue Aug 7, 2019

fix bug NumericalEncoder when new column (#4)

63cd55f

gfournier pushed a commit that referenced this issue Aug 7, 2019

fix bug NumericalEncoder when new column (#4)

de28136

gfournier closed this as completed Sep 23, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue on sparse dataframe with pandas >= 0.24.0 #4

Issue on sparse dataframe with pandas >= 0.24.0 #4

gfournier commented Jun 7, 2019 •

edited

Loading

gfournier commented Sep 23, 2019

Issue on sparse dataframe with pandas >= 0.24.0 #4

Issue on sparse dataframe with pandas >= 0.24.0 #4

Comments

gfournier commented Jun 7, 2019 • edited Loading

gfournier commented Sep 23, 2019

gfournier commented Jun 7, 2019 •

edited

Loading