check the shape for mat, csr and csc in prediction #2464

guolinke · 2019-09-27T12:02:34Z

partially improve #812

StrikerRUS · 2019-09-27T20:08:03Z

Close-reopen for CI.

StrikerRUS

Can something similar be done for predictions for data from a file?

src/c_api.cpp

StrikerRUS · 2019-09-27T21:44:41Z

Please add a simple test from the original issue:

import numpy as np
import lightgbm as lgb

x_data = np.random.rand(100, 10)
x_bad_data = np.random.rand(100, 11)
y_data =  np.random.rand(100) > .5
self.assertNotEqual(x_data.shape[-1], x_bad_data.shape[-1])
train_dataset = lgb.Dataset(x_data, y_data)
bst = lgb.train({'objective': 'binary'}, train_dataset)
with np.testing.assert_raises_regex(lgb.basic.LightGBMError,
                                    'The number of features in data*'):
    bst.predict(x_bad_data)

guolinke · 2019-09-28T03:55:03Z

There are more changes than I expected. So I think we should test all cases, including the the mat, libsvm file and CSR format.
@StrikerRUS could you help for the test cases?

StrikerRUS · 2019-09-28T12:26:33Z

@guolinke

could you help for the test cases?

Sure!

What do you think about changing the original type to avoid casting? #2464 (comment)

Does this PR already include that check?

guolinke · 2019-09-29T09:51:07Z

Does this PR already include that check?

Yeah, as the zero-based and one-based libsvm format have the different number of columns.

StrikerRUS · 2019-10-02T22:55:49Z

@guolinke I added tests in the latest commit.

As we do not support a case when data for prediction is a list of arrays, I tried to check it for validation. And it seems that validation shape is not covered by this PR, right?

bad_valid_data = train_data.create_valid(bad_X_test, label=y_test)
bst.add_valid(bad_valid_data, "valid_bad")
bst.eval_valid()  # no error risen

guolinke · 2019-10-03T15:18:10Z

yeah, validation is not converted for now.

guolinke · 2019-10-03T15:19:06Z

due the need of pr #2485, I will merge this

check the shape for mat, csr and csc

3f92c53

guolinke requested a review from chivee as a code owner September 27, 2019 12:02

guolinke changed the title ~~check the shape for mat, csr and csc~~ check the shape for mat, csr and csc in prediction Sep 27, 2019

guolinke mentioned this pull request Sep 27, 2019

Ranking model giving different results when model.txt file loaded in python. #2457

Closed

guolinke requested a review from StrikerRUS September 27, 2019 12:08

StrikerRUS closed this Sep 27, 2019

StrikerRUS reopened this Sep 27, 2019

StrikerRUS reviewed Sep 27, 2019

View reviewed changes

src/c_api.cpp Outdated Show resolved Hide resolved

src/c_api.cpp Outdated Show resolved Hide resolved

src/c_api.cpp Outdated Show resolved Hide resolved

src/c_api.cpp Show resolved Hide resolved

src/c_api.cpp Show resolved Hide resolved

guolinke and others added 5 commits September 28, 2019 10:26

guess from csr

b184edb

support file checking

a38f9ed

better error msg

22d340e

grammar

ad22c8e

clean code

39ac72b

code clean

f7e24fe

guolinke and others added 5 commits September 29, 2019 17:52

Merge branch 'master' into predict_shape_check

194e8af

check range for CSR

6d1b87f

Update test_.py

1a3e67d

Update test_.py

ac981fb

added tests

4b1f0c0

StrikerRUS force-pushed the predict_shape_check branch from 3dc9c08 to 4b1f0c0 Compare October 2, 2019 22:47

guolinke merged commit dee7215 into master Oct 3, 2019

StrikerRUS deleted the predict_shape_check branch October 3, 2019 18:27

StrikerRUS mentioned this pull request Oct 3, 2019

[python package]: suggestion: lgb.Booster.predict() should check that the input X data makes sense #812

Closed

guolinke mentioned this pull request Jan 6, 2020

2.3.1 does not handle model.predict("file.svm") well if the file has a smaller cardinality of features than what the model was trained with #2668

Closed

lock bot locked as resolved and limited conversation to collaborators Mar 10, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

check the shape for mat, csr and csc in prediction #2464

check the shape for mat, csr and csc in prediction #2464

guolinke commented Sep 27, 2019 •

edited

Loading

StrikerRUS commented Sep 27, 2019

StrikerRUS left a comment

StrikerRUS commented Sep 27, 2019

guolinke commented Sep 28, 2019

StrikerRUS commented Sep 28, 2019

guolinke commented Sep 29, 2019 •

edited

Loading

StrikerRUS commented Oct 2, 2019 •

edited

Loading

guolinke commented Oct 3, 2019

guolinke commented Oct 3, 2019

check the shape for mat, csr and csc in prediction #2464

check the shape for mat, csr and csc in prediction #2464

Conversation

guolinke commented Sep 27, 2019 • edited Loading

StrikerRUS commented Sep 27, 2019

StrikerRUS left a comment

Choose a reason for hiding this comment

StrikerRUS commented Sep 27, 2019

guolinke commented Sep 28, 2019

StrikerRUS commented Sep 28, 2019

guolinke commented Sep 29, 2019 • edited Loading

StrikerRUS commented Oct 2, 2019 • edited Loading

guolinke commented Oct 3, 2019

guolinke commented Oct 3, 2019

guolinke commented Sep 27, 2019 •

edited

Loading

guolinke commented Sep 29, 2019 •

edited

Loading

StrikerRUS commented Oct 2, 2019 •

edited

Loading