Fix SingleRowPredictor::IsPredictorEqual comparison (invert) #2799

AlbertoEAF · 2020-02-22T22:35:12Z

This fixes the issue #2798.

As stated in that issue, this should fix the bug in case the parameters change, and improve performance when they don't.

Please give me feedback if I'm doing anything in a less than desirable way.

guolinke · 2020-02-23T09:47:01Z

ping @imatiach-msft to confirm

imatiach-msft · 2020-02-24T01:58:09Z

@AlbertoEAF great catch, this should improve performance during scoring. The main bugfix that introduced this bug was changing the correctness from:
(single_row_predictor_.get() == nullptr)
to:
single_row_predictor_[predict_type].get() == nullptr
so that when calling raw predict and predict for probabilities the function would return the correct values, but it looks like now we are not re-using the cached predictor which is horrible. Sorry I introduced this performance issue after fixing the correctness issue before.

"worse, not creating a new one if conditions change"
I don't think this can happen from someone using mmlspark which is calling those APIs (since the model is immutable after it is trained) but this is definitely something that could help anyone who is using the native API directly, although I'm not aware of anyone other than mmlspark using LGBM_BoosterPredictForMatSingleRow and LGBM_BoosterPredictForCSRSingleRow. Regardless this is a great fix, and thank you for sending it out!

imatiach-msft

great catch!

imatiach-msft · 2020-02-24T01:59:22Z

also adding @eisber for context who added the original optimizations, it looks like his optimizations of improving scoring by 3-4X are missing in latest mmlspark

AlbertoEAF · 2020-02-24T07:47:18Z

Thank you @matiach-smft :)

Regarding no one else using the native API, there might be in the next months :p. If I can help a bit this project great, I'd like to learn and help more :)

Fix SingleRowPredictor::IsPredictorEqual comparison (invert)

466f035

AlbertoEAF requested review from chivee and guolinke as code owners February 22, 2020 22:35

imatiach-msft approved these changes Feb 24, 2020

View reviewed changes

guolinke approved these changes Feb 24, 2020

View reviewed changes

guolinke merged commit 60710c7 into microsoft:master Feb 24, 2020

guolinke mentioned this pull request Feb 24, 2020

Wrong implementation of SingleRowPredictor::IsPredictorEqual in c_api #2798

Closed

guolinke added the fix label Mar 1, 2020

imatiach-msft mentioned this pull request Mar 4, 2020

Mini-batch inference can improve speed of the inference stage microsoft/SynapseML#814

Closed

AlbertoEAF deleted the fix/c-api-IsPredictorEqual branch March 21, 2020 12:14

lock bot locked as resolved and limited conversation to collaborators May 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix SingleRowPredictor::IsPredictorEqual comparison (invert) #2799

Fix SingleRowPredictor::IsPredictorEqual comparison (invert) #2799

AlbertoEAF commented Feb 22, 2020 •

edited

Loading

guolinke commented Feb 23, 2020

imatiach-msft commented Feb 24, 2020

imatiach-msft left a comment

imatiach-msft commented Feb 24, 2020

AlbertoEAF commented Feb 24, 2020 •

edited

Loading

Fix SingleRowPredictor::IsPredictorEqual comparison (invert) #2799

Fix SingleRowPredictor::IsPredictorEqual comparison (invert) #2799

Conversation

AlbertoEAF commented Feb 22, 2020 • edited Loading

guolinke commented Feb 23, 2020

imatiach-msft commented Feb 24, 2020

imatiach-msft left a comment

Choose a reason for hiding this comment

imatiach-msft commented Feb 24, 2020

AlbertoEAF commented Feb 24, 2020 • edited Loading

AlbertoEAF commented Feb 22, 2020 •

edited

Loading

AlbertoEAF commented Feb 24, 2020 •

edited

Loading