
How exactly LightGBM predictions are obtained? #3571

Closed
maksymiuks opened this issue Nov 16, 2020 · 3 comments

maksymiuks commented Nov 16, 2020

Hi

First of all, I'd like to thank you for your work on this package; I consider it a great tool. I'm working on a dedicated R interface for tree-ensemble models that computes SHAP values quickly using C++ code via Rcpp, and LightGBM is one of the packages in its scope. For that I need to know how exactly the committee of trees is aggregated. From my inspection of the code, I have a hunch that the final prediction is the sum of the predictions of all trees, with some intercept subtracted. Am I correct? If so, how can I find that intercept? I wasn't able to locate it in the model object. My goal is to obtain the plain sum of the predictions of all trees.

Best Regards
Szymon Maksymiuk

@maksymiuks changed the title from "How exactly LightGBM predictions are acquire?" to "How exactly LightGBM predictions are obtained?" on Nov 16, 2020
@guolinke
Collaborator

Except for multi-class tasks, the prediction is the sum of the predictions of each tree.
For some tasks, like binary classification, there can be a transformation after the sum, such as a sigmoid.
For multi-class, you need to sum the predictions by class first (trees are organized as tree[i * K + j], where i is the iteration, j is the class id, and K is the number of classes), and then apply softmax to get the class probabilities.
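For reference, here is a minimal Python sketch that checks these statements against LightGBM's Python API (predict(raw_score=True), predict(pred_leaf=True), dump_model()). It assumes a binary objective; the synthetic scikit-learn dataset and the leaf_values helper for walking the dumped tree structure are illustrative assumptions, not part of LightGBM itself.

```python
import numpy as np
import lightgbm as lgb
from scipy.special import expit
from sklearn.datasets import make_classification

# Illustrative binary-classification data.
X, y = make_classification(n_samples=500, n_features=10, random_state=0)
booster = lgb.train({"objective": "binary", "verbose": -1},
                    lgb.Dataset(X, label=y), num_boost_round=20)

def leaf_values(node, out=None):
    # Illustrative helper: map leaf_index -> leaf_value by walking the nested
    # tree_structure returned by Booster.dump_model().
    if out is None:
        out = {}
    if "leaf_value" in node and "split_index" not in node:
        out[node.get("leaf_index", 0)] = node["leaf_value"]
    else:
        leaf_values(node["left_child"], out)
        leaf_values(node["right_child"], out)
    return out

tables = [leaf_values(t["tree_structure"])
          for t in booster.dump_model()["tree_info"]]
leaf_idx = booster.predict(X, pred_leaf=True).astype(int)  # (n_samples, n_trees)

# The raw score is the plain sum of one leaf value per tree.
manual_raw = np.array([sum(tables[t][leaf_idx[i, t]] for t in range(len(tables)))
                       for i in range(X.shape[0])])
assert np.allclose(manual_raw, booster.predict(X, raw_score=True))

# For the binary objective, the reported probability is sigmoid(raw score).
assert np.allclose(expit(manual_raw), booster.predict(X))
```

For multi-class models the same idea should apply per class: pred_leaf then returns num_iteration * num_class columns laid out as tree[i * K + j], and applying softmax across the K per-class raw sums reproduces the reported probabilities.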

@btrotta
Collaborator

btrotta commented Dec 7, 2020

I'm not sure if I'm understanding your question correctly, but I think by "intercept" you mean something like the baseline constant prediction? E.g., for a binary classification problem where the training labels are 90% ones and 10% zeros, we would start from that base rate of 0.9 (on the raw-score scale, its log-odds) and then add trees to improve accuracy. This is indeed how LightGBM works, and this constant value is added to the leaf values of the first tree. So if you use Booster.save_model() (https://lightgbm.readthedocs.io/en/latest/pythonapi/lightgbm.Booster.html?highlight=save%20model#lightgbm.Booster.save_model), the leaf values of the first tree include this baseline value.
The relevant part of the C++ code is in GBDT::TrainOneIter:

bool GBDT::TrainOneIter(const score_t* gradients, const score_t* hessians) {
In the first iteration (when gradients and hessians are nullptr), it calls BoostFromAverage, which calculates the constant initial prediction. It then fits the tree to the errors from that constant prediction, and finally calls AddBias to add the constant to the individual leaf values.
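As a rough illustration of this, here is a minimal Python sketch that trains a small regression model and inspects the dumped leaf values: the first tree's leaves should sit near the label mean (the BoostFromAverage constant plus a shrunken correction), while later trees' leaves sit near zero. The shifted make_regression dataset and the collect_leaves helper are assumptions made for the example.

```python
import numpy as np
import lightgbm as lgb
from sklearn.datasets import make_regression

# Illustrative data, shifted so the label mean is clearly non-zero.
X, y = make_regression(n_samples=500, n_features=10, noise=1.0, random_state=0)
y = y + 1000.0

booster = lgb.train(
    {"objective": "regression", "boost_from_average": True, "verbose": -1},
    lgb.Dataset(X, label=y), num_boost_round=10)

def collect_leaves(node, out):
    # Gather all leaf values from dump_model()'s nested tree_structure.
    if "leaf_value" in node and "split_index" not in node:
        out.append(node["leaf_value"])
    else:
        collect_leaves(node["left_child"], out)
        collect_leaves(node["right_child"], out)
    return out

trees = booster.dump_model()["tree_info"]
first_tree_leaves = collect_leaves(trees[0]["tree_structure"], [])
last_tree_leaves = collect_leaves(trees[-1]["tree_structure"], [])

print("label mean:            ", y.mean())                   # ~1000
print("mean leaf value, tree 0:", np.mean(first_tree_leaves))  # expected near the label mean
print("mean leaf value, last tree:", np.mean(last_tree_leaves))  # expected much closer to zero
```

In other words, no separate intercept is stored in the model object: the constant computed by BoostFromAverage is folded into the first tree's leaf values by AddBias, so a plain sum of per-tree leaf values already includes it.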

@github-actions

This issue has been automatically locked because there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues, including a reference to this one.

github-actions bot locked as resolved and limited conversation to collaborators on Aug 23, 2023