Consistency of min_child_weight parameter #5444

RAMitchell · 2020-03-27T04:38:15Z

The min_child_weight parameter (default value 1.0) has different effects based on scaling of objective functions. I noticed this when developing a new objective function that had a small Hessian and the tree was not able to grow with default parameters. Objectives like squared error and logistic loss will be regularised very differently as a consequence. For example using logistic loss where the hessian values can be much smaller, it can require a much larger number of training instances to split. In #2483 it is noted that the hessian in the case of logistic loss is proportional to variance, however this is not true of other objectives in general.

This is relevant to the task of finding good default parameters across a range of objectives (#4986).

One obvious solution is normalising all objective functions in some consistent way.

Another solution is deprecating min_child_weight and moving to a parameter like min_child_instances, regularising based on the amount of training data without respect to the objective function.

@trivialfis has also proposed implementing multiclass objective functions via vector leaves, if we do this the hessian will be a vector and it is not obvious how to correctly apply min_child_weight.

The text was updated successfully, but these errors were encountered:

thvasilo · 2020-06-10T18:02:34Z

This is a good point. LightGBM uses a default of 1e-3 for comparison, with a min of 20 data points per leaf.

QuantHao · 2020-08-06T02:53:50Z

I think one problem of replace min_child_weight with min_child_instances is that: how to deal with sample weight? A good questions is raised and answered at a LightGBM issue.

trivialfis mentioned this issue Aug 5, 2020

set min_child_weight as a float instead of int #5976

Closed

QuantHao mentioned this issue Aug 6, 2020

Maping min_child_weight of xgboost with min_sum_hessian_in_leaf of LightGBM #5987

Closed

trivialfis mentioned this issue May 24, 2022

[WIP] Implement min_child_samples for hist and approx. #7932

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consistency of min_child_weight parameter #5444

Consistency of min_child_weight parameter #5444

RAMitchell commented Mar 27, 2020

thvasilo commented Jun 10, 2020

QuantHao commented Aug 6, 2020

Consistency of min_child_weight parameter #5444

Consistency of min_child_weight parameter #5444

Comments

RAMitchell commented Mar 27, 2020

thvasilo commented Jun 10, 2020

QuantHao commented Aug 6, 2020