
Bugs in Metrics #6731

Closed
Roffild opened this issue Feb 24, 2021 · 9 comments

Comments

@Roffild (Contributor) commented Feb 24, 2021

gamma-nloglik:

bst_float psi = 1.0;
bst_float c = 1. / psi * std::log(y/psi) - std::log(y) - common::LogGamma(1. / psi);

With psi hard-coded to 1, c is always 0.
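
As a worked check of that claim (my own expansion; note that \log\Gamma(1) = \log 1 = 0):

  c(y, \psi) = \frac{1}{\psi}\log\frac{y}{\psi} - \log y - \log\Gamma\!\left(\frac{1}{\psi}\right),
  \qquad
  c(y, 1) = \log y - \log y - \log\Gamma(1) = 0.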

logloss:

  XGBOOST_DEVICE bst_float EvalRow(bst_float y, bst_float py) const {
    const bst_float eps = 1e-16f;
    const bst_float pneg = 1.0f - py;
    if (py < eps) {
      return -y * std::log(eps) - (1.0f - y)  * std::log(1.0f - eps);
    } else if (pneg < eps) {
      return -y * std::log(1.0f - eps) - (1.0f - y)  * std::log(eps);
    } else {
      return -y * std::log(py) - (1.0f - y) * std::log(pneg);
    }
  }

std::log(1.0f - eps) == std::log(1.0f) == 0, because 1e-16f is below single-precision resolution, so the (1.0f - y) term silently drops out of the clamped branches.
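
A minimal standalone check of that observation (my own snippet, not XGBoost code); it also shows the finite value the clamp substitutes for an otherwise infinite log(0):

  #include <cmath>
  #include <cstdio>

  int main() {
    const float eps = 1e-16f;
    std::printf("%d\n", (1.0f - eps) == 1.0f);   // prints 1: eps is below float precision
    std::printf("%f\n", std::log(1.0f - eps));   // prints 0.000000
    std::printf("%f\n", -std::log(eps));         // ~36.84, used in place of an infinite log(0)
    return 0;
  }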

gamma-deviance needs to be removed because the formula is not correct! (#6728)

I don't understand what math formula was used in poisson-nloglik.
-log( Poisson_regression )

tweedie-nloglik is also unclear, and a test for it is missing.

Tests for regression metrics with weights (#6729).

If metrics are used in forest creation...

@Roffild mentioned this issue Feb 24, 2021
@Roffild (Contributor, Author) commented Feb 24, 2021

I wanted to use the metrics from XGBoost for PyTorch models.

But for now I only use sklearn.metrics for all models!

@trivialfis (Member)

I need to take a closer look.

@trivialfis (Member)

The bug in gamma deviance is fixed. Better documentation for other metrics will be a different topic. Thanks for raising the issue!

@trivialfis (Member) commented Mar 20, 2021

Just a quick note for everyone who has been following this thread. I believe these metrics and objectives are derived from the generalized linear model.

@Roffild (Contributor, Author) commented Mar 20, 2021

All metrics are calculated for each result separately.

A loss is calculated for each individual result, but a metric should be calculated over the entire result matrix. Therefore, the metrics in XGBoost are approximate.

@Roffild (Contributor, Author) commented Mar 20, 2021

pytorch/pytorch#22439

@trivialfis (Member)

"I don't understand what math formula was used in poisson-nloglik."

See the "Evaluating the Poisson distribution" section of https://en.wikipedia.org/wiki/Poisson_distribution
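
For reference, the per-row quantity described there is the Poisson negative log-likelihood -log P(y | λ) = λ − y·log λ + log Γ(y + 1), with λ the predicted mean. A minimal sketch (my own code and naming, with an eps guard assumed; not the XGBoost source):

  #include <algorithm>
  #include <cmath>

  // -log P(y | lambda = py) = py - y * log(py) + log Gamma(y + 1)
  float poisson_nloglik_row(float y, float py) {
    const float eps = 1e-16f;   // guard against log(0)
    py = std::max(py, eps);
    return py - y * std::log(py) + std::lgamma(y + 1.0f);
  }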

@trivialfis (Member) commented Mar 22, 2021

gamma-nloglik:

Yeah, this one is a bit confusing. I tracked down the PR for -log(\gamma): #1369, which hard-coded the dispersion to 1. Not entirely sure why.

Original PR for adding gamma regression: #1258.
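
For context, my reading of the quoted code is the standard exponential-dispersion form of the gamma negative log-likelihood (an assumption about the intended formula, not a statement about the current source):

  -\log f(y;\mu,\psi) = -\frac{y\theta - b(\theta)}{a(\psi)} - c(y,\psi),
  \qquad
  \theta = -\frac{1}{\mu}, \quad b(\theta) = -\log(-\theta), \quad a(\psi) = \psi,

with c(y, \psi) as in the snippet at the top of this issue. Hard-coding \psi = 1 makes c vanish, and the per-row value reduces to y/\mu + \log\mu, i.e. the negative log-likelihood of an exponential distribution with mean \mu.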

@trivialfis (Member)

The weird logloss you see is just a way to work around numerical issues.
