Benchmark reports logloss, but benchmark does not tell AutoML systems to optimize for logloss #3

ledell · 2020-10-05T18:22:54Z

I noticed that you're reporting logloss as the metric to evaluate systems, but you're not passing this information to any of the AutoML systems. Both auto-sklearn and H2O AutoML (maybe MLJar too?) have the ability to optimize and choose a leader model based on the metric which you want to evaluate, so this should be explicitly specified in a benchmark.

H2O AutoML has two parameters that should be set when evaluating on a non-default metric. Those are stopping_metric and sort_metric and should both be set to "logloss". More info here. By default on binary classification problems, H2O is optimized for AUC, unless you change it to logloss.
Auto-sklearn also has a metric argument which should be used and set to "logloss". More info here.

The text was updated successfully, but these errors were encountered:

pplonski · 2020-10-06T06:36:26Z

You are right. There is no metric passed. I don't remember why it wasn't set.

Anyway, I've moved MLJAR AutoML engine into open-source https://github.com/mljar/mljar-supervised (docs: https://supervised.mljar.com) and added it to openml/automlbenchmark (although I need to update the mljar-supervised version there, after adding golden features and features selection as new steps).

ledell · 2020-10-06T23:09:59Z

@pplonski I've seen the new MLJar supervised; cool to see it open sourced! I saw it's been added to the openml/benchmark too, thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmark reports logloss, but benchmark does not tell AutoML systems to optimize for logloss #3

Benchmark reports logloss, but benchmark does not tell AutoML systems to optimize for logloss #3

ledell commented Oct 5, 2020 •

edited

Loading

pplonski commented Oct 6, 2020

ledell commented Oct 6, 2020

Benchmark reports logloss, but benchmark does not tell AutoML systems to optimize for logloss #3

Benchmark reports logloss, but benchmark does not tell AutoML systems to optimize for logloss #3

Comments

ledell commented Oct 5, 2020 • edited Loading

pplonski commented Oct 6, 2020

ledell commented Oct 6, 2020

ledell commented Oct 5, 2020 •

edited

Loading