Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Long refit time of AutoSklearn models #1683

Open
AmirAlavi opened this issue Aug 25, 2023 · 1 comment
Open

Long refit time of AutoSklearn models #1683

AmirAlavi opened this issue Aug 25, 2023 · 1 comment

Comments

@AmirAlavi
Copy link

AmirAlavi commented Aug 25, 2023

A note to investigate further: We noticed extremely long refit time for AutoSklearn models. For example, if we set the time_left_for_this_task to be just 1hr, and so the per_run_time_limit is just 6minutes, we noticed that the refit could somehow take hours in some cases.

We didn't investigate it thoroughly, but I think I recall discovering that the HistGradientBoosting models would take a very long time to fit1. I know ensembling adds another layer to this, but I think we had observed this even for ensemble_size=1. @eddiebergman in #1677 illuminated that ensembling also includes all the models from different folds in cross validation.

I'm wondering if you had also noticed any performance issues with that, and if the new updates address it? (perhaps the upgrade to newer sklearn will take care of it)

Footnotes

  1. And maybe even also their prediction time is slow too. We attempted to do permutation feature importance analysis on an AutoSklearnClassifier and it was extremely slow if the classifier was the HistogramGradientBoostingTree, compared to a raw sklearn pipeline.

@eliwoods
Copy link

I am noticing this as well. Even for small models where the run time is 2 minutes and the task time is 20 minutes, it takes over an hour (sometimes multiple hours) to refit. Here are the following relevant versions:

  • Ubuntu 20.04
  • Python 3.8.10
  • auto-sklearn 0.15.0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants