
Visualizing hyperparameter tuning results for arbitrary numbers of parameters #416

Closed · ablaom opened this issue Jan 3, 2020 · 11 comments


ablaom commented Jan 3, 2020

Suggestion of @baggepinnen, copied from #85:

I find it quite useful to visualize hyper-parameter tuning for more than two parameters as well. Simply plotting each parameter's sampled values against the objective function values is a reasonable way of presenting the information. You can't determine interactions between parameters from this, but you can see overall trends for individual parameters, and it quickly becomes apparent if one parameter is much more important than the others.
Example: https://github.com/baggepinnen/Hyperopt.jl/blob/master/figs/ho.svg
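
For concreteness, a minimal sketch of this kind of plot in Julia, assuming the tuning history is available as a vector of (params, loss) named tuples; the data layout and the plot_tuning_history function are hypothetical, not MLJ API:

using Plots

# One scatter panel per hyper-parameter: sampled values vs. measured loss.
function plot_tuning_history(history)
    pnames = keys(first(history).params)
    panels = [scatter([h.params[n] for h in history],
                      [h.loss for h in history];
                      xlabel=string(n), ylabel="loss", legend=false)
              for n in pnames]
    plot(panels...; layout=length(panels))
end

# Fake three-parameter tuning history with 50 evaluations:
history = [(params=(eta=rand(), depth=rand(1:10), lambda=10.0^(-3rand())),
            loss=rand()) for _ in 1:50]
plot_tuning_history(history)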


ablaom commented Jan 3, 2020

This sounds like a good suggestion to me.


azev77 commented Jan 6, 2020

@ablaom @tlienart
While we're discussing tuning, it would be awesome if MLJ borrowed the best features of the caret interface (and tried to avoid its mistakes).

  1. Each model in caret (it has 238 of them) has a default grid for each hyper-parameter. In some cases the default grid is a function of the number of features.
    Example: glmnet (= elastic net) has two hyper-parameters:
    alpha (default grid: 0.1, 0.55, 1)
    lambda (default grid of 3 values)

If you run:

m <- train(Species ~ ., method = "glmnet", data = trainSet)
m
plot(m)

you automatically get scores (Accuracy/Kappa) for all 9 combinations of alpha and lambda.
The plot shows 3 curves, one per value of lambda.

  2. Users in caret can, of course, supply their own grids.

  3. caret has a cool option called tuneLength, which lets users set the number of grid values tried for each hyper-parameter.

m <- train(Species ~ ., method = "glmnet", data = trainSet, tuneLength = 5)
m
plot(m)

This automatically generates a grid with 5 values of alpha and 5 values of lambda, giving a total grid of 25 points.

I'm not sure if this plot option from caret is what @baggepinnen had in mind, but I kinda like it.

I'd love to see an option like tuneLength in MLJ, but you could also include a smarter option.
tuneLength = 20 produces 20 values for each hyper-parameter, so for a model with H hyper-parameters the grid has 20^H points, which can be too big.

Perhaps you could also include something like
tuneLength2 = 20, which produces a grid with 20 points in total. (A sketch of both options follows below.)
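
For reference, here is a sketch of how both options might look with MLJ's grid tuning. The keyword names resolution (values per parameter) and goal (approximate total number of grid points) follow MLJTuning's Grid strategy, but treat the exact API as an assumption here:

using MLJ

Tree = @load DecisionTreeClassifier pkg=DecisionTree
tree = Tree()
r1 = range(tree, :max_depth, lower=1, upper=20)
r2 = range(tree, :min_samples_split, lower=2, upper=10)

# analogue of tuneLength: 20 values per parameter, hence 20^2 grid points
t1 = TunedModel(model=tree, ranges=[r1, r2], measure=cross_entropy,
                resampling=CV(nfolds=3), tuning=Grid(resolution=20))

# analogue of the proposed tuneLength2: roughly 20 grid points in total
t2 = TunedModel(model=tree, ranges=[r1, r2], measure=cross_entropy,
                resampling=CV(nfolds=3), tuning=Grid(goal=20))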


azev77 commented Jan 6, 2020

Here is the code:

library(caret); data("iris"); set.seed(123)

# 75/25 train/test split
my_index <- createDataPartition(iris$Sepal.Length, p = 0.75, list = FALSE)
trainSet <- iris[my_index, ]
testSet  <- iris[-my_index, ]

# default grids are defined here:
# https://github.com/topepo/caret/blob/master/models/files/glmnet.R
getModelInfo("glmnet")

# default tuning: alpha in {0.1, 0.55, 1}, 3 default values of lambda
set.seed(123)
m <- train(Species ~ ., method = "glmnet", data = trainSet)
m         # Accuracy/Kappa for each alpha/lambda combination
plot(m)

# tuneLength = 5: 5 values per hyper-parameter, 25 grid points in total
set.seed(123)
m <- train(Species ~ ., method = "glmnet", data = trainSet, tuneLength = 5)
m
plot(m)


ablaom commented Jan 9, 2020

@azev77 The key challenge for us would be setting up default grids for our existing models. Is it possible to scrape a list of default grids for each caret model? This could be quite useful for MLJ devs.

(Although, in the case of numeric parameters, I am proposing we specify default ranges (ParamRange objects), which (roughly) specify the search space without specifying the resolution. These are bounded intervals or, in the semi-bounded case, an upper/lower limit plus an "origin" and "unit". From these, either grids or pdfs can be constructed, depending on further parameters appropriate to the particular tuning strategy: random, Latin hypercube, Bayesian, and so forth.)
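
In range syntax, that proposal looks roughly like the following sketch; the semi-bounded form with origin and unit is the part being proposed, so treat the details as illustrative:

using MLJ

Ridge = @load RidgeRegressor pkg=MLJLinearModels
model = Ridge()

# Bounded: the search space is the interval [1e-4, 1.0], on a log scale.
r_bounded = range(model, :lambda, lower=1e-4, upper=1.0, scale=:log10)

# Semi-bounded: a lower limit only, plus an "origin" and "unit" from which
# a tuning strategy can later derive either a grid or a pdf.
r_semi = range(model, :lambda, lower=0, upper=Inf, origin=1.0, unit=1.0)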


azev77 commented Feb 4, 2020

@ablaom @tlienart
I forgot to mention: some other ML interfaces also have a time-limit option.
For example, suppose I want to train 45 regression models overnight; I can set timelimit = 60 minutes so that no more than 60 minutes is spent tuning hyper-parameters etc. for any single model.


ablaom commented Feb 4, 2020

@azev77 Thanks for that. Suggestion noted.

As tuning is iterative, its control is to be externalised: the plan is to implement any kind of control of any iterative model (including the TunedModel wrapper) through a common API. Other controls of this kind are stopping criteria and incremental serialisation of results.

See here: #139


azev77 commented Feb 4, 2020

Btw, just to be clear, I don't mean a time limit just for tuning; I mean a time limit for training in general.
Suppose I'm training 45 regression models overnight: I don't want the computer to spend more than timelimit = 60 minutes on any one of those models.
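
As far as I know there is no such option in MLJ today; purely to illustrate the idea, here is a hypothetical sketch that stops waiting for a fit after a wall-clock limit. The helper is not MLJ API, and the abandoned task keeps running in the background, so a real implementation would need cooperative cancellation inside fit!:

using MLJ

function fit_with_timeout!(mach; limit=60*60)
    task = @async fit!(mach)      # train asynchronously
    t0 = time()
    while !istaskdone(task) && time() - t0 < limit
        sleep(1)                  # poll once a second
    end
    istaskdone(task) || @warn "time limit of $(limit)s reached; abandoning fit"
    return mach
end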

baggepinnen commented

Facebook AI has a new take on hyper-parameter visualization:
https://ai.facebook.com/blog/hiplot-high-dimensional-interactive-plots-made-easy/


azev77 commented Jun 8, 2020

Hey guys, my conversation with @yalwan-iqvia about TreeParzen.jl got me thinking about HP optimization frameworks.
I wanted to tell you about Optuna (repo & paper), a new framework for HP optimization.

A nice comparison with Hyperopt shows what can be done for HP visualization:
https://neptune.ai/blog/optuna-vs-hyperopt

Here are a few snips: [two visualization screenshots from the blog post, not reproduced here]

A 3-minute clip: https://www.youtube.com/watch?v=-UeC4MR3PHM

It would be really amazing for MLJ to incorporate this!


tlienart commented Jun 8, 2020

Yeah, Optuna is cool and the team behind it is pretty solid.

This is a project in itself, though: building an Optuna.jl (with an interface to MLJ). Maybe it's worth announcing on Discourse to see if there are any takers.


ablaom commented Jun 9, 2020

@azev77 Could you please re-post this suggestion at MLJTuning.jl? Thanks
