Add `history_additions` to resolve #202 at least for grid searches #205

dpaetzel · 2024-02-12T08:38:22Z

This adds the `history_additions` option to `TunedModel`. It lets you specify a function `f` that is called on each set of folds during tuning as `f(model, fitted_params_per_fold)`, where

- `model` is the configuration the tuner is currently looking at, and
- `fitted_params_per_fold` is the vector of `fitted_params(mach)` for each `mach` trained during resampling (e.g. this has 5 entries if 5-fold CV is used).

This closes #202 to some extent: for searches that do not adjust their search space based on their trajectory (e.g. Grid and LatinHypercube), it allows optimizing with respect to functions that are not exclusively based on predictive performance. For example:

using MLJTuning
using MLJ
DTRegressor = @load DecisionTreeRegressor pkg = DecisionTree verbosity = 0
using DecisionTree: DecisionTree

N = 800
X, y = rand(N, 5), rand(N)
X = MLJ.table(X)

model = DTRegressor()

space = [
    range(model, :max_depth; lower=1, upper=5),
    range(
        model,
        :min_samples_split;
        lower=ceil(0.001 * N),
        upper=ceil(0.05 * N),
    ),
]

function histadds(model, fitted_params_per_fold)
    return DecisionTree.depth.(getproperty.(fitted_params_per_fold, :raw_tree))
end

struct HistoryAdditionsSelection <: MLJTuning.SelectionHeuristic end

function MLJTuning.best(::HistoryAdditionsSelection, history)
    # Compute the mean of the depths stored in `history_additions`.
    scores = mean.(getproperty.(history, :history_additions))
    # Within this contrived example, the best hyperparametrization is the one
    # resulting in the least mean depth.
    index_best = argmin(scores)
    return history[index_best]
end

# Let's pirate some types. Julia, please forgive me.
function MLJTuning.supports_heuristic(
    ::LatinHypercube,
    ::HistoryAdditionsSelection,
)
    return true
end

modelt = TunedModel(;
    model=model,
    resampling=CV(; nfolds=3),
    tuning=LatinHypercube(; gens=30),
    range=space,
    measure=mae,
    n=10,
    history_additions=histadds,
    selection_heuristic=HistoryAdditionsSelection(),
)

macht = machine(modelt, X, y)
MLJ.fit!(macht; verbosity=1000)
display(getproperty.(report(macht).history, :history_additions))

ablaom · 2024-02-21T22:23:26Z

Thanks for this. The sample application is very helpful. This can probably be adapted for documentation.

I think we are being too specific with the signature, and are unnecessarily ruling out some other possible use cases. Rather than `history_additions(model, fitted_params_per_fold)`, can we widen the signature to `history_additions(model, E)`, where `E` is the `PerformanceEvaluation` object returned by `evaluate`?
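To make the proposal concrete, here is a minimal sketch of how the depth callback from the example above might look under the wider signature. It is not MLJTuning code: `MockEvaluation` and `mock_depth` are hypothetical stand-ins (for `PerformanceEvaluation` and `DecisionTree.depth` respectively) so that the snippet runs without any packages, and the only assumption about `E` is that it exposes `fitted_params_per_fold`.

```julia
# Hypothetical stand-in for PerformanceEvaluation; only the one field
# used by the callback is modelled here.
struct MockEvaluation
    fitted_params_per_fold::Vector
end

# Hypothetical stand-in for DecisionTree.depth, operating on mock trees.
mock_depth(tree) = tree.depth

# Callback with the proposed wider signature: it receives the whole
# evaluation object and picks out whatever it needs.
function histadds(model, E)
    return mock_depth.(getproperty.(E.fitted_params_per_fold, :raw_tree))
end

# Two folds with made-up tree depths.
E = MockEvaluation([(raw_tree = (depth = 3,),), (raw_tree = (depth = 5,),)])
depths = histadds(nothing, E)
```

With this shape, a callback interested in something other than fitted parameters could read a different property of `E` without any change to the option itself.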

Otherwise, this looks good to me.

codecov · 2024-02-21T22:29:23Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 87.40%. Comparing base (e3293dc) to head (5f4b83d).
Report is 17 commits behind head on dev.

❗ Current head 5f4b83d differs from pull request most recent head e2cbcd6. Consider uploading reports for the commit e2cbcd6 to get more accurate results

Additional details and impacted files

@@            Coverage Diff             @@
##              dev     #205      +/-   ##
==========================================
+ Coverage   86.44%   87.40%   +0.96%     
==========================================
  Files          13       13              
  Lines         649      659      +10     
==========================================
+ Hits          561      576      +15     
+ Misses         88       83       -5


ablaom · 2024-02-22T01:52:51Z

Or, we could just insist that a history entry either:

- includes every property that `PerformanceEvaluation` objects include, or
- (breaking) includes a single property `evaluation` whose value is the `PerformanceEvaluation` object.

Thoughts, @dpaetzel?

dpaetzel · 2024-02-27T13:59:14Z

> includes every property that PerformanceEvaluation objects include

As in copy all properties of `E = evaluate(resampling_machine)` into the history entry (as opposed to only `E.measure`, `E.measurement` and `E.per_fold`)?

> (breaking) includes a single property evaluation with value the PerformanceEvaluation object.

Why would this be breaking? This seems to me to be no more breaking than the first proposal (where we add more than just a single property called `evaluation`)?

dpaetzel · 2024-02-27T14:00:02Z

(I'm with you on allowing more than just my use case.)

ablaom · 2024-02-29T02:02:09Z

> As in copy all properties of E = evaluate(resampling_machine) into the history entry (as opposed to only E.measure, E.measurement and E.per_fold)?

Yes.

> Why would this be breaking? This seems to me to be no more breaking than the first proposal (where we add more than just a single property called evaluation)?

Well, it is breaking if we remove the existing properties that will now become redundant. How about we go with this option but leave the redundant properties, and I'll add an issue to remove them in the next breaking release?

dpaetzel · 2024-03-08T14:27:21Z

> Well, breaking if we remove the existing properties that will now become redundant.

Got it.

> How about we go with this option but leave the redundant properties, and I'll add an issue to remove them in the next breaking release?

I think this is a sensible way to move forward. 👍 I'll update the PR in the next few days (sorry for the delay!).

dpaetzel · 2024-03-11T15:16:47Z

I undid the many small changes and only added the `PerformanceEvaluation` object to the history entry, as a field named `evaluation`. Updated usage example:

import MLJBase: recursive_getproperty
using MLJTuning
using MLJ
DTRegressor = @load DecisionTreeRegressor pkg = DecisionTree verbosity = 0
using DecisionTree: DecisionTree

N = 300
X, y = rand(N, 3), rand(N)
X = MLJ.table(X)

model = DTRegressor()

space = [
    range(model, :max_depth; lower = 1, upper = 5),
    range(model, :min_samples_split; lower = ceil(0.001 * N), upper = ceil(0.05 * N)),
]

struct TreeDepthSelection <: MLJTuning.SelectionHeuristic end

function MLJTuning.best(::TreeDepthSelection, history)
    # Extract the depths of all folds of all history entries.
    fparams = recursive_getproperty.(history, Ref(:(evaluation.fitted_params_per_fold)))
    depths = [DecisionTree.depth.(getproperty.(fparam, :raw_tree)) for fparam in fparams]

    # Compute the mean depth over the folds of each history entry.
    scores = mean.(depths)
    # Within this contrived example, the best hyperparametrization is the one
    # resulting in the least mean depth.
    index_best = argmin(scores)
    return history[index_best]
end

function MLJTuning.supports_heuristic(::LatinHypercube, ::TreeDepthSelection)
    return true
end

modelt = TunedModel(;
    model = model,
    resampling = CV(; nfolds = 3),
    tuning = LatinHypercube(; gens = 30),
    range = space,
    measure = mae,
    selection_heuristic = TreeDepthSelection(),
    n = 5,
)

macht = machine(modelt, X, y)
MLJ.fit!(macht; verbosity = 1000)

display(report(macht).history[1].evaluation)
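
For readers without MLJ at hand, the selection logic of the updated example can be sketched on mock data. This is only an illustration under stated assumptions: the entries below are plain named tuples with made-up values, `mock_entry` and `best_by_mean_depth` are hypothetical helpers, and only the `measurement` property and the new `evaluation` field are modelled (real history entries carry more).

```julia
using Statistics: mean

# Build a mock history entry: it keeps an existing property (`measurement`)
# and adds the new `evaluation` field. All values are made up.
mock_entry(measurement, depths) = (
    measurement = [measurement],
    evaluation = (
        measurement = [measurement],
        fitted_params_per_fold = [(raw_tree = (depth = d,),) for d in depths],
    ),
)

history = [
    mock_entry(0.25, [3, 4, 5]),
    mock_entry(0.30, [1, 2, 3]),
    mock_entry(0.28, [2, 2, 5]),
]

# Same idea as `TreeDepthSelection` above, minus the MLJ types: pick the
# entry whose trees have the least mean depth across folds.
function best_by_mean_depth(history)
    scores = map(history) do entry
        mean(fp.raw_tree.depth for fp in entry.evaluation.fitted_params_per_fold)
    end
    return history[argmin(scores)]
end

best = best_by_mean_depth(history)
```

Note that the selected entry is not the one with the lowest `measurement`, which is the point of the contrived heuristic: the choice is driven by what is stored under `evaluation` rather than by predictive performance.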

ablaom · 2024-03-17T22:32:04Z

Thanks @dpaetzel for the help with this. I've fixed the invalidated test in a new PR.

Closing in favour of #210.

Expose PerformanceEvaluation in tuning history

e2cbcd6

dpaetzel force-pushed the add-history-additions branch from 5f4b83d to e2cbcd6 on March 11, 2024, 13:11

ablaom mentioned this pull request Mar 17, 2024

Add entire evaluation objects to history #210

Merged

ablaom closed this Mar 17, 2024
