resample calibration post-processors with an internal split #894

simonpcouch · 2024-04-26T16:18:30Z

Related to tidymodels/workflows#225, tidymodels/tailor#12.

Code looks something like (updated 5/22/2024):

library(tidymodels)
library(tailor)

y <- seq(0, 7, .001)
dat <- data.frame(y = y, x = y + (y-3)^2)

dat

wflow <- 
  workflow(
    y ~ x, 
    boost_tree("regression", trees = 3),
    tailor("regression") %>% adjust_numeric_calibration("linear")
  )

fit_resamples(wflow, vfold_cv(dat))

Previous PR description

This PR proposes resampling calibrators using an "internal split"—it's very scrappy at the moment and intended only for internal testing.

library(tidymodels)
library(container)
library(probably)
#> 
#> Attaching package: 'probably'
#> The following objects are masked from 'package:base':
#> 
#>     as.factor, as.ordered

# create example data
set.seed(1)
dat <- tibble(y = rnorm(100), x = y/2 + rnorm(100))

dat
#> # A tibble: 100 × 2
#>         y      x
#>     <dbl>  <dbl>
#>  1 -0.626 -0.934
#>  2  0.184  0.134
#>  3 -0.836 -1.33 
#>  4  1.60   0.956
#>  5  0.330 -0.490
#>  6 -0.820  1.36 
#>  7  0.487  0.960
#>  8  0.738  1.28 
#>  9  0.576  0.672
#> 10 -0.305  1.53 
#> # ℹ 90 more rows

dat_boots <- bootstraps(dat)

# construct workflow
wf_simple <- workflow(y ~ x, boost_tree("regression", trees = 3))

# specify calibration
reg_ctr <-
  container(mode = "regression") %>%
  adjust_numeric_calibration(type = "linear")

wf_post <- wf_simple %>% add_container(reg_ctr)

# resample workflows
set.seed(1)
wf_simple_res <- 
  fit_resamples(
    wf_simple,
    dat_boots,
    control = control_grid(save_pred = TRUE)
  )

set.seed(1)
wf_post_res <- 
  fit_resamples(
    wf_post,
    dat_boots,
    control = control_grid(save_pred = TRUE)
  )

# ...train the post-processor post-hoc
cal_manual <- cal_estimate_linear(wf_simple_res, truth = y)
cal_manual_preds <- cal_apply(wf_simple_res, cal_manual)

simple_preds <- collect_predictions(wf_simple_res, summarize = TRUE)
cal_auto_preds <- collect_predictions(wf_post_res, summarize = TRUE)
cal_manual_preds
#> # A tibble: 100 × 4
#>      .pred  .row      y .config             
#>      <dbl> <int>  <dbl> <chr>               
#>  1 -0.167      1 -0.626 Preprocessor1_Model1
#>  2  0.267      2  0.184 Preprocessor1_Model1
#>  3  0.215      3 -0.836 Preprocessor1_Model1
#>  4  0.273      4  1.60  Preprocessor1_Model1
#>  5 -0.118      5  0.330 Preprocessor1_Model1
#>  6  0.269      6 -0.820 Preprocessor1_Model1
#>  7  0.140      7  0.487 Preprocessor1_Model1
#>  8  0.219      8  0.738 Preprocessor1_Model1
#>  9  0.254      9  0.576 Preprocessor1_Model1
#> 10  0.0856    10 -0.305 Preprocessor1_Model1
#> # ℹ 90 more rows

Averaged predictions from the uncalibrated model:

ggplot(simple_preds, aes(x = y, y = .pred)) + geom_point()

Averaged predictions from the model calibrated internally in tune:

ggplot(cal_auto_preds, aes(x = y, y = .pred)) + geom_point()

Averaged predictions from the uncalibrated model, calibrated manually
after the fact with probably (I’m not sure I got the flow right with
cal_estimate_linear(...) %>% cal_apply(...)?):

ggplot(cal_manual_preds, aes(x = y, y = .pred)) + geom_point()

^{Created on 2024-04-26 with reprex v2.1.0}

As-is, this PR doesn't apply any postprocessor if there's not a calibrator in the postprocessor--mostly intended to allow for experimentation on the statistical properties of resampling calibrators in this way.

topepo · 2024-04-26T18:08:09Z

I was thinking a lot about this this morning. Some thoughts not in our google doc:

Find a way to control the randomness of the sampling. Before this step, there may be other things that consume random numbers.
Export all functions involved in this; finetune and others will need to do this (maybe a standalone?)

R/grid_code_paths.R

+    # * the model (including the post-processor) generates predictions on the
+    #   assessment set (not internal, i.e. `assessment(split)`) and those
+    #   predictions are assessed with performance metrics
+    split <- rsample::initial_split(training)


R/grid_code_paths.R

topepo · 2024-04-26T18:13:10Z

R/grid_code_paths.R

@@ -373,6 +377,25 @@ tune_grid_loop_iter <- function(split,

  training <- rsample::analysis(split)


We should change this name just on principle

R/grid_helpers.R

…split()`

the first step to being able to use `inner_split()`

simonpcouch · 2024-05-31T14:25:09Z

With an eye for reducing Remotes hoopla, I'm going to go ahead and merge and open issues for smaller todos.

github-actions · 2024-06-15T00:37:23Z

This pull request has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex: https://reprex.tidyverse.org) and link to this issue.

simonpcouch added 2 commits April 26, 2024 11:14

resample calibration post-processors with an internal split

312a698

update Remotes ref

1e8376c

topepo reviewed Apr 26, 2024

View reviewed changes

R/grid_code_paths.R Show resolved Hide resolved

topepo reviewed Apr 26, 2024

View reviewed changes

simonpcouch commented Apr 30, 2024

View reviewed changes

R/grid_helpers.R Outdated Show resolved Hide resolved

simonpcouch added 12 commits May 2, 2024 13:20

update Remotes ref

2b7887e

container -> tailor

d2074d5

migrate tune:::should_internal_split() -> `workflows::should_inner_…

a2903bd

…split()`

namespace workflows

ceac8ed

route rset_info through to tune_grid_loop_iter()

3ba3c49

the first step to being able to use `inner_split()`

transition to inner_split()

3515f6d

move tailor to Suggests

956ce6e

namespace assessment()

5470b36

add rsample Remotes ref

cd42187

extract from workflow only once fully trained

2b12b1d

apply postprocessor in predict_model()

0047faf

spec out unit test for comparison to workflows output

8deac08

simonpcouch mentioned this pull request May 23, 2024

Potato/inner/calibration split tidymodels/rsample#483

Merged

simonpcouch added 4 commits May 23, 2024 09:08

update Remotes ref [no ci]

20898e9

replicate RNG state in test

cce9ca4

migrated tailor_fully_trained() to tailor

a59ac70

note ignoring workflow's method argument

544d541

hfrick mentioned this pull request May 24, 2024

Don't export .get_split_args() tidymodels/rsample#495

Closed

update Remotes ref

10798b9

simonpcouch marked this pull request as ready for review May 31, 2024 14:25

simonpcouch merged commit c0996ed into main May 31, 2024
1 of 9 checks passed

simonpcouch deleted the postprocessing branch May 31, 2024 14:26

github-actions bot locked and limited conversation to collaborators Jun 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

resample calibration post-processors with an internal split #894

resample calibration post-processors with an internal split #894

simonpcouch commented Apr 26, 2024 •

edited

Loading

topepo commented Apr 26, 2024

This comment was marked as outdated.

topepo Apr 26, 2024 •

edited

Loading

simonpcouch commented May 31, 2024

github-actions bot commented Jun 15, 2024

		@@ -373,6 +377,25 @@ tune_grid_loop_iter <- function(split,

		training <- rsample::analysis(split)

resample calibration post-processors with an internal split #894

resample calibration post-processors with an internal split #894

Conversation

simonpcouch commented Apr 26, 2024 • edited Loading

topepo commented Apr 26, 2024

This comment was marked as outdated.

topepo Apr 26, 2024 • edited Loading

Choose a reason for hiding this comment

simonpcouch commented May 31, 2024

github-actions bot commented Jun 15, 2024

simonpcouch commented Apr 26, 2024 •

edited

Loading

topepo Apr 26, 2024 •

edited

Loading