
Integrate multiseries into AutoMLSearch #4270

Merged
merged 15 commits into from
Aug 21, 2023
Conversation

@eccabay eccabay commented Aug 11, 2023

Closes #4266

codecov bot commented Aug 11, 2023

Codecov Report

Merging #4270 (193ef57) into main (24ba211) will increase coverage by 0.1%.
The diff coverage is 100.0%.

@@           Coverage Diff           @@
##            main   #4270     +/-   ##
=======================================
+ Coverage   99.7%   99.7%   +0.1%     
=======================================
  Files        355     355             
  Lines      38959   39073    +114     
=======================================
+ Hits       38838   38953    +115     
+ Misses       121     120      -1     
Files Changed Coverage Δ
evalml/pipelines/components/component_base.py 100.0% <ø> (ø)
...sors/multiseries_time_series_baseline_regressor.py 100.0% <ø> (ø)
evalml/problem_types/__init__.py 100.0% <ø> (ø)
evalml/tests/component_tests/test_utils.py 99.3% <ø> (ø)
evalml/tests/conftest.py 98.4% <ø> (ø)
...lml/tests/problem_type_tests/test_problem_types.py 100.0% <ø> (ø)
evalml/utils/gen_utils.py 99.3% <ø> (ø)
...valml/automl/automl_algorithm/default_algorithm.py 99.7% <100.0%> (+0.1%) ⬆️
evalml/automl/automl_search.py 99.8% <100.0%> (+0.1%) ⬆️
evalml/automl/utils.py 97.3% <100.0%> (+0.1%) ⬆️
... and 20 more

@eccabay eccabay marked this pull request as ready for review August 15, 2023 14:12
- use_covariates: bool = True,
+ use_covariates: bool = False,
Contributor Author (@eccabay):
Flipped this default for the sake of speed. My test example did not finish training within a 5-minute window when use_covariates was True, but it ran in under 10 seconds when use_covariates was False.
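To make the trade-off concrete, here is a minimal, hypothetical sketch of how a `use_covariates` flag might gate exogenous features before fitting. The class and its internals are illustrative assumptions, not EvalML's actual estimator; the point is that disabling covariates can shrink the fit to a cheap time-only feature.

```python
# Hypothetical sketch (NOT EvalML's implementation): a regressor whose
# `use_covariates` flag decides whether exogenous features reach the fit.
import numpy as np


class CovariateGatedRegressor:
    def __init__(self, use_covariates: bool = False):
        self.use_covariates = use_covariates
        self.coef_ = None

    def _features(self, X, n_rows):
        # With covariates disabled, fall back to a single time-index
        # feature, which keeps the underlying least-squares fit cheap.
        base = np.asarray(X) if self.use_covariates else np.arange(n_rows).reshape(-1, 1)
        return np.hstack([base, np.ones((n_rows, 1))])  # append intercept column

    def fit(self, X, y):
        features = self._features(X, len(y))
        self.coef_, *_ = np.linalg.lstsq(features, y, rcond=None)
        return self

    def predict(self, X):
        return self._features(X, len(X)) @ self.coef_
```

With `use_covariates=False`, the shape (and cost) of the fit no longer depends on how wide `X` is, which is consistent with the speedup described above.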

Collaborator:
I think it would be nice to see performance tests with use_covariates turned on and off. We could possibly turn it off only for tests if performance is greatly improved with covariates!

@jeremyliweishih (Collaborator) left a comment:

LGTM and agreed with @chukarsten on potentially refactoring is_multiseries into a separate problem type!
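The refactor suggested above could look something like this minimal sketch. The enum values and helper are assumptions loosely mirroring the PR's "Add multiseries time series regression as problem type" commit, not EvalML's actual `ProblemTypes` definition.

```python
# Illustrative sketch (assumed names, not EvalML's real enum): treating
# multiseries as its own problem type instead of an `is_multiseries` boolean.
from enum import Enum


class ProblemTypes(Enum):
    REGRESSION = "regression"
    TIME_SERIES_REGRESSION = "time series regression"
    MULTISERIES_TIME_SERIES_REGRESSION = "multiseries time series regression"


def is_multiseries(problem_type: ProblemTypes) -> bool:
    # A single membership check replaces threading an `is_multiseries`
    # boolean through every call site.
    return problem_type == ProblemTypes.MULTISERIES_TIME_SERIES_REGRESSION
```

The design upside is that problem-type-specific behavior stays derivable from one value already passed everywhere, rather than from an extra flag.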

def get_estimators(problem_type, model_families=None, excluded_model_families=None):

def _filter_multiseries_estimators(estimators, is_multiseries):
    if is_multiseries:
        return [estimator for estimator in estimators if estimator.is_multiseries]
Collaborator:
Nit, and maybe for a follow-up: could estimator.is_multiseries be something like estimator.supports_multiseries? I think it's clearer, especially since we're passing is_multiseries everywhere for now.


* Add multiseries time series regression as problem type

* Completely revamp to multiseries based on problem type
@jeremyliweishih (Collaborator) left a comment:
LGTM again

@MichaelFu512 (Contributor) left a comment:
Looks good to me

@@ -651,6 +653,14 @@ def __init__(
f"Dataset size is too small to create holdout set. Minimum dataset size is {self._HOLDOUT_SET_MIN_ROWS} rows, X_train has {len(X_train)} rows. Holdout set evaluation is disabled.",
)

# For multiseries problems, we need to mke sure that the data is primarily ordered by the time_index rather than the series_id
Contributor:
Nit: we need to mke sure -> we need to make sure
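The code comment being reviewed here concerns reordering long-format multiseries data so rows are primarily ordered by the time index rather than the series id. A hedged pandas sketch of that reordering; the column names "date" and "series_id" are assumptions for illustration, not the PR's actual parameter names:

```python
# Illustrative reordering of long-format multiseries data: primary sort key
# is the time index, secondary key is the series id (assumed column names).
import pandas as pd

X = pd.DataFrame(
    {
        "series_id": ["a", "a", "b", "b"],
        "date": pd.to_datetime(["2023-01-01", "2023-01-02"] * 2),
        "value": [1, 2, 3, 4],
    }
)

# A stable sort keeps the within-timestamp series order deterministic.
X_sorted = X.sort_values(["date", "series_id"], kind="stable").reset_index(drop=True)
```

After sorting, all series' rows for a given timestamp are adjacent, which is the ordering a time-based holdout split needs to avoid leaking part of a timestamp across the split boundary.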

@eccabay eccabay merged commit 7781c77 into main Aug 21, 2023
24 checks passed
@eccabay eccabay deleted the 4266_msts_search branch August 21, 2023 14:36
Successfully merging this pull request may close these issues: Integrate MSTS into AutoMLSearch (EvalML)