Fix/operand error with encoders #2034

madtoinou · 2023-10-25T09:52:37Z

Fixes #1875, fixes #1991

Summary

When encoders are used to generate covariates, the covariates span only the minimum required time range. During tabularization, an arithmetic operation between a Timedelta and a pandas offset must be performed to realign the covariate and target time indexes. However, converting some frequencies ('M', 'Y' and 'y') to Timedelta is ambiguous (see the pandas doc), which causes the unsupported operand error.

To solve the problem for these specific cases, a temporary DatetimeIndex is created and the start time index is extracted from it without relying on the conversion. This is slower than the arithmetic operation.
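
As a rough illustration of the fallback described above (a sketch only, not the code merged in this PR): the helper below, hypothetically named `steps_between`, first tries the Timedelta arithmetic and falls back to a temporary `pd.date_range` when the offset has no fixed length. Offset objects are used instead of string aliases so that the sketch does not depend on alias spellings, and both timestamps are assumed to lie on the frequency grid.

```python
import pandas as pd


def steps_between(start, end, offset):
    """Number of `offset` steps from `start` to `end` (hypothetical helper).

    Assumes `start <= end` and that both timestamps lie on the frequency grid.
    """
    try:
        # Fast path: fixed-length offsets (e.g. hours) convert to a Timedelta,
        # so plain integer division gives the number of steps.
        return int((end - start) // pd.Timedelta(offset))
    except (TypeError, ValueError):
        # Slow path: calendar-based offsets (months, years) have no fixed
        # length, so build a temporary index and count its entries instead.
        tmp_index = pd.date_range(start=start, end=end, freq=offset)
        return len(tmp_index) - 1


hourly = steps_between(
    pd.Timestamp("2020-01-01 00:00"),
    pd.Timestamp("2020-01-01 05:00"),
    pd.offsets.Hour(1),
)
monthly = steps_between(
    pd.Timestamp("2020-01-01"),
    pd.Timestamp("2020-06-01"),
    pd.offsets.MonthBegin(1),
)
print(hourly, monthly)
```

The slow path costs time proportional to the length of the temporary index, which is why keeping that index as short as possible matters.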

codecov-commenter · 2023-10-25T12:15:06Z

Codecov Report

Attention: 1 lines in your changes are missing coverage. Please review.

Files	Coverage Δ
darts/utils/data/tabularization.py	`98.82% <66.66%> (-0.29%)`	⬇️

... and 6 files with indirect coverage changes

dennisbader

Very nice, thanks a lot @madtoinou.
I just had a minor suggestion, and we should also add a test for it.

dennisbader · 2023-10-27T12:38:02Z

darts/utils/data/tabularization.py

+                    start_time_idx = (
+                        len(
+                            pd.date_range(
+                                start=time_index_i[0],


It might be more efficient to generate the index from the end of the series instead of from the beginning and then just add the len(time_index_i) to it?

Also we could use our darts.utils.timeseries_generation.generate_index for that

Yup, I don't know if this is much faster, but at least it looks similar to the other case.
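
One possible reading of the suggestion in this thread, sketched under assumptions: instead of building a temporary index from `time_index_i[0]` up to the target time, build only the short stretch between the target time and the end of the series, then combine its length with `len(time_index_i)`. The function name `position_from_end` is hypothetical, `pd.date_range` stands in for `generate_index` to keep the sketch self-contained, and the target time is assumed to lie on the frequency grid.

```python
import pandas as pd


def position_from_end(time_index, t):
    """Position of `t` relative to time_index[0], computed from the end.

    Hypothetical helper: only the stretch between `t` and the last timestamp
    is generated, which stays short when `t` is close to the series end.
    """
    freq = time_index.freq
    last = time_index[-1]
    if t <= last:
        # `t` is at or before the end of the series: count the tail from `t`.
        tail = pd.date_range(start=t, end=last, freq=freq)
        return len(time_index) - len(tail)
    # `t` is past the end: count the steps from the last timestamp to `t`.
    ahead = pd.date_range(start=last, end=t, freq=freq)
    return len(time_index) + len(ahead) - 2


# Monthly index covering January to December 2020.
idx = pd.date_range(start="2020-01-01", periods=12, freq=pd.offsets.MonthBegin(1))
inside = position_from_end(idx, pd.Timestamp("2020-10-01"))
beyond = position_from_end(idx, pd.Timestamp("2021-02-01"))
print(inside, beyond)
```

Whether this is noticeably faster depends on how far the target time is from the end of the series compared with its start.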

dennisbader

Nice, looks great thanks @madtoinou 🚀

dennisbader · 2023-10-28T13:42:12Z

darts/tests/utils/tabularization/test_create_lagged_training_data.py

@@ -1132,37 +1132,44 @@ def test_lagged_training_data_extend_past_and_future_covariates_range_idx(self):
            assert np.allclose(expected_X, X[:, :, 0])
            assert np.allclose(expected_y, y[:, :, 0])

-    def test_lagged_training_data_extend_past_and_future_covariates_datetime_idx(self):
+    @pytest.mark.parametrize("freq", ["D", "MS", "Y"])


fix: create a temporary Datetime index when series frequency represents a ambiguous timedelta value to extract the start time index

94dcc41

madtoinou requested a review from dennisbader as a code owner October 25, 2023 09:52

dennisbader reviewed Oct 27, 2023

madtoinou added 3 commits October 27, 2023 15:09

feat: updated changelog

7afb93b

fix: fixed corner case, generate the shortest temporary datetimeindex possible

12ba6ee

feat: added tests to cover the cases where the series freq cannot be converted to Timedelta

1e9ed89

dennisbader approved these changes Oct 28, 2023

dennisbader merged commit e6f2208 into master Oct 28, 2023

dennisbader deleted the fix/encoders_operand_error branch October 28, 2023 13:55
