How are samples generated with RegressionModel / MLPRegressor? #2461

Closed
dwolffram opened this issue Jul 16, 2024 · 7 comments · Fixed by #2588
Labels: bug (Something isn't working), good first issue (Good for newcomers)

Comments

dwolffram commented Jul 16, 2024

Hi there,

I tried sklearn's MLPRegressor with the RegressionModel wrapper, and to my surprise, I was able to generate samples with historical_forecasts (e.g. with num_samples=1000). How is this possible? Neither RegressionModel nor MLPRegressor accepts any kind of likelihood or loss function as an argument, or am I missing something?

model.supports_probabilistic_prediction is actually False in my case, but I can still generate samples.

I would be happy to use this, but it is not officially supported, right?

madtoinou (Collaborator) commented

Hi @dwolffram,

Can you please share a reproducible code snippet? If the model is not probabilistic, it should indeed not be able to generate samples, but I might be missing something...

dwolffram (Author) commented Jul 16, 2024

Hi @madtoinou,

that's what I thought, but somehow I get samples anyway 😅 Or am I doing something wrong?

import matplotlib.pyplot as plt
from sklearn.neural_network import MLPRegressor

from darts import concatenate
from darts.datasets import AirPassengersDataset
from darts.models.forecasting.regression_model import RegressionModel

series = AirPassengersDataset().load()
validation_start = 60

# A plain deterministic sklearn regressor (no likelihood or quantile loss).
mlp = MLPRegressor(
    hidden_layer_sizes=(8,),  # note: (8) without the comma would just be the int 8
    max_iter=5000,
)

model = RegressionModel(
    model=mlp,
    output_chunk_length=4,
    multi_models=True,
    lags=4,
)

model.supports_probabilistic_prediction  # False

model.fit(series)

# Requesting 1000 samples from a deterministic model still "works" here:
hfc = model.historical_forecasts(
    series=series,
    start=validation_start,
    forecast_horizon=4,
    stride=4,
    last_points_only=False,
    retrain=False,
    verbose=True,
    num_samples=1000,
)

hfc = concatenate(hfc, axis=0)

series.plot()
hfc.plot()
plt.show()

[Plot: the AirPassengers series with the historical forecasts overlaid; the forecasts display visible quantile bands despite the deterministic model]

dwolffram (Author) commented Jul 16, 2024

I just realized that if I set output_chunk_length < 4, I do get the error "ValueError: num_samples > 1 is only supported for probabilistic models." No idea if that helps 🤔
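
For example, everything unchanged from the snippet above except the shorter output_chunk_length, so forecast_horizon now exceeds it (reconstructed variant, not posted verbatim):

model = RegressionModel(
    model=mlp,
    output_chunk_length=2,  # now smaller than forecast_horizon=4
    multi_models=True,
    lags=4,
)
model.fit(series)

# Raises: ValueError: num_samples > 1 is only supported for probabilistic models.
model.historical_forecasts(
    series=series,
    start=validation_start,
    forecast_horizon=4,
    stride=4,
    last_points_only=False,
    retrain=False,
    num_samples=1000,
)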

madtoinou (Collaborator) commented

I did a bit of investigation; this is a combination of several things:

  • the optimized historical forecasts method is called because retrain=False and forecast_horizon <= output_chunk_length. This method does not rely on predict(); it parallelizes all the predictions to speed things up, and in this parallelization it also duplicates the values along the num_samples dimension.
  • if you look at the forecasts, all the samples at a given position/time step are exactly identical (there are only output_chunk_length distinct values).
  • when plotting, the quantiles are computed from these repeated values.

An easy way to prevent this would be to add a sanity check on the num_samples argument (which is normally taken care of by predict()) to the optimized historical forecasts routine, along the lines of the sketch below. There might be more to it, notably where the predictions are reshaped, but that would require further testing.
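
A minimal sketch of such a guard (illustrative only; the actual function name and call site in the optimized routine may differ):

def _check_num_samples(model, num_samples: int) -> None:
    # predict() normally performs this check; the optimized historical
    # forecasts path skips predict(), so it needs the same guard.
    if num_samples > 1 and not model.supports_probabilistic_prediction:
        raise ValueError(
            "num_samples > 1 is only supported for probabilistic models."
        )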

madtoinou added the 'bug' label Jul 17, 2024
dwolffram (Author) commented

Thanks for looking into it!

madtoinou added the 'good first issue' label Aug 28, 2024
eschibli (Contributor) commented Sep 3, 2024

@madtoinou, why are the quantiles so wide, then? Wouldn't 1000 identical predictions imply zero uncertainty?

madtoinou (Collaborator) commented

Because there are still output_chunk_length different forecasts, due to the erroneous shape of the input tensor.
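
A quick way to inspect the sample dimension, using the concatenated hfc from the earlier snippet (a diagnostic sketch, not part of the fix):

import numpy as np

# all_values() returns an array of shape (n_timesteps, n_components, n_samples).
# Counting the distinct values per time step shows the "samples" are just
# repetitions of a few deterministic predictions, not real probabilistic draws.
vals = hfc.all_values()
for t, ts in enumerate(hfc.time_index):
    print(ts, np.unique(vals[t]).size)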
