
Unify QuantileOutput and DistributionOutput #3093

Merged: 25 commits into awslabs:dev on Jan 10, 2024

Conversation

@shchur (Contributor) commented on Dec 27, 2023:

Issue #, if available: closes #3083

Description of changes:

  • Move loss computation into the Output object and remove all losses defined in gluonts.torch.modules.loss (a sketch of this unified contract follows the list).
  • Make the following torch models compatible with both DistributionOutput and QuantileOutput:
    • SimpleFeedForward
    • TemporalFusionTransformer
    • DLinear
    • PatchTST
    • LagTST
  • Change the return type of the forward method of the following models to Tuple[Tuple[Tensor, ...], Tensor, Tensor]:
    • MQCNN (MXNet)
    • MQRNN (MXNet)
    • TemporalFusionTransformer (MXNet)
    • TemporalFusionTransformer (PyTorch)
  • Update the logic inside QuantileForecastGenerator to support the new unified signature of the forward method
  • Replace the predict_to_numpy method with to_numpy

cc @kashif

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Please tag this PR with at least one of these labels to make our release process faster: BREAKING, new feature, bug fix, other change, dev setup

@shchur marked this pull request as draft on Dec 27, 2023, 13:19
@shchur added labels: BREAKING, enhancement (Dec 27, 2023)
@shchur changed the title from "[WIP] Unify QuantileOutput and DistributionOutput" to "Unify QuantileOutput and DistributionOutput" (Dec 27, 2023)
@shchur marked this pull request as ready for review on Dec 27, 2023, 15:25
@shchur requested a review from @lostella (Dec 27, 2023, 15:26)
@lostella added labels: models, mxnet, torch (Dec 28, 2023)
@lostella (Contributor) left a comment:

Thanks @shchur, left some comments but I'll take a deeper look.

Resolved review threads (outdated):
  • src/gluonts/model/forecast_generator.py
  • src/gluonts/torch/distributions/distribution_output.py
  • src/gluonts/torch/distributions/quantile.py
Comment on lines 118 to 122:

```python
(outputs,), loc, scale = prediction_net(*inputs.values())
if scale is not None:
    outputs = outputs * scale[..., None]
if loc is not None:
    outputs = outputs + loc[..., None]
```
@lostella (Contributor) commented:

I'm wondering: would it be better to apply to_numpy here? Otherwise the type of the output of prediction_net depends on the type of prediction_net, and we're using indexing/multiplication/addition without really knowing whether it will work. Of course, it's just indexing/multiplication/addition, but I'm wondering if it would be somehow clearer if these objects were np.ndarray already at this point.

@shchur (Author) replied:
I agree, your suggestion feels cleaner. Implemented it.

```diff
  yield QuantileForecast(
-     output,
+     output.T,
```
@lostella (Contributor) commented:

Why move the transposition here, from inside the model?

@shchur (Author) replied:

QuantileForecast expects an array of shape [num_quantiles, prediction_length], but models usually produce output of shape [batch_size, prediction_length, *additional_dims], both in the case of DistributionOutput and QuantileOutput. I think it's better to keep the model output shape consistent and do the transpose here.

```diff
@@ -77,7 +78,7 @@ def assert_shapes_and_dtypes(tensors, shapes, dtypes):
 TemporalFusionTransformerModel(
     context_length=24,
     prediction_length=12,
-    quantiles=[0.2, 0.25, 0.5, 0.9, 0.95],
+    distr_output=QuantileOutput([0.2, 0.25, 0.5, 0.9, 0.95]),
```
@lostella (Contributor) commented:
I guess this class now also supports other parametric distribution families, right? Should some test case be added for that?

@shchur (Author) replied.

@lostella (Contributor): Yes, sorry, somehow I missed that.

@lostella mentioned this pull request on Jan 9, 2024
```diff
@@ -292,7 +292,7 @@ def __init__(
     es_num_samples: int = 50,
     beta: float = 1.0,
 ) -> None:
-    super().__init__(self)
+    super().__init__(self, beta=beta)
```
@lostella (Contributor) commented:
Then I guess there's no need to set self.beta further below (line 306)

@shchur (Author) replied:
Good catch!

@shchur merged commit c99dafa into awslabs:dev on Jan 10, 2024
19 checks passed
@maxc01 added a commit to maxc01/gluonts that referenced this pull request on Jan 12, 2024
Labels
  • BREAKING: this is a breaking change (one of the required PR labels)
  • enhancement: new feature or request
  • models: this item concerns model implementations
  • mxnet: this concerns the MXNet side of GluonTS
  • torch: this concerns the PyTorch side of GluonTS
Development: successfully merging this pull request may close Unify QuantileOutput and DistributionOutput (#3083).

2 participants