
Fix scaling for MQ-(C|R)NN when distribution outputs are used #1070

Merged: lostella merged 3 commits into awslabs:master from mqcnn_distr_scale on Oct 2, 2020

Conversation


@dcmaddix commented Oct 1, 2020

Issue #, if available:
#1069

Description of changes:

  • Updated the default for target scaling to True, to work with DistrOutput
  • Left the default for scaling_decoder_dynamic_feature as False
  • Pass scaling_decoder_dynamic_feature to the training and prediction networks
  • Fixed the shape of loc and scale (see the sketch after this list)
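
A hedged sketch of the loc/scale shape issue from #1069 (illustrative only, not the actual gluon-ts internals): a per-series scale of shape (batch,) needs a trailing axis so it broadcasts against distribution parameters that carry a prediction-length axis. The shapes and variable names below are assumptions.

```python
import mxnet as mx

# Hypothetical shapes: the scaler produces one loc/scale per series.
scale = mx.nd.array([2.0, 5.0])    # shape (batch,) = (2,)
loc = mx.nd.zeros_like(scale)      # shape (2,)

# A (2,) array fails to broadcast against (2, 24) parameters under
# NumPy broadcasting rules; adding a trailing axis gives (2, 1),
# which broadcasts over the prediction-length axis as intended.
scale = scale.expand_dims(axis=1)  # shape (2, 1)
loc = loc.expand_dims(axis=1)      # shape (2, 1)

params = mx.nd.ones((2, 24))       # (batch, prediction_length)
rescaled = params * scale + loc    # broadcasts cleanly to (2, 24)
```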

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@dcmaddix requested a review from lostella on October 1, 2020 at 21:19

@lostella commented Oct 2, 2020

@dcmaddix thanks! I think scaling must be off when quantile_output is used, because the output doesn't get scaled back: if you scale the input, then for fixed model parameters two series ts and alpha*ts will yield the same predictions for any alpha > 0. To allow for scaling in this case, the output quantiles would have to be scaled back.

On the other hand, scaling appears to be necessary when using distributions, otherwise NaNs show up all over the place unless the data is nicely scaled (for example, all time series having the same magnitude).

So I ended up making the default value for scaling dependent on which output is being used: this maintains backwards compatibility in the quantile_output case and fixes the distr_output one. I've also fixed the shape issue described in #1069.

Edit: I had to bump the test accuracy for the distr_output cases, since the default scaling behaviour has now changed.
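
A minimal sketch of that conditional default, assuming the scaling: Optional[bool] = None sentinel shown in the second diff below and a hypothetical helper name (not the actual gluon-ts code):

```python
from typing import Optional

def resolve_scaling(scaling: Optional[bool], has_distr_output: bool) -> bool:
    """Hypothetical helper: pick the scaling default per output type."""
    if scaling is not None:
        # An explicit user choice always wins.
        return scaling
    # Distribution outputs need input scaling for numerical stability,
    # and their outputs are scaled back via loc/scale. Quantile outputs
    # are not scaled back, so scaling must stay off there to keep
    # predictions scale-dependent.
    return has_distr_output
```

With this resolution, quantile_output users keep the previous scaling=False behaviour unless they opt in explicitly.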

```diff
@@ -77,7 +77,7 @@ def __init__(
     embedding_dimension: List[int],
     distr_output: Optional[DistributionOutput] = None,
     quantile_output: Optional[QuantileOutput] = None,
-    scaling: bool = False,
+    scaling: bool = True,
```
@dcmaddix:

Should we update the default for scaling back to False here?

@lostella:

Right!

@dcmaddix closed this Oct 2, 2020
@dcmaddix reopened this Oct 2, 2020

@dcmaddix commented Oct 2, 2020

> So I ended up making the default value for scaling dependent on which output is being used […]

Looks great, thanks for updating the default to cover both cases! :)

```diff
@@ -154,7 +155,7 @@ def __init__(
     enable_encoder_dynamic_feature: bool = True,
     enable_decoder_dynamic_feature: bool = True,
     trainer: Trainer = Trainer(),
-    scaling: bool = False,
+    scaling: Optional[bool] = None,
     scaling_decoder_dynamic_feature: bool = False,
```
@dcmaddix:

Do we want to turn on scaling of the dynamic features too, or should that be unrelated to the distribution change?

@lostella:

I think this is unrelated; we could do it, but maybe as a separate story.
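
A hedged usage sketch of the resulting behaviour (import paths follow gluon-ts around late 2020 and may differ across versions; freq, prediction_length, and epochs are arbitrary):

```python
from gluonts.model.seq2seq import MQCNNEstimator
from gluonts.mx.distribution import StudentTOutput
from gluonts.mx.trainer import Trainer

# Default quantile output: scaling resolves to False, preserving the
# previous behaviour for existing users.
quantile_estimator = MQCNNEstimator(
    freq="H",
    prediction_length=24,
    trainer=Trainer(epochs=5),
)

# Distribution output: scaling resolves to True, avoiding the NaNs
# seen with unscaled data.
distr_estimator = MQCNNEstimator(
    freq="H",
    prediction_length=24,
    distr_output=StudentTOutput(),
    trainer=Trainer(epochs=5),
)
```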

@lostella changed the title from "Setting default for target scaling with MQ-CNN to True to work with DistrOutput" to "Fix scaling for MQ-(C|R)NN when distribution outputs are used" on Oct 2, 2020
@lostella merged commit 48a8b04 into awslabs:master on Oct 2, 2020
kashif pushed a commit to kashif/gluon-ts that referenced this pull request on Oct 10, 2020:
Fix scaling for MQ-(C|R)NN when distribution outputs are used (awslabs#1070)

* Setting default for target scaling with MQ-CNN to True to work with DistrOutput

* fix scaling option default, fix scale shape

* bump test accuracy after changing default scaling behaviour

Co-authored-by: Danielle Robinson <dmmaddix@amazon.com>
Co-authored-by: Lorenzo Stella <stellalo@amazon.com>
@dcmaddix deleted the mqcnn_distr_scale branch on October 19, 2020 at 19:40