-
Notifications
You must be signed in to change notification settings - Fork 750
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix scaling for MQ-(C|R)NN when distribution outputs are used #1070
Conversation
@dcmaddix thanks! I think scaling must be off when On the other hand, scaling appears to be necessary when using distributions, otherwise NaNs are all over the place unless data is nicely scaled (like all time series with the same magnitude or something). So I ended up making the default value for Edit: I had to bump the test accuracy for the |
@@ -77,7 +77,7 @@ def __init__( | |||
embedding_dimension: List[int], | |||
distr_output: Optional[DistributionOutput] = None, | |||
quantile_output: Optional[QuantileOutput] = None, | |||
scaling: bool = False, | |||
scaling: bool = True, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we update the default for scaling
back to False
here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Right!
Looks great, thanks for updating the default to cover both cases! :) |
@@ -154,7 +155,7 @@ def __init__( | |||
enable_encoder_dynamic_feature: bool = True, | |||
enable_decoder_dynamic_feature: bool = True, | |||
trainer: Trainer = Trainer(), | |||
scaling: bool = False, | |||
scaling: Optional[bool] = None, | |||
scaling_decoder_dynamic_feature: bool = False, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we want to turn on scaling of the dynamic features or this should be unrelated to the distribution change right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is unrelated maybe, we could do it but maybe as a separate story
…s#1070) * Setting default for target scaling with MQ-CNN to True to work with DistrOutput * fix scaling option default, fix scale shape * bump test accuracy after changing default scaling behaviour Co-authored-by: Danielle Robinson <dmmaddix@amazon.com> Co-authored-by: Lorenzo Stella <stellalo@amazon.com>
Issue #, if available:
#1069
Description of changes:
scaling
for large toTrue
to be used withDistrOutput
scaling_decoder_dynamic_feature
to beFalse
scaling_decoder_dynamic_feature
to training and prediction networksloc
andscale
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.