Add TFT model #962

Merged: 12 commits merged into awslabs:master on Sep 30, 2020
Conversation

@Gandor26 (Contributor) commented on Jul 31, 2020

Issue #, if available:

Description of changes:
Add TFT model [1]; benchmark results:

| Datasets | Context Length | Prediction Length | Epochs / Batches per Epoch | Methods | wP10QL | wP50QL | wP90QL | Running Time (s) |
|---|---|---|---|---|---|---|---|---|
| Electricity | 168 | 24 | 10/1000 | DeepAR | 0.0292 | 0.0687 | 0.0369 | 900 |
| Electricity | 168 | 24 | 10/1000 | Transformer | 0.0317 | 0.0761 | 0.0474 | 500 |
| Electricity | 168 | 24 | 10/1000 | TFT | 0.0677 | 0.1509 | 0.0974 | 1200 |
| Traffic | 168 | 24 | 10/1000 | DeepAR | 0.0618 | 0.1585 | 0.1361 | 1140 |
| Traffic | 168 | 24 | 10/1000 | Transformer | 0.0803 | 0.1656 | 0.1062 | 600 |
| Traffic | 168 | 24 | 10/1000 | TFT | 0.0646 | 0.1531 | 0.1077 | 1500 |
| Parts | 24 | 12 | 5/2000 | DeepAR | 0.2221 | 1.0792 | 1.6332 | 360 |
| Parts | 24 | 12 | 5/2000 | Transformer | 0.2068 | 1.002 | 1.7712 | 480 |
| Parts | 24 | 12 | 5/2000 | TFT | 0.2135 | 1.0312 | 0.9918 | 1060 |
| Wiki-10k | 28 | 7 | 5/2000 | DeepAR | 0.0744 | 0.2185 | 0.2147 | 1300 |
| Wiki-10k | 28 | 7 | 5/2000 | Transformer | NaN | NaN | NaN | 720 |
| Wiki-10k | 28 | 7 | 5/2000 | TFT | 0.0646 | 0.2042 | 0.1539 | 1150 |
| M4-Daily | 28 | 7 | 10/1000 | DeepAR | 0.0174 | 0.0274 | 0.0151 | 400 |
| M4-Daily | 28 | 7 | 10/1000 | Transformer | 0.0455 | 0.0649 | 0.039 | 240 |
| M4-Daily | 28 | 7 | 10/1000 | TFT | 0.0105 | 0.0189 | 0.009 | 1200 |

[1] Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@jaheba (Contributor) left a comment:

Thanks for the PR!

I've just quickly skimmed over the PR and have some minor comments.

Comment on lines 94 to 98
self.static_cardinalities = static_cardinalities or {}
self.static_feature_dims = static_feature_dims or {}
self.dynamic_cardinalities = dynamic_cardinalities or {}
self.dynamic_feature_dims = dynamic_feature_dims or {}
self.past_dynamic_features = past_dynamic_features or []

Contributor:

I think pydantic is able to handle this.

E.g.:

time_features: List[TimeFeature] = [],

Normally you wouldn't use a mutable default like this in Python, but pydantic returns a copy of the empty list, so this should work.
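
A minimal sketch of that pattern (the class below is hypothetical, not the TFT estimator from this PR), assuming the constructor is wrapped with GluonTS's pydantic-backed `@validated()` decorator:

```python
from typing import List

from gluonts.core.component import validated
from gluonts.time_feature import TimeFeature


class ToyEstimator:
    # Hypothetical example: with @validated(), the [] default is validated and
    # copied per instance, so instances do not share one mutable list.
    @validated()
    def __init__(self, time_features: List[TimeFeature] = []) -> None:
        self.time_features = time_features
```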

Contributor Author (@Gandor26):

fixed in abdc711

embedding_dims
), "Length of `embedding_dims` and `embedding_dims` should match"
assert all(
[c > 0 for c in feature_dims]

Contributor:

The [] are not needed here.
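
For example (illustrative only; the message string here is made up), `all()` accepts a generator expression directly:

```python
# No intermediate list is built; all() consumes the generator lazily.
assert all(c > 0 for c in feature_dims), "All feature dimensions must be positive"
```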

Contributor Author (@Gandor26):

fixed in abdc711

OutputTransform = Callable[[DataEntry, np.ndarray], np.ndarray]


class Trainer(BaseTrainer):

Contributor:

Why can't you use the Trainer defined in gluonts.mx.trainer? Or, to ask it differently: what would you need in order to be able to use it?

Contributor:

+1. In the ConvModel PR, he had also overridden it, saying that the base trainer does not accept default argument values in hybrid_forward, and overloaded the trainer to pass a default None for unused feature types. But I think we can use the Trainer directly, either by having the estimator pass booleans indicating whether each feature type is used, or by extending the Trainer class; see the sketch below.
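
As a rough sketch (not code from this PR; the class and field names are hypothetical), the dummy-feature route eventually taken here can be a transformation that injects a constant placeholder, so hybrid_forward needs no default arguments and the stock gluonts.mx.trainer.Trainer works unchanged:

```python
import numpy as np

from gluonts.dataset.common import DataEntry
from gluonts.transform import SimpleTransformation


class AddDummyStaticFeature(SimpleTransformation):
    # Hypothetical transformation: guarantee that `feat_static_real` exists in
    # every data entry, so the network always receives the argument and the
    # default Trainer / forecast generator can bind inputs by name.
    def transform(self, data: DataEntry) -> DataEntry:
        data.setdefault("feat_static_real", np.zeros(1, dtype=np.float32))
        return data
```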


self.feature_dims = feature_dims
self.dtype = dtype
self.__num_features = len(feature_dims)

Contributor:

It's uncommon to use two leading underscores in Python; is this needed here?

Contributor Author (@Gandor26):

Well, I just followed the implementation in gluonts.mx.block.feature.FeatureEmbedder, which also has such a property with name mangling:
https://github.com/Gandor26/gluon-ts/blob/san/src/gluonts/mx/block/feature.py#L65
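
As a side note (an illustrative sketch, not taken from this PR or from gluonts): the double underscore triggers Python name mangling, which stores the attribute under a class-qualified name and so prevents accidental shadowing in subclasses:

```python
class FeatureEmbedder:
    def __init__(self, feature_dims):
        # Stored on the instance as _FeatureEmbedder__num_features.
        self.__num_features = len(feature_dims)

    @property
    def num_features(self):
        return self.__num_features


emb = FeatureEmbedder([2, 3, 5])
print(emb.num_features)                    # 3
print(emb._FeatureEmbedder__num_features)  # 3 -- the mangled attribute name
```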

Comment on lines 75 to 77
assert (
    len(feature_dims) > 0
), "Length of `cardinalities` list must be greater than zero"

Contributor:

Suggested change:
-assert (
-    len(feature_dims) > 0
-), "Length of `cardinalities` list must be greater than zero"
+assert (feature_dims), "Length of `cardinalities` list must be greater than zero"

Contributor Author (@Gandor26):

fixed in abdc711

@lostella (Contributor) left a comment:

Hey @Gandor26, it seems that there is substantial code duplication in this PR which could be avoided. See my inline comments, which are probably related to @jaheba's and @dcmaddix's previous comments regarding the Trainer class.

Is the rewriting needed because of optional features/None inputs to the network? If so, I think this is a separate issue.

Comment on lines 128 to 149
args = inspect.signature(
    net.hybrid_forward
).parameters
inputs = []
for n, (name, arg) in enumerate(args.items()):
    if n == 0:
        if name == "F":
            continue
        else:
            raise RuntimeError(
                f"Expected first argument of HybridBlock to be `F`, "
                f"but found `{name}`"
            )
    if name in data_entry:
        inputs.append(data_entry[name])
    elif not (arg.default is inspect._empty):
        inputs.append(arg.default)
    else:
        raise RuntimeError(
            f"The value of argument `{name}` of HybridBlock is not provided, "
            f"and no default value is set."
        )

Contributor:

Some observations:

  • The code here seems like a refined version of this-plus-this allowing for defaults;
  • This is duplicated in the forecast generator defined below;
  • Both the trainer and the forecast generator now have an obsolete input_names constructor argument.

If you really want to push for this mechanism for TFT, my suggestion is to propose it in a separate PR where you showcase it with a minimal example model. That way you can focus on getting it into the default Trainer class and ForecastGenerator types in a backward-compatible way (so that other models keep working fine with the default Trainer and ForecastGenerator classes).

You should then be able to proceed with the current PR, while avoiding a lot of code duplication.

Contributor:

This of course applies to #961 as well, so the code duplication savings double :-)

@dcmaddix (Contributor) left a comment:

Looks great, Xiaoyong! I think we can merge it in. Thanks for updating it to use the default Trainer.

@Gandor26 (Contributor Author):

@dcmaddix Thanks, Danielle. Just to let you know, I ran one of the benchmark tests and got similar metrics, so it's safe to merge.

@dcmaddix dcmaddix merged commit 4af09c7 into awslabs:master Sep 30, 2020
@lostella (Contributor):

Thanks @Gandor26 for pushing this through! If I remember correctly, you had a mechanism to allow optional arguments to hybrid_forward to work with the training loop, which I thought was pretty cool; we may want to revisit that at some point.

kashif pushed a commit to kashif/gluon-ts that referenced this pull request on Oct 10, 2020:
* remove logging and add new tft impl

* fix multiple bugs in tft

* fix import conflict

* avoid nan in cold-start

* avoid nan in cold-start

* add default dummy static feature

* add license headers

* remove redundant chars in assertions and arg list

* remove customized trainer by adding dummy features

* update __init__.py

* fix import conflict

Co-authored-by: Xiaoyong Jin <jxiaoyon@amazon.com>
Co-authored-by: Danielle Robinson <dcmaddix@gmail.com>