
Split input_transform into context_input_transform and label_input_transform #82

Merged: 11 commits from add-abstractions into amazon-science:main on May 28, 2024

Conversation

abdulfatir
Contributor

@abdulfatir abdulfatir commented May 27, 2024

Description of changes: This splits input_transform into context_input_transform and label_input_transform. Previously, input_transform was used for both the context and the label during training, which led to incorrect results when prediction_length > context_length.

TODO:

  • Update docstrings
  • Test the training script
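To illustrate the failure mode, here is a minimal pure-Python sketch (not the actual Chronos code; CONTEXT_LENGTH and the function bodies are illustrative assumptions): if a single shared transform truncates its input to the context length, applying it to labels silently drops future steps whenever prediction_length > context_length.

```python
CONTEXT_LENGTH = 4  # hypothetical model context window

def context_input_transform(series):
    # Context transform: keep only the most recent CONTEXT_LENGTH steps.
    return series[-CONTEXT_LENGTH:]

def label_input_transform(series):
    # Label transform: no truncation; every future step must survive.
    return list(series)

label = list(range(6))  # prediction_length = 6 > context_length = 4

# Reusing the context transform on labels would drop the two oldest steps:
print(len(context_input_transform(label)))  # 4 steps survive: labels corrupted
print(len(label_input_transform(label)))    # all 6 steps kept
```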

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@abdulfatir changed the title from "Add abstractions: Config, Tokenizer, Pipeline" to "Split input_transform into context_input_transform and label_input_transform" on May 27, 2024
Review comment on src/chronos/chronos.py:
if length > self.config.context_length:
context = context[..., -self.config.context_length :]

token_ids, attention_mask, scale = self._input_transform(context=context)

I'm wondering: is _input_transform needed, or could context_input_transform just piggy-back on label_input_transform here?
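One way such a refactor could look, as a hedged sketch (hypothetical names and bodies, assuming both public transforms delegate to one shared encoding core rather than keeping a separate _input_transform):

```python
def _encode(series, scale=None):
    # Hypothetical shared core: mean-abs-scale the values into "token
    # ids"; computes the scale only when none is supplied.
    if scale is None:
        scale = sum(abs(x) for x in series) / max(len(series), 1) or 1.0
    return [round(x / scale, 3) for x in series], scale

def context_input_transform(context, context_length=4):
    # Truncate to the context window, then encode; the scale computed
    # here is returned so the label transform can reuse it.
    return _encode(context[-context_length:])

def label_input_transform(label, scale):
    # Labels are never truncated and reuse the context's scale.
    tokens, _ = _encode(label, scale=scale)
    return tokens
```

Under this sketch the caller threads the context's scale into the label transform, so neither public method needs a private _input_transform of its own.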


@lostella lostella left a comment


I have a proposal for slightly shorter names. Don't hate me, names are hard:

  • context_input_transform -> encode_context
  • label_input_transform -> encode_label
  • output_transform -> decode_samples

What do you think? Of course, if we change the names, the docstrings will need to be updated as well.

@abdulfatir
Contributor Author

In general, I don't disagree that the names can be improved, but I wonder whether encode and decode carry other connotations, i.e., the encoder and decoder in an encoder-decoder transformer model.

@abdulfatir abdulfatir merged commit 223e576 into amazon-science:main May 28, 2024
2 checks passed
@abdulfatir abdulfatir deleted the add-abstractions branch November 29, 2024 11:48