Add post_dataloading_processing method to Trainer #1985

fegin · 2025-11-04T08:03:30Z

Stack from ghstack (oldest at bottom):

We are adding more actions to convert the raw inputs and label.

The new CP can do the input/label/BlockMask sharding this in this method.
The experimental full dtensor model can simply override this method without changing too many Trainer code.

This method is extracted from #1857

Makeing this a standalone PR allows us to continue the two projects above without one blocks another.

[ghstack-poisoned]

We are adding more actions to convert the raw inputs and label. 1. The new CP can do the input/label/BlockMask sharding this in this method. 2. The experimental full dtensor model can simply override this method without changing too many Trainer code. This method is extracted from #1857 Makeing this a standalone PR allows us to continue the two projects above without one blocks another. ghstack-source-id: d1882a7 Pull-Request: #1985

torchtitan/train.py

[ghstack-poisoned]

Stack from [ghstack](https://github.com/ezyang/ghstack/tree/0.12.0) (oldest at bottom): * pytorch#2002 * pytorch#2001 * pytorch#1995 * __->__ pytorch#1985 We are adding more actions to convert the raw inputs and label. 1. The new CP can do the input/label/BlockMask sharding this in this method. 2. The experimental full dtensor model can simply override this method without changing too many Trainer code. This method is extracted from pytorch#1857 Makeing this a standalone PR allows us to continue the two projects above without one blocks another.

fegin added 2 commits November 4, 2025 00:03

Update (base update)

7daf10c

[ghstack-poisoned]

Update

f6a66d9

[ghstack-poisoned]

fegin requested review from tianyu-l, wconstab and wwwjn as code owners November 4, 2025 08:03

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Nov 4, 2025

tianyu-l reviewed Nov 5, 2025

View reviewed changes

torchtitan/train.py Outdated Show resolved Hide resolved

torchtitan/train.py Outdated Show resolved Hide resolved

tianyu-l approved these changes Nov 5, 2025

View reviewed changes

torchtitan/train.py Outdated Show resolved Hide resolved

fegin mentioned this pull request Nov 5, 2025

Deduplicate TorchTitan main function #1995

Merged

fegin added 2 commits November 6, 2025 23:28

Update (base update)

97e2925

[ghstack-poisoned]

Update

c747ac5

[ghstack-poisoned]

This was referenced Nov 7, 2025

[SimpleFSDP] Add typing to simple_fsdp.py #2001

Merged

[Full DTensor] Add full_dtensor flag #2002

Closed

Update

5ede5b6

[ghstack-poisoned]

fegin changed the base branch from gh/fegin/24/base to main November 7, 2025 18:06

fegin merged commit 157d30d into main Nov 7, 2025
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add post_dataloading_processing method to Trainer #1985

Add post_dataloading_processing method to Trainer #1985

Uh oh!

fegin commented Nov 4, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add post_dataloading_processing method to Trainer #1985

Add post_dataloading_processing method to Trainer #1985

Uh oh!

Conversation

fegin commented Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fegin commented Nov 4, 2025 •

edited

Loading