dstaay-fb
Contributor

Summary: PrefetchTrainPipelineSparseDist now uses the legacy TrainPipeline API; the newer internals will be refactored later, provided the result is memory neutral or better.

Differential Revision: D57143337

dstaay-fb added 2 commits May 8, 2024 22:30
…h#1967)

Summary:
X-link: facebookresearch/recipes#43


As users highlighted, the TrainPipeline refactoring introduced a memory regression of ~2%, because it added more context management for code readability. This results in higher peak memory, since it takes longer for a context's refcount to drop to zero.

It is relatively easy to be much more aggressive about releasing memory stored in TrainPipelineContext.
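
A minimal sketch of the idea, not the actual change in this PR: `PipelineContext`, `pending_requests`, and `Buffer` are hypothetical stand-ins, not TorchRec's `TrainPipelineContext` or its fields. It assumes CPython reference counting, where an object is freed as soon as its last reference goes away. Reading a value out of the context keeps it alive for as long as the context lives, while popping it releases it as soon as the consumer is done.

```python
import weakref
from dataclasses import dataclass, field
from typing import Any, Callable, Dict


class Buffer:
    """Stand-in for a large tensor (plain objects support weak references)."""

    def __init__(self, name: str) -> None:
        self.name = name


@dataclass
class PipelineContext:
    """Hypothetical context object; NOT TorchRec's TrainPipelineContext."""

    pending_requests: Dict[str, Any] = field(default_factory=dict)


def consume_lazily(ctx: PipelineContext) -> str:
    # Reads the entry but leaves it in the context, so the buffer stays
    # alive until the whole context is dropped.
    return ctx.pending_requests["sparse"].name


def consume_eagerly(ctx: PipelineContext) -> str:
    # pop() removes the context's reference; once this function returns,
    # nothing refers to the buffer and CPython frees it immediately.
    request = ctx.pending_requests.pop("sparse")
    return request.name


def buffer_alive_after(consume: Callable[[PipelineContext], str]) -> bool:
    ctx = PipelineContext()
    buf = Buffer("sparse_features")
    ctx.pending_requests["sparse"] = buf
    probe = weakref.ref(buf)  # observes the buffer without keeping it alive
    del buf                   # the context now holds the only strong reference
    consume(ctx)
    return probe() is not None  # checked while ctx is still in scope
```

Calling `buffer_alive_after(consume_lazily)` and `buffer_alive_after(consume_eagerly)` shows the difference: with the lazy consumer the buffer outlives its last use because the context still references it, which is the kind of delayed release that raises peak memory.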

Broader internal discussion:
https://fb.workplace.com/groups/970281557043698/permalink/1664528510952329/

Differential Revision: D57123339

Privacy Context Container: 1203980333745195
Summary: PrefetchTrainPipelineSparseDist now uses the legacy TrainPipeline API; the newer internals will be refactored later, provided the result is memory neutral or better.

Differential Revision: D57143337
@facebook-github-bot added the CLA Signed label May 9, 2024
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D57143337

dstaay-fb added a commit to dstaay-fb/torchrec that referenced this pull request May 9, 2024
…#1969)

Summary:

PrefetchTrainPipelineSparseDist now uses the legacy TrainPipeline API; the newer internals will be refactored later, provided the result is memory neutral or better.

Differential Revision: D57143337
Labels: CLA Signed, fb-exported