dstaay-fb
Contributor

Summary:
As users highlighted, the TrainPipeline refactoring introduced a memory regression of ~2%, caused by the extra context management added for code readability. Peak memory is higher because it takes longer for a context's refcount to drop to zero.

It is relatively easy to be much more aggressive about releasing memory stored in TrainPipelineContext.
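
A minimal sketch of the idea, not the actual change in this PR: `PipelineContext`, `Buffer`, and the field names below are hypothetical stand-ins (plain Python, no torchrec imports), and the immediate-free behaviour relies on CPython reference counting. It contrasts a consumer that leaves results in the context with one that drops the context's references as soon as they are used.

```python
import weakref
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple


class Buffer:
    """Stand-in for a large tensor (hypothetical, not a torchrec type)."""

    def __init__(self, nbytes: int) -> None:
        self.data = bytearray(nbytes)


@dataclass
class PipelineContext:
    """Hypothetical per-batch context; field names are illustrative only."""

    input_dist_results: Dict[str, Buffer] = field(default_factory=dict)
    batch: Optional[Buffer] = None


def consume_lazily(ctx: PipelineContext) -> int:
    """Read the results but leave them stored in the context.

    The buffers stay alive until the context itself is dropped.
    """
    return sum(len(buf.data) for buf in ctx.input_dist_results.values())


def consume_and_release(ctx: PipelineContext) -> int:
    """Read the results and drop the context's references right away."""
    total = 0
    while ctx.input_dist_results:
        _, buf = ctx.input_dist_results.popitem()
        total += len(buf.data)
        del buf  # drop the local reference as well
    ctx.batch = None
    return total


def demo() -> Tuple[bool, bool]:
    """Return (alive after lazy read, alive after eager release)."""
    ctx = PipelineContext()
    buf = Buffer(1024)
    probe = weakref.ref(buf)  # lets us observe when the buffer is freed
    ctx.input_dist_results["features"] = buf
    del buf  # the context now holds the only strong reference

    consume_lazily(ctx)
    alive_after_lazy = probe() is not None

    consume_and_release(ctx)
    alive_after_release = probe() is not None
    return alive_after_lazy, alive_after_release
```

Under these assumptions, the buffer outlives the lazy read because the context still references it, while the eager variant lets it be reclaimed before the context goes away, which is the kind of earlier release the summary describes.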

Differential Revision:
D57123339

Privacy Context Container: 1203980333745195

@facebook-github-bot facebook-github-bot added the CLA Signed label May 8, 2024
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D57123339

dstaay-fb added a commit to dstaay-fb/torchrec that referenced this pull request May 8, 2024
…h#1967)

@dstaay-fb dstaay-fb force-pushed the export-D57123339 branch from 48623f7 to 287c942 Compare May 8, 2024 20:54
dstaay-fb added a commit to dstaay-fb/torchrec that referenced this pull request May 8, 2024
…h#1967)

Broader internal discussion:
https://fb.workplace.com/groups/970281557043698/permalink/1664528510952329/

@dstaay-fb dstaay-fb force-pushed the export-D57123339 branch from 287c942 to 33f349c Compare May 8, 2024 23:22
dstaay-fb added a commit to dstaay-fb/recipes that referenced this pull request May 8, 2024
Summary:
X-link: meta-pytorch/torchrec#1967

…h#1967)

Summary:
X-link: facebookresearch/recipes#43


dstaay-fb added a commit to dstaay-fb/recipes that referenced this pull request May 9, 2024
Summary:

X-link: meta-pytorch/torchrec#1967

@dstaay-fb dstaay-fb force-pushed the export-D57123339 branch from 33f349c to 915606b Compare May 9, 2024 05:03
dstaay-fb added a commit to dstaay-fb/torchrec that referenced this pull request May 9, 2024
…h#1967)

Summary:
X-link: facebookresearch/recipes#43


dstaay-fb added a commit to dstaay-fb/torchrec that referenced this pull request May 9, 2024
…h#1967)

Summary:
X-link: facebookresearch/recipes#43


Reviewed By: joshuadeng

henrylhtsang pushed a commit to henrylhtsang/torchrec that referenced this pull request May 9, 2024
…h#1967)

Summary:
X-link: facebookresearch/recipes#43

Pull Request resolved: meta-pytorch#1967

dstaay-fb added a commit to dstaay-fb/torchrec that referenced this pull request May 9, 2024
…h#1967)

Summary:
X-link: facebookresearch/recipes#43


Reviewed By: joshuadeng, henrylhtsang

facebook-github-bot pushed a commit to facebookresearch/recipes that referenced this pull request May 10, 2024
Summary:
Pull Request resolved: #43

X-link: meta-pytorch/torchrec#1967

Reviewed By: joshuadeng, henrylhtsang

fbshipit-source-id: e2bd0c95d59619786168b31745c3b79a52fd5969
Labels
CLA Signed, fb-exported
2 participants