Add profiling to dataloader next()
#12124
Conversation
Hi @akashkw!
We cannot profile next() like this, as it would not cover prefetched batches or inter-batch parallelism.
I recently had a call with @daniellepintz where I went over possible solutions. Here are some code pointers.
The concrete lines to profile:
https://github.com/PyTorchLightning/pytorch-lightning/blob/a0655611de460f1659b46150cda256d2e3fa974e/pytorch_lightning/utilities/fetching.py#L267
Option 1:
Pass a profiler reference to the data fetchers (perhaps also the stage and dataloader index for more fine-grained profiling), then use these directly inside the fetching functions.
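A minimal sketch of option 1, assuming a simplified fetcher; the real `AbstractDataFetcher` API in fetching.py differs, and `ProfiledFetcher` plus its action-name format are hypothetical:

```python
# Hypothetical sketch of option 1: the fetcher holds a profiler reference
# (plus stage and dataloader index) and profiles the actual `next()` call.
from contextlib import nullcontext
from typing import Any, Iterable, Iterator, Optional


class ProfiledFetcher:
    def __init__(self, profiler=None, stage: str = "train", dataloader_idx: int = 0) -> None:
        self.profiler = profiler  # e.g. the trainer's profiler, or None
        self.stage = stage
        self.dataloader_idx = dataloader_idx
        self.iterator: Optional[Iterator] = None

    def setup(self, dataloader: Iterable) -> None:
        self.iterator = iter(dataloader)

    def fetch(self) -> Any:
        # Wrap only the `next()` call, so the time spent waiting on the
        # dataloader is attributed to data fetching.
        action = f"[{self.stage}] dataloader_idx_{self.dataloader_idx}.next"
        ctx = self.profiler.profile(action) if self.profiler is not None else nullcontext()
        with ctx:
            return next(self.iterator)
```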
Option 2:
Inject the profiler usage through callables that could be part of these hooks:
https://github.com/PyTorchLightning/pytorch-lightning/blob/a0655611de460f1659b46150cda256d2e3fa974e/pytorch_lightning/utilities/fetching.py#L61-L65
with a pattern similar to the one used in this PR to inject logic around the optimizer.step() (see 4cc05b2).
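A minimal sketch of option 2, assuming hypothetical `on_fetch_start`/`on_fetch_end` callables on the fetcher (the actual hooks at the lines linked above may be named differently); `profiler.start`/`profiler.stop` mirror the pytorch_lightning profiler interface:

```python
# Hypothetical sketch of option 2: the fetcher exposes no-op callables that
# the trainer can replace, so the fetcher never holds a trainer reference.
from typing import Any, Iterable


class HookedFetcher:
    def __init__(self) -> None:
        # No-op defaults; callers may swap in profiling callables.
        self.on_fetch_start = lambda: None
        self.on_fetch_end = lambda: None

    def setup(self, dataloader: Iterable) -> None:
        self.iterator = iter(dataloader)

    def fetch(self) -> Any:
        self.on_fetch_start()
        try:
            return next(self.iterator)
        finally:
            self.on_fetch_end()


def install_profiling(fetcher: HookedFetcher, profiler, action: str) -> None:
    # The profiler stays owned by the trainer; only these two callables
    # are handed to the fetcher.
    fetcher.on_fetch_start = lambda: profiler.start(action)
    fetcher.on_fetch_end = lambda: profiler.stop(action)
```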
@carmocca do you have any preference between these two approaches?
I prefer option (2) so the fetching does not need to keep a reference to the trainer (the profiler is owned by the trainer).
Good progress!
```diff
@@ -485,3 +486,88 @@ def validation_step(self, batch, batch_idx):
 assert dm.count_called_on_before_batch_transfer == 4
 assert dm.count_called_transfer_batch_to_device == 4
 assert dm.count_called_on_after_batch_transfer == 4

+@RunIf(skip_windows=True)  # TODO: all durations are 0 on Windows
```
I'm unsure if this is a known issue with the profiler on Windows or a bug...
We should open an issue. It just uses the time module, nothing fancy.
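For reference, a small self-contained probe (not from this PR) showing how coarse a platform's clocks can be, which would explain fast fetches rounding down to zero durations on Windows:

```python
# Measure the smallest observable nonzero tick of each stdlib clock.
# On Windows, time.time() historically advances in ~15.6 ms steps, so a
# fast `next()` call can report a duration of 0.
import time


def min_tick(clock) -> float:
    start = clock()
    while True:
        delta = clock() - start
        if delta > 0:
            return delta


for name in ("time", "monotonic", "perf_counter"):
    print(f"time.{name}: {min_tick(getattr(time, name)):.9f} s")
```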
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Akihiro Nitta <nitta@akihironitta.com>
What does this PR do?
Adds profiling to next(dataloader_iter) in the train, eval, and prediction loops.
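A hedged usage sketch (MyModel and MyDataModule are placeholders, and the exact action names recorded may differ):

```python
# Enable the simple profiler so dataloader `next()` timings appear in the
# summary printed after fit. `MyModel` and `MyDataModule` are placeholders
# for a LightningModule and LightningDataModule defined elsewhere.
import pytorch_lightning as pl

trainer = pl.Trainer(profiler="simple", max_epochs=1)
trainer.fit(MyModel(), datamodule=MyDataModule())
# The profiler report should now include an entry for fetching the next
# batch from the dataloader iterator.
```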
Does your PR introduce any breaking changes? If yes, please list them.
Before submitting
PR review
Anyone in the community is welcome to review the PR.
Before you start reviewing, make sure you have read the Review guidelines. In short, see the following bullet list:
Did you have fun?
Make sure you had fun coding 🙃