Add `pin_memory` as a DataPipe #1013
ejguan added a commit to ejguan/data that referenced this issue on Feb 21, 2023:
Summary: Fixes pytorch#1013

## Changes
- Simplify the control flow of the prefetcher
- Delay Exceptions raised from the thread worker to the main thread in `__iter__`
- Stop prefetching whenever an Exception is received
- As long as `stop_iteration` is not turned on or the `buffer` is not empty, continue yielding data from `__iter__`
- Add serialization test
- Add `PinMemory` DataPipe
- `is_replicable() -> False` to keep it in the main process
- Add unit tests
- Update `test_proto_multi_rs.py` to `test_mprs.py`

Pull Request resolved: pytorch#1014
Reviewed By: NivekT
Differential Revision: D43329696
Pulled By: ejguan
fbshipit-source-id: da4326dbe2388f4e23b9a1a3a5c43da09d29185a
ejguan added a commit that referenced this issue on Feb 21, 2023:
Summary: Fixes #1013

## Changes
- Simplify the control flow of the prefetcher
- Delay Exceptions raised from the thread worker to the main thread in `__iter__`
- Stop prefetching whenever an Exception is received
- As long as `stop_iteration` is not turned on or the `buffer` is not empty, continue yielding data from `__iter__`
- Add serialization test
- Add `PinMemory` DataPipe
- `is_replicable() -> False` to keep it in the main process
- Add unit tests
- Update `test_proto_multi_rs.py` to `test_mprs.py`

Pull Request resolved: #1014
Reviewed By: NivekT
Differential Revision: D43329696
Pulled By: ejguan
fbshipit-source-id: da4326dbe2388f4e23b9a1a3a5c43da09d29185a
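The exception-handling flow described in the commits above can be illustrated with a small, self-contained sketch. This is hypothetical code (the class name and structure are not torchdata's internals): a worker thread fills a bounded buffer, any exception it hits is stored and re-raised later from `__iter__` in the main thread, and iteration keeps yielding as long as the worker is still running or the buffer is non-empty.

```python
import threading
import time
from collections import deque


class _PrefetcherSketch:
    """Illustrative prefetcher (hypothetical name, not torchdata internals)."""

    def __init__(self, source, buffer_size=1):
        self.source = source
        self.buffer_size = buffer_size
        self.buffer = deque()
        self.exception = None        # set by the worker, re-raised in __iter__
        self.stop_iteration = False  # set by the worker when the source ends

    def _worker(self):
        try:
            for item in self.source:
                # Block while the buffer is full.
                while len(self.buffer) >= self.buffer_size:
                    time.sleep(0.001)
                self.buffer.append(item)
        except Exception as exc:
            self.exception = exc     # delay the exception to the main thread
        finally:
            self.stop_iteration = True

    def __iter__(self):
        threading.Thread(target=self._worker, daemon=True).start()
        # Keep yielding as long as the worker is running or data remains.
        while not self.stop_iteration or self.buffer:
            if self.exception is not None:
                raise self.exception  # surfaced in the main thread
            if self.buffer:
                yield self.buffer.popleft()
            else:
                time.sleep(0.001)
        # The worker may have failed after the buffer was drained.
        if self.exception is not None:
            raise self.exception
```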
🚀 The feature
In the previous `DataLoader`, we relied on the `pin_memory` argument to launch a thread that copies Tensors from pageable CPU memory into pinned (page-locked) host memory, so that subsequent host-to-GPU transfers can run asynchronously. This feature should be implemented as a DataPipe with `is_replicable() -> False` to keep it in the main process. It should be easy to achieve by doing the same thing as `prefetch` with a buffer size of 1; a sketch of such a DataPipe follows the motivation below.

Motivation, pitch
Feature parity with `DataLoader`.
This DataPipe can also serve as an indicator that subsequent operations run on the GPU.
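A minimal sketch of what such a DataPipe could look like, reusing `torch.Tensor.pin_memory()` and the `is_replicable()` hook mentioned above. The functional name `pin_memory` and the class name below are assumptions for illustration only (registering the name would clash if a real `pin_memory` DataPipe is already installed), and the real implementation would likely also handle nested containers of Tensors and could run pinning in a background thread like `prefetch` does.

```python
import torch
from torch.utils.data import IterDataPipe, functional_datapipe


@functional_datapipe("pin_memory")  # assumed functional name, for illustration
class PinMemoryIterDataPipe(IterDataPipe):
    """Sketch: pins CPU Tensors yielded by the source DataPipe so that
    later host-to-device copies can be asynchronous (non_blocking=True)."""

    def __init__(self, source_datapipe):
        self.source_datapipe = source_datapipe

    def is_replicable(self):
        # Keep this DataPipe in the main process when a multiprocessing
        # reading service replicates the rest of the graph.
        return False

    def __iter__(self):
        for item in self.source_datapipe:
            # Only CPU Tensors can be pinned; pass everything else through.
            if isinstance(item, torch.Tensor) and item.device.type == "cpu":
                yield item.pin_memory()
            else:
                yield item
```

With the functional form registered, a pipeline could then read, for example, `dp = dp.batch(8).collate().pin_memory()`, and a downstream consumer could copy batches to the GPU with `tensor.to("cuda", non_blocking=True)`.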
Alternatives
No response
Additional context
No response