You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to stream a large data file instead of loading it so it doesn't have to be pickled for multi-processing. However, open-file objects give a TypeError: cannot serialize '_io.TextIOWrapper' object error, so I have to open it within a subprocess instead-- but train_dataloader and val_dataloader methods get called in the main process of pytorch-lightning! How can I bypass issue without changing the source code?
The text was updated successfully, but these errors were encountered:
sabetAI
changed the title
pytorch-lightning multi-processing is not compatible with large datasets
pytorch-lightning multi-processing is incompatible with large datasets
May 27, 2020
sabetAI
changed the title
pytorch-lightning multi-processing is incompatible with large datasets
DDP is incompatible with large datasets
May 27, 2020
I'm trying to stream a large data file instead of loading it so it doesn't have to be pickled for multi-processing. However, open-file objects give a
TypeError: cannot serialize '_io.TextIOWrapper' object
error, so I have to open it within a subprocess instead-- but train_dataloader and val_dataloader methods get called in the main process of pytorch-lightning! How can I bypass issue without changing the source code?The text was updated successfully, but these errors were encountered: