
Adding support for persistent workers and logger fix #198

Open · wants to merge 4 commits into main
Conversation

jlotthammer (Author) commented:

What is the goal of this PR?

This PR makes data loading and logging in pyrelational's DataManager and LightningModelManager classes more configurable and robust: it adds support for persistent workers in data loading and makes it possible to pass loggers to the PyTorch Lightning trainer. Persistent workers let data loading make better use of resources over prolonged multi-epoch tasks, and the logger change provides support for PyTorch Lightning loggers.

What are the changes implemented in this PR?

Persistent Workers in DataManager:

The DataManager class now includes an option to keep workers persistent across data loading iterations, controlled by a new loader_persistent_workers parameter. Its default of loader_persistent_workers: bool = False preserves backwards compatibility, while enabling it reduces worker initialization overhead in multi-epoch training, benefiting users handling larger datasets or needing faster data loader setup times.
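
As a rough sketch of how such a flag typically plumbs through to PyTorch's DataLoader (the constructor shown here is illustrative, not pyrelational's exact signature):

```python
from torch.utils.data import DataLoader, Dataset


class DataManager:
    def __init__(
        self,
        dataset: Dataset,
        loader_batch_size: int = 32,
        loader_num_workers: int = 0,
        loader_persistent_workers: bool = False,  # new flag; False keeps the old behaviour
    ):
        self.dataset = dataset
        self.loader_batch_size = loader_batch_size
        self.loader_num_workers = loader_num_workers
        self.loader_persistent_workers = loader_persistent_workers

    def get_loader(self) -> DataLoader:
        # persistent_workers keeps worker processes alive between epochs,
        # avoiding the re-spawn overhead; PyTorch requires num_workers > 0
        # when persistent_workers=True.
        return DataLoader(
            self.dataset,
            batch_size=self.loader_batch_size,
            num_workers=self.loader_num_workers,
            persistent_workers=self.loader_persistent_workers,
        )
```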

Default CSVLogger in LightningModelManager:

In the LightningModelManager class, default logging has been configured using PyTorch Lightning's CSVLogger. If no logger is specified in trainer_config, a CSVLogger is automatically set up to log to config["checkpoints_dir"], consistent with PyTorch Lightning.
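
A simplified sketch of the described fallback (the real init_trainer also wires up callbacks and checkpointing; the config keys here follow the PR's description):

```python
from pytorch_lightning import Trainer
from pytorch_lightning.loggers import CSVLogger

# Illustrative trainer_config; "checkpoints_dir" is a pyrelational-level key.
trainer_config = {"max_epochs": 10, "logger": None, "checkpoints_dir": "checkpoints"}

# Fall back to a CSVLogger writing alongside the checkpoints when no logger
# is given; logger=False would still disable logging entirely.
logger = trainer_config["logger"]
if logger is None:
    logger = CSVLogger(save_dir=trainer_config["checkpoints_dir"])

trainer = Trainer(max_epochs=trainer_config["max_epochs"], logger=logger)
```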

Lastly, the abstract model manager now uses a context manager to load JSON data.
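
For reference, the context-manager pattern this refers to looks like the following (the file name is illustrative):

```python
import json

# The with-statement guarantees the file handle is closed even if
# json.load raises, unlike a bare open()/read() without a close().
with open("metrics.json", "r") as f:
    data = json.load(f)
```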

@thomasgaudelet (Contributor) commented:

Thanks for this! Adding @paulmorio for review! We'll review asap 🙂

@thomasgaudelet (Contributor) left a review comment:

Thanks for this!

I would only ask to remove the CSVLogger default.

@@ -60,6 +61,10 @@ def init_trainer(self) -> Tuple[Trainer, ModelCheckpoint]:
config = self.trainer_config
config = _add_pyl_trainer_defaults(config)
callbacks: List[Callback] = []

if config["logger"] is None:
@thomasgaudelet (Contributor) commented on the diff:

I think the default should remain with no logger; at the very least there should be an option to not have a logger.

@jlotthammer (Author) replied:

Hi, Thomas. I was trying to match the PyTorch Lightning API, which sets logger to None by default and uses the default behaviour of CSVLogger. To have no logger, consistent with the Lightning API, you would set logger=False. So we currently support all of this functionality and match PyTorch Lightning's defaults. If you want to change it, however, we can!
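
To illustrate the three behaviours being discussed (a sketch following the Lightning logger argument semantics; the exact handling inside LightningModelManager may differ):

```python
from pytorch_lightning.loggers import TensorBoardLogger

# logger=None: the PR falls back to a CSVLogger in config["checkpoints_dir"]
trainer_config = {"logger": None}

# logger=False: no logger at all, matching the Lightning Trainer API
trainer_config = {"logger": False}

# An explicit logger instance is passed straight through to the Trainer.
trainer_config = {"logger": TensorBoardLogger(save_dir="logs")}
```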

@thomasgaudelet (Contributor) replied:

I see, but wouldn't that mean we don't need this, since by default None would get converted to a CSVLogger anyway by the Trainer?

Labels: None yet
Projects: None yet
2 participants