Failing test: test_running_test_pretrained_model_ddp #979

Closed
neggert opened this issue Feb 28, 2020 · 2 comments · Fixed by #1017
Labels: bug (Something isn't working), help wanted (Open to be worked on), priority: 0 (High priority task)
Milestone: 0.7.0

Comments

neggert (Contributor) commented Feb 28, 2020

I think this is another problem stemming from the fact that we don't have a way to pass data back from torch.multiprocessing.spawn. Needs more investigation.

def test_running_test_pretrained_model_ddp(tmpdir):
        """Verify `test()` on pretrained model."""
        ...
        # run test set
        new_trainer = Trainer(**trainer_options)
>       new_trainer.test(pretrained_model)
tests/test_restore_models.py:60:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
pytorch_lightning/trainer/trainer.py:1189: in test
    self.run_evaluation(test_mode=True)
pytorch_lightning/trainer/evaluation_loop.py:299: in run_evaluation
    if test_mode and not self.is_overriden('test_step'):
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
self = <pytorch_lightning.trainer.trainer.Trainer object at 0x7f845ec23f90>, f_name = 'test_step', model = None
    def is_overriden(self, f_name, model=None):
        if model is None:
            model = self.get_model()
        super_object = LightningModule
        # when code pointers are different, it was overriden
>       is_overriden = getattr(model, f_name).__code__ is not getattr(super_object, f_name).__code__
E       AttributeError: 'NoneType' object has no attribute 'test_step'
pytorch_lightning/trainer/model_hooks.py:20: AttributeError
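For context, a minimal sketch of the underlying limitation (this is not Lightning's actual code path; the `worker` function and `state` dict are made-up names). Anything assigned inside a spawned worker stays in that worker's copy of the objects, so the parent never sees it:

import torch.multiprocessing as mp


def worker(rank, state):
    # Each spawned process receives a pickled copy of `state`;
    # mutating it here has no effect on the parent's copy.
    state["model"] = f"trained in rank {rank}"


if __name__ == "__main__":
    state = {"model": None}
    # spawn() joins the workers, but it does not hand back anything
    # they produced, so the parent's `state` is unchanged.
    mp.spawn(worker, args=(state,), nprocs=2)
    print(state["model"])  # still None in the parent process

That would explain why `get_model()` returns `None` in the parent Trainer after the DDP run, which is what triggers the `AttributeError` above.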
neggert added the bug and help wanted labels on Feb 28, 2020
Borda added the need fix label on Feb 28, 2020
williamFalcon (Contributor) commented Mar 1, 2020

https://pytorch.org/docs/stable/notes/multiprocessing.html#reuse-buffers-passed-through-a-queue

torch.multiprocessing is a drop-in replacement for Python's multiprocessing module. It supports the exact same operations, but extends it so that all tensors sent through a multiprocessing.Queue will have their data moved into shared memory and will only send a handle to another process.

Looks like we can use multiprocessing.Queue?
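A rough sketch of that idea, not a proposed patch (the `worker` function and the result dict are illustrative). Each DDP process puts its result on a queue created in the parent, and the parent drains the queue once `spawn()` has joined the workers:

import torch
import torch.multiprocessing as mp


def worker(rank, queue):
    # Stand-in for the test result computed inside the DDP process.
    result = {"rank": rank, "loss": torch.tensor(0.123)}
    # Tensors put on a queue from torch.multiprocessing are moved to
    # shared memory; only a handle crosses the process boundary.
    queue.put(result)


if __name__ == "__main__":
    ctx = mp.get_context("spawn")
    queue = ctx.SimpleQueue()
    # join=True by default, so all workers have exited before we read.
    mp.spawn(worker, args=(queue,), nprocs=2)
    while not queue.empty():
        print(queue.get())

Since the children are joined by the time `spawn()` returns, the parent can read the queue afterwards and recover whatever the test produced.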

Borda (Member) commented Mar 2, 2020

@williamFalcon Could we check these two commits on GPU: 20d15c8 and 5dd2afe?

Borda added the priority: 0 label on Mar 2, 2020
Borda added this to the 0.7.0 milestone on Mar 2, 2020