Horovod with native 16 precision not working #3111

Closed
mutasem-mattar opened this issue Aug 23, 2020 · 2 comments · Fixed by #3404

@mutasem-mattar

🐛 Bug

To Reproduce

Steps to reproduce the behavior:

  1. Create a Trainer with precision=16 and distributed_backend="horovod", then call trainer.fit(model). Training fails with this traceback:
Traceback (most recent call last):
  File "/workspace/main_lightning.py", line 500, in <module>
    main(hyperparams)
  File "/workspace/main_lightning.py", line 492, in main
    trainer.fit(model)
  File "/usr/local/lib/python3.6/dist-packages/pytorch_lightning/trainer/states.py", line 48, in wrapped_fn
    result = fn(self, *args, **kwargs)
  File "/usr/local/lib/python3.6/dist-packages/pytorch_lightning/trainer/trainer.py", line 1068, in fit
    results = self.horovod_train(model)
  File "/usr/local/lib/python3.6/dist-packages/pytorch_lightning/trainer/distrib_parts.py", line 213, in horovod_train
    model, optimizers = model.configure_apex(amp, model, self.optimizers, self.amp_level)
  File "/usr/local/lib/python3.6/dist-packages/pytorch_lightning/core/lightning.py", line 954, in configure_apex
    model, optimizers = amp.initialize(model, optimizers, opt_level=amp_level)

Code sample

from pytorch_lightning import Trainer

trainer = Trainer(
    precision=16,
    gpus=1,
    distributed_backend="horovod",
)

Environment

  • PyTorch Version: 1.6.0+cu101
  • How you installed PyTorch: pip
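
The environment above reports PyTorch 1.6.0, the first release to ship native mixed precision (torch.cuda.amp), yet the traceback shows the Horovod path going through configure_apex and amp.initialize, which belong to NVIDIA Apex. Below is a minimal sketch of the kind of version check one might expect before falling back to Apex. The names select_amp_backend and parse_major_minor are hypothetical and used only for illustration; they are not Lightning functions, and this is not necessarily how the fix in #3404 works.

```python
import re


def parse_major_minor(version: str) -> tuple:
    """Extract (major, minor) from a version string such as '1.6.0+cu101'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def select_amp_backend(torch_version: str, apex_installed: bool) -> str:
    """Hypothetical helper: pick a 16-bit backend for the given environment."""
    if parse_major_minor(torch_version) >= (1, 6):
        # torch.cuda.amp is available from PyTorch 1.6 onward
        return "native"
    if apex_installed:
        # Older PyTorch: fall back to NVIDIA Apex if it is installed
        return "apex"
    raise RuntimeError("precision=16 needs PyTorch >= 1.6 or NVIDIA Apex")


# The environment from this report, with Apex assumed absent
print(select_amp_backend("1.6.0+cu101", apex_installed=False))  # native
```

Under this assumption, the reported setup would select the native backend and never reach amp.initialize.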
mutasem-mattar added the "bug" (Something isn't working) and "help wanted" (Open to be worked on) labels on Aug 23, 2020
@Borda
Member

Borda commented Aug 25, 2020

Mind having a look, @tgaddair? 🐰

@tgaddair
Contributor

Absolutely, let me take a look today and get back to you, @mutasem-mattar.
