How to compute a block of code using full precision when training with AMP? #5891
Replies: 4 comments
-
This issue has been automatically marked as stale because it hasn't had any recent activity. This issue will be closed in 7 days if no further activity occurs. Thank you for your contributions, Pytorch Lightning Team!
-
Nobody even replied...
-
Hi @GuillaumeTong! As you mentioned, you should be able to disable AMP for a region with PyTorch's autocast context manager. Sorry I don't have a clear answer for you, but trying it out will be your best bet to know if it will work, as it also depends on your particular use case. If you find any bugs, feel free to open an issue and we will try to help you asap 😄
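A minimal sketch of what that could look like inside a `training_step`, assuming PyTorch's native AMP is enabled by the Trainer; the model, loss, and optimizer below are placeholders:

```python
import torch
import torch.nn.functional as F
import pytorch_lightning as pl


class LitModel(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.net = torch.nn.Linear(32, 1)

    def forward(self, x):
        return self.net(x)

    def training_step(self, batch, batch_idx):
        x, y = batch
        y_hat = self(x)  # runs in mixed precision when the Trainer enables AMP

        # Disable autocast only for the numerically sensitive part.
        with torch.cuda.amp.autocast(enabled=False):
            # Tensors produced inside an autocast region may be float16,
            # so cast them back to float32 before the full-precision ops.
            loss = F.mse_loss(y_hat.float(), y.float())
        return loss

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=1e-3)
```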
-
Hey @GuillaumeTong, context managers can be nested. Can you try the following in your code: Trainer(amp_backend="native", precision=16)?
Best,
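For context, a sketch of that Trainer setup; `amp_backend="native"` and `precision=16` come from the suggestion above, while `gpus=1`, `model`, and `datamodule` are assumptions standing in for your own setup:

```python
import pytorch_lightning as pl

trainer = pl.Trainer(
    amp_backend="native",  # PyTorch's native AMP
    precision=16,          # mixed-precision training
    gpus=1,                # assumption: native AMP runs on the GPU here
)
# `model` and `datamodule` stand in for your own LightningModule and data.
trainer.fit(model, datamodule=datamodule)

# Inside the LightningModule, wrap only the sensitive region with
# torch.cuda.amp.autocast(enabled=False), as in the sketch above.
```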
-
I have encountered a few cases so far where some losses (such as the MS-SSIM implementation from VainF on GitHub) that used to work under full-precision training no longer work when using AMP.
I would like to know if there is any way to locally disable AMP for just a few steps of computation, either through Lightning or through native PyTorch. (It seems that torch.cuda.amp.autocast(enabled=False) might do the job, but I am worried about it interfering with Lightning's internals.)
Since I calculate a variety of losses and then combine them, all in the same step, I am hoping that disabling AMP only for the problematic loss will still let me keep most of AMP's advantages.
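A sketch of how that combination could look, assuming the pytorch_msssim package from VainF's repository and predictions/targets normalized to [0, 1]; the loss weighting is an arbitrary placeholder:

```python
import torch
import torch.nn.functional as F
from pytorch_msssim import ms_ssim  # assumption: VainF's pytorch-msssim package


def combined_loss(pred, target):
    # Losses that are stable under AMP stay inside the autocast region
    # and keep the memory/speed benefits.
    l1 = F.l1_loss(pred, target)

    # Compute only the MS-SSIM term in full precision.
    with torch.cuda.amp.autocast(enabled=False):
        pred32, target32 = pred.float(), target.float()
        # MS-SSIM expects image batches (N, C, H, W) that are large enough
        # for its default multi-scale settings (e.g. 256x256).
        msssim_term = 1.0 - ms_ssim(pred32, target32, data_range=1.0)

    # The 0.5 weight is a placeholder; tune it for your use case.
    return l1 + 0.5 * msssim_term
```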