Show CUDA matmul precision info only ever once #17960

awaelchli · 2023-07-01T05:22:32Z

What does this PR do?

Currently, whenever you call Fabric.setup() or Trainer.fit(), the info message

You are using a CUDA device ('NVIDIA A100-SXM4-40GB') that has Tensor Cores. To properly utilize them, you should set torch.set_float32_matmul_precision('medium' | 'high') which will trade-off precision for performance. For more details, read ...

is printed. Since this is a global setting for the entire script, I propose showing the info only once during the lifetime of the program. This avoids spamming the output with the same message.
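The show-once behavior proposed above can be sketched with a module-level flag. This is a minimal illustration, not necessarily how the PR implements it: the function name `show_matmul_precision_info_once`, the logger name, and the `_already_shown` flag are hypothetical, and the message text is truncated the same way as in the description.

```python
import logging

# Hypothetical logger name, used only for this sketch.
log = logging.getLogger("matmul_info_sketch")

# Process-wide flag: the precision setting is global, so one notice is enough.
_already_shown = False


def show_matmul_precision_info_once(device_name: str) -> bool:
    """Log the Tensor Core hint on the first call only.

    Returns True if the message was emitted, False if it was suppressed.
    """
    global _already_shown
    if _already_shown:
        return False
    _already_shown = True
    log.info(
        "You are using a CUDA device (%r) that has Tensor Cores. To properly "
        "utilize them, you should set "
        "torch.set_float32_matmul_precision('medium' | 'high') ...",
        device_name,
    )
    return True
```

If both a setup routine and a fit routine called this function, the message would be emitted at most once per process. Decorating the check function with `functools.lru_cache` is another common way to get a similar effect, although that variant is keyed on the function arguments.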

Before submitting

Was this discussed/agreed via a GitHub issue? (not for typos and docs)
Did you read the contributor guideline, Pull Request section?
Did you make sure your PR does only one thing, instead of bundling different changes together?
Did you make sure to update the documentation with your changes? (if necessary)
Did you write any new necessary tests? (not for typos and docs)
Did you verify new and existing tests pass locally with your changes?
Did you list all the breaking changes introduced by this pull request?
Did you update the CHANGELOG? (not for typos, docs, test updates, or minor internal changes/refactors)

PR review

Anyone in the community is welcome to review the PR.
Before you start reviewing, make sure you have read the review guidelines. In short, see the following bullet-list:

Reviewer checklist

Is this pull request ready for review? (if not, please submit in draft mode)
Check that all items from Before submitting are resolved
Make sure the title is self-explanatory and the description concisely explains the PR
Add labels and milestones (and optionally projects) to the PR so it can be classified

cc @Borda @carmocca @justusschock @awaelchli

github-actions · 2023-07-01T05:23:04Z

⚡ Required checks status: All passing 🟢

Groups summary

🟢 pytorch_lightning: Tests workflow

Check ID	Status
pl-cpu (macOS-11, lightning, 3.8, 1.11)	success	✅
pl-cpu (macOS-11, lightning, 3.9, 1.12)	success	✅
pl-cpu (macOS-11, lightning, 3.10, 1.13)	success	✅
pl-cpu (macOS-11, lightning, 3.10, 2.0)	success	✅
pl-cpu (macOS-11, lightning, 3.8, 1.11, oldest)	success	✅
pl-cpu (ubuntu-20.04, lightning, 3.8, 1.11)	success	✅
pl-cpu (ubuntu-20.04, lightning, 3.9, 1.12)	success	✅
pl-cpu (ubuntu-20.04, lightning, 3.10, 1.13)	success	✅
pl-cpu (ubuntu-20.04, lightning, 3.10, 2.0)	success	✅
pl-cpu (ubuntu-20.04, lightning, 3.8, 1.11, oldest)	success	✅
pl-cpu (windows-2022, lightning, 3.8, 1.11)	success	✅
pl-cpu (windows-2022, lightning, 3.9, 1.12)	success	✅
pl-cpu (windows-2022, lightning, 3.10, 1.13)	success	✅
pl-cpu (windows-2022, lightning, 3.10, 2.0)	success	✅
pl-cpu (windows-2022, lightning, 3.8, 1.11, oldest)	success	✅
pl-cpu (macOS-11, pytorch, 3.8, 1.13)	success	✅
pl-cpu (ubuntu-20.04, pytorch, 3.8, 1.13)	success	✅
pl-cpu (windows-2022, pytorch, 3.8, 1.13)	success	✅

These checks are required after the changes to src/lightning/fabric/accelerators/cuda.py.

🟢 pytorch_lightning: Azure GPU

Check ID	Status
pytorch-lightning (GPUs)	success	✅

These checks are required after the changes to src/lightning/fabric/accelerators/cuda.py.

🟢 pytorch_lightning: Benchmarks

Check ID	Status
lightning.Benchmarks	success	✅

These checks are required after the changes to src/lightning/fabric/accelerators/cuda.py.

🟢 fabric: Docs

Check ID	Status
make-doctest (fabric)	success	✅
make-html (fabric)	success	✅

These checks are required after the changes to src/lightning/fabric/accelerators/cuda.py.

🟢 lightning_fabric: CPU workflow

Check ID	Status
fabric-cpu (macOS-11, lightning, 3.8, 1.11)	success	✅
fabric-cpu (macOS-11, lightning, 3.9, 1.12)	success	✅
fabric-cpu (macOS-11, lightning, 3.10, 1.13)	success	✅
fabric-cpu (macOS-11, lightning, 3.10, 2.0)	success	✅
fabric-cpu (macOS-11, lightning, 3.8, 1.11, oldest)	success	✅
fabric-cpu (ubuntu-20.04, lightning, 3.8, 1.11)	success	✅
fabric-cpu (ubuntu-20.04, lightning, 3.9, 1.12)	success	✅
fabric-cpu (ubuntu-20.04, lightning, 3.10, 1.13)	success	✅
fabric-cpu (ubuntu-20.04, lightning, 3.10, 2.0)	success	✅
fabric-cpu (ubuntu-20.04, lightning, 3.8, 1.11, oldest)	success	✅
fabric-cpu (windows-2022, lightning, 3.8, 1.11)	success	✅
fabric-cpu (windows-2022, lightning, 3.9, 1.12)	success	✅
fabric-cpu (windows-2022, lightning, 3.10, 1.13)	success	✅
fabric-cpu (windows-2022, lightning, 3.10, 2.0)	success	✅
fabric-cpu (windows-2022, lightning, 3.8, 1.11, oldest)	success	✅
fabric-cpu (macOS-11, fabric, 3.8, 1.13)	success	✅
fabric-cpu (ubuntu-20.04, fabric, 3.8, 1.13)	success	✅
fabric-cpu (windows-2022, fabric, 3.8, 1.13)	success	✅

These checks are required after the changes to src/lightning/fabric/accelerators/cuda.py, tests/tests_fabric/accelerators/test_cuda.py.

🟢 lightning_fabric: Azure GPU

Check ID	Status
lightning-fabric (GPUs)	success	✅

These checks are required after the changes to src/lightning/fabric/accelerators/cuda.py, tests/tests_fabric/accelerators/test_cuda.py.

🟢 mypy

Check ID	Status
mypy	success	✅

These checks are required after the changes to src/lightning/fabric/accelerators/cuda.py.

🟢 install

Check ID	Status
install-pkg (ubuntu-22.04, app, 3.8)	success	✅
install-pkg (ubuntu-22.04, app, 3.10)	success	✅
install-pkg (ubuntu-22.04, fabric, 3.8)	success	✅
install-pkg (ubuntu-22.04, fabric, 3.10)	success	✅
install-pkg (ubuntu-22.04, pytorch, 3.8)	success	✅
install-pkg (ubuntu-22.04, pytorch, 3.10)	success	✅
install-pkg (ubuntu-22.04, lightning, 3.8)	success	✅
install-pkg (ubuntu-22.04, lightning, 3.10)	success	✅
install-pkg (ubuntu-22.04, notset, 3.8)	success	✅
install-pkg (ubuntu-22.04, notset, 3.10)	success	✅
install-pkg (macOS-12, app, 3.8)	success	✅
install-pkg (macOS-12, app, 3.10)	success	✅
install-pkg (macOS-12, fabric, 3.8)	success	✅
install-pkg (macOS-12, fabric, 3.10)	success	✅
install-pkg (macOS-12, pytorch, 3.8)	success	✅
install-pkg (macOS-12, pytorch, 3.10)	success	✅
install-pkg (macOS-12, lightning, 3.8)	success	✅
install-pkg (macOS-12, lightning, 3.10)	success	✅
install-pkg (macOS-12, notset, 3.8)	success	✅
install-pkg (macOS-12, notset, 3.10)	success	✅
install-pkg (windows-2022, app, 3.8)	success	✅
install-pkg (windows-2022, app, 3.10)	success	✅
install-pkg (windows-2022, fabric, 3.8)	success	✅
install-pkg (windows-2022, fabric, 3.10)	success	✅
install-pkg (windows-2022, pytorch, 3.8)	success	✅
install-pkg (windows-2022, pytorch, 3.10)	success	✅
install-pkg (windows-2022, lightning, 3.8)	success	✅
install-pkg (windows-2022, lightning, 3.10)	success	✅
install-pkg (windows-2022, notset, 3.8)	success	✅
install-pkg (windows-2022, notset, 3.10)	success	✅

These checks are required after the changes to src/lightning/fabric/accelerators/cuda.py.

🟢 link-check

Check ID	Status
check-md-links / markdown-link-check	success	✅

These checks are required after the changes to src/lightning/fabric/CHANGELOG.md, src/lightning/pytorch/CHANGELOG.md.

Thank you for your contribution! 💜

Note
This comment is automatically generated and is refreshed every 180 seconds for 60 minutes. If you have any other questions, contact carmocca for help.

codecov · 2023-07-03T16:20:29Z

Codecov Report

Merging #17960 (536a78d) into master (199dc8f) will decrease coverage by 23%.
The diff coverage is 100%.

Additional details and impacted files

@@            Coverage Diff            @@
##           master   #17960     +/-   ##
=========================================
- Coverage      84%      61%    -23%     
=========================================
  Files         425      420      -5     
  Lines       32053    31964     -89     
=========================================
- Hits        26816    19502   -7314     
- Misses       5237    12462   +7225


awaelchli added 2 commits June 30, 2023 22:17

show cuda matmul info only ever once

98acd48

update test

f24c15b

awaelchli requested review from carmocca and justusschock as code owners July 1, 2023 05:22

awaelchli added the bug (Something isn't working) and fabric (lightning.fabric.Fabric) labels Jul 1, 2023

awaelchli added this to the 2.0.x milestone Jul 1, 2023

changelog

fa69129

awaelchli requested a review from williamFalcon as a code owner July 1, 2023 05:24

awaelchli added the pl (Generic label for PyTorch Lightning package) and accelerator: cuda (Compute Unified Device Architecture GPU) labels Jul 1, 2023

update test

dbece26

carmocca approved these changes Jul 1, 2023


mergify bot added the has conflicts label Jul 3, 2023

justusschock approved these changes Jul 3, 2023


Merge branch 'master' into bugfix/matmul-message

536a78d

mergify bot added the ready (PRs ready to be merged) label and removed the has conflicts label Jul 3, 2023

awaelchli merged commit c5fae64 into master Jul 4, 2023
101 checks passed

awaelchli deleted the bugfix/matmul-message branch July 4, 2023 07:47

Borda pushed a commit that referenced this pull request Jul 7, 2023

Show CUDA matmul precision info only ever once (#17960)

baa4ae0

(cherry picked from commit c5fae64)

Borda pushed a commit that referenced this pull request Jul 7, 2023

Show CUDA matmul precision info only ever once (#17960)

4a0c1f5

(cherry picked from commit c5fae64)

Borda pushed a commit that referenced this pull request Jul 7, 2023

Show CUDA matmul precision info only ever once (#17960)

4863382

(cherry picked from commit c5fae64)

Borda pushed a commit that referenced this pull request Jul 7, 2023

Show CUDA matmul precision info only ever once (#17960)

0024df1

(cherry picked from commit c5fae64)

lantiga pushed a commit that referenced this pull request Jul 10, 2023

Show CUDA matmul precision info only ever once (#17960)

ddf37cf

(cherry picked from commit c5fae64)
