Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Not so fast #6

Open
brycexu opened this issue Aug 30, 2019 · 2 comments
Open

Not so fast #6

brycexu opened this issue Aug 30, 2019 · 2 comments

Comments

@brycexu
Copy link

brycexu commented Aug 30, 2019

Hi,

I ran your code on a single GPU, which is NVIDIA GeForce GTX 1080Ti, but training 24 epochs took me nearly 4 minutes.

I did not change anything in your code except two paths.

Is that we have to configure something before we run your model ?

Thank you.

@bkj
Copy link

bkj commented Aug 30, 2019

A V100 vs the 1080ti will make a big difference I believe, because of the half precision stuff

@99991
Copy link

99991 commented Mar 25, 2021

The GPU makes a huge difference. For example, the 1080 Ti actually has worse GFLOPS with half precision compared to single precision (177 vs 10,339 GFLOPS). I think the reasoning was that NVIDIA would still be able to sell data center GPUs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants