-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
about the training efficiency #1
Comments
Hi, for the first question, we didn't use any speed-up trick, and each batch might take about 20 seconds. Maybe you should double-check which function takes up the most time. For the second question, we use tensorflow 1.6 version. And you should set the |
Hi, when I try to run this code, The tensorflow1.x legacy code made me mad. And it can‘t work in my CUDA version. So I want to ask could you share your reproduced pytorch code. I really need it and appreciate. |
Hi, unfortunately, we didn't prepare pytorch version of our code. We used TensorFlow 1.6.0, and I think you can directly run the code based on this version. |
Hi,you mentioned in your paper that you can run an epoch in 9.7 hours, but when I reproduced it with pytorch, it took me five or six minutes to run a batch. Do you have any tricks to improve the training efficiency?
One more question, when I run your original code with tensorflow, it always reminds me "failed to get article or abstract from example", do you know why? I don't know if it is related to using a lower version of tensorflow, because it will also report warnings that many methods have been deprecated.
thanks very much
The text was updated successfully, but these errors were encountered: