Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

about the training efficiency #1

Open
yyyf-g opened this issue Sep 15, 2022 · 3 comments
Open

about the training efficiency #1

yyyf-g opened this issue Sep 15, 2022 · 3 comments

Comments

@yyyf-g
Copy link

yyyf-g commented Sep 15, 2022

Hi,you mentioned in your paper that you can run an epoch in 9.7 hours, but when I reproduced it with pytorch, it took me five or six minutes to run a batch. Do you have any tricks to improve the training efficiency?

One more question, when I run your original code with tensorflow, it always reminds me "failed to get article or abstract from example", do you know why? I don't know if it is related to using a lower version of tensorflow, because it will also report warnings that many methods have been deprecated.

thanks very much

@iriscxy
Copy link
Owner

iriscxy commented Sep 15, 2022

Hi, for the first question, we didn't use any speed-up trick, and each batch might take about 20 seconds. Maybe you should double-check which function takes up the most time.

For the second question, we use tensorflow 1.6 version. And you should set the json_input_key and json_target_key in corresponding to the keys in the json data. For wikisum datset, the key is document and summary.

@otakusbear
Copy link

Hi,you mentioned in your paper that you can run an epoch in 9.7 hours, but when I reproduced it with pytorch, it took me five or six minutes to run a batch. Do you have any tricks to improve the training efficiency?

One more question, when I run your original code with tensorflow, it always reminds me "failed to get article or abstract from example", do you know why? I don't know if it is related to using a lower version of tensorflow, because it will also report warnings that many methods have been deprecated.

thanks very much

Hi, when I try to run this code, The tensorflow1.x legacy code made me mad. And it can‘t work in my CUDA version. So I want to ask could you share your reproduced pytorch code. I really need it and appreciate.
thanks very much

@iriscxy
Copy link
Owner

iriscxy commented Feb 2, 2024

Hi, unfortunately, we didn't prepare pytorch version of our code. We used TensorFlow 1.6.0, and I think you can directly run the code based on this version.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants