Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training data query #3

Open
riddhirdasani opened this issue Apr 2, 2018 · 5 comments
Open

Training data query #3

riddhirdasani opened this issue Apr 2, 2018 · 5 comments

Comments

@riddhirdasani
Copy link

riddhirdasani commented Apr 2, 2018

Hello Shashi, I am unable to understand why do we need these three directories and what role do they play in training? 1. preprocessed_data_directory 2. gold_summary_directory 3. doc_sentence_directory. Can you provide little more insights of these three.? When I was trying , one epoch has already finished and then this error appeared.

@shashiongithub
Copy link
Owner

For training you only need (1). (2) is used to estimate ROUGE scores. (1) and (3) is used during decoding.
What error do you get after the first epoch?

@riddhirdasani
Copy link
Author

File "/home/beast/riddhi/main/data_utils.py", line 116, in process_predictions_rankedtopthree
docsents = open(sent_filename).readlines()
FileNotFoundError: [Errno 2] No such file or directory: '/home/beast/riddhi/main/JP_herman/cnn/validation-sent/8f6b39e6c63b0ae3546cdfeb8209693f292b060e.summary.final.org_sents'

sent_filename which got generated in line 115 by using FLAGS.doc_sentence_directory, cannot be opened in line 116. I just presumed and made directory , what exactly should be there in this directory?

@shashiongithub
Copy link
Owner

It should point to 3) doc_sentence_directory directory.

@riddhirdasani
Copy link
Author

Yes and what should be there, from where can I get it?

@shashiongithub
Copy link
Owner

Please check: #2

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants