Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the Results in the paper. #1

Open
littlefive5 opened this issue Apr 22, 2021 · 6 comments
Open

About the Results in the paper. #1

littlefive5 opened this issue Apr 22, 2021 · 6 comments

Comments

@littlefive5
Copy link

I rerun the code in the IR repo. However, I just get the result 22%, which is far lower than the 34% in the paper for USMLE. Is there any other setting for the IR method?

@jind11
Copy link
Owner

jind11 commented Apr 22, 2021

I just now updated the data in the github repo by adding the 4 options version of data I previously used. Let me whether you can replicate the numbers in my paper using this version of data. Thanks!

@zyngielg
Copy link

I tried running the solvers/textsearch.py on the 4_options/dev.jsonl:

  • for topn=10 the accuracy was 28.7%
  • for topn=5 the accuracy was 27.7%

@littlefive5
Copy link
Author

I tried running the solvers/textsearch.py on the 4_options/dev.jsonl:

  • for topn=10 the accuracy was 28.7%
  • for topn=5 the accuracy was 27.7%

Me too. I also tried TW dataset and I got about 31% when the topn is 15.

@jind11
Copy link
Owner

jind11 commented Apr 29, 2021

hmmm, this code was written a year ago and I tried my best to release the old code here without running and verifying. Thanks for helping me find out the issue here. I am gonna check on my side the source of performance discrepancy but it may be finished after 5/17 which is the conference ddl I am now busy with. I am so sorry for the slow process. Thank you for your great patience.

@MotzWanted
Copy link

@jind11, did you get a chance to solve the issue?

And is it possible for you to release the reader model as well?

@vlievin
Copy link

vlievin commented Nov 26, 2021

@jind11 What is the status on reproducing the results? Are you 100% the dataset is correct? Great work by the way, that's a very nice dataset you have built here!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants