Reproduce results in the paper #10

a3616001 · 2019-10-01T18:08:42Z

Hi,

I was trying to reproduce results by running your code, and couldn't get exactly the same precision on SQuAD.
Here is what I got for bert_large model on SQuAD:
all_samples: 303
list_of_results: 303
global MRR: 0.3018861233236291
global Precision at 10: 0.5676567656765676
global Precision at 1: 0.16831683168316833

However, in the paper, the table shows that there should be 305 samples and the precision should be 17.4%.

At first, I guessed that it is because 2 samples are excluded because their object labels are out of the common vocabulary, but even after testing without common vocabulary, I got global Precision at 1: 0.1704918, which is still different to results in the paper.

Is there a way to reproduce the same results in the paper?
Please correct me if I made any mistakes! Thanks!

The text was updated successfully, but these errors were encountered:

fabiopetroni · 2019-10-02T09:47:01Z

Hey @a3616001,

strange.
Just re-executed the run_experiments scripts and I get P@1 : 0.1737704918032787 for the BERT-large model. Are you using BERT-large?
Also, the script should use all the 305 examples.
This is how your output should look like:

jeslev · 2021-01-28T20:31:23Z

Hi, @a3616001 did you finally get the results from the paper?
I got the same results as you (skipping 2 examples after the filter_samples function).

Thanks in advance

Hannibal046 · 2022-05-02T12:45:08Z

Same problem

fabiopetroni closed this as completed Oct 14, 2019

Hannibal046 mentioned this issue May 2, 2022

🐛Bug for common_vocab #51

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reproduce results in the paper #10

Reproduce results in the paper #10

a3616001 commented Oct 1, 2019

fabiopetroni commented Oct 2, 2019 •

edited

Loading

jeslev commented Jan 28, 2021

Hannibal046 commented May 2, 2022

Reproduce results in the paper #10

Reproduce results in the paper #10

Comments

a3616001 commented Oct 1, 2019

fabiopetroni commented Oct 2, 2019 • edited Loading

jeslev commented Jan 28, 2021

Hannibal046 commented May 2, 2022

fabiopetroni commented Oct 2, 2019 •

edited

Loading