-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to evaluate results after prediction? #3
Comments
I've figured out solusions about above questions. With the default parameters in codebase, I got 26.15 with BM25. However, the EPR performs even worse (22.9) after training the BERT-based retriever. I run EPR with |
Hey, this might be related to the fact that you are using a single gpu, the DPR setup benefits greatly from a large batch size. |
Hi, Here is the full list of commends:
On mtop dataset, number of training data is 95961. The training loss is around 0.07 after 30 epoches, avg loss per batch is 0.071158. As I'm using A100 80G, I only use two gpus as it is sufficient for 120 batch size. |
I think dpr_epochs=120 is the correct hyperparameter parameter, the contrastive learning objective is able to improve greatly with more compute. |
I got 49.17 after training 120 epochs on mtop, it's still weird... 😂 |
I will run some tests of my own and try to make sense of this thing. |
Hi Ohad, do you have any updates? :) |
Nice work! Anyone know the enviroment requirement file of EPR? |
Hi Ohad,
Thanks for your awesome work! I have several questions when using the code:
The text was updated successfully, but these errors were encountered: