Some results issue #19

Closed
viyjy opened this issue Oct 26, 2021 · 4 comments

@viyjy

viyjy commented Oct 26, 2021

Hi Junnan, I have the following questions about the results. I hope you can help clarify them. Thanks.

  1. VQA:
    After fine-tuning on the VQA dataset, I get the result folder shown in the screenshot. Which JSON file should I use to obtain the test-dev and test-std results?
    [image: result folder screenshot]

  2. SNLI-VE:
    This is the log file after fine-tuning on the SNLI-VE dataset. The best-epoch value is never updated, so it is always 0. Should I pick the row with the best val accuracy as the final result?
    [image: log file screenshot]

  3. Grounding:
    This is the log file after fine-tuning on Ref-COCO. Should I pick the row with the best val_d as the final result?
    [image: log file screenshot]

  4. NLVR2
    This is the log file after fine-tuning on NLVR2, but I didn't find the dev and test-P results reported in your paper. Any idea?
    [image: log file screenshot]

@LiJunnan1992
Contributor

Hi, thanks for your interest. Here are my answers.

  1. VQA: vqa_result_epoch7.json is the final result which collects results from all ranks.
  2. SNLI-VE: thanks for spotting my mistake. Yes you should use the row with best val_acc.
  3. Grounding: if I remember correctly, in the paper I just reported the last epoch's result.
  4. NLVR2: dev corresponds to val, and test-P corresponds to test.
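
For anyone applying answers 2 to 4 to their own logs, here is a minimal sketch of the two row-selection rules (best validation score versus last epoch). It assumes the log file holds one JSON object per line; the key names `epoch`, `val_acc`, and `test_acc`, the helper names, and the numbers in `example_log` are all hypothetical and made up for illustration, so adjust them to whatever your log actually contains.

```python
import json


def load_log_rows(text):
    """Parse one JSON object per non-empty line (assumed log layout)."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def best_val_row(rows, val_key="val_acc"):
    """Return the row with the highest validation score (first row wins on ties)."""
    return max(rows, key=lambda r: float(r[val_key]))


def last_epoch_row(rows, epoch_key="epoch"):
    """Return the row with the largest epoch index."""
    return max(rows, key=lambda r: int(r[epoch_key]))


# Hypothetical log contents, invented for this example only.
example_log = """
{"epoch": 0, "val_acc": "78.10", "test_acc": "78.40"}
{"epoch": 1, "val_acc": "79.30", "test_acc": "79.00"}
{"epoch": 2, "val_acc": "79.05", "test_acc": "79.20"}
"""

rows = load_log_rows(example_log)

# SNLI-VE / NLVR2 rule from the reply: take the row with the best val score.
best = best_val_row(rows)

# Grounding rule from the reply: take the last epoch's row.
last = last_epoch_row(rows)

# NLVR2 naming from the reply: dev is val, test-P is test.
nlvr2_report = {"dev": best["val_acc"], "test-P": best["test_acc"]}

print("best-val row:", best)
print("last-epoch row:", last)
print("NLVR2 report:", nlvr2_report)
```

The test number is read from the row chosen by validation score, not from the row with the highest test score, so the selection never looks at the test split.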

@viyjy
Author

viyjy commented Oct 26, 2021

Thanks very much!

@viyjy viyjy closed this as completed Oct 26, 2021
@haoshuai714

Hello! Could the NLVR2 dataset be shared?

@viyjy
Author

viyjy commented Jan 18, 2022
