Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use version 2 of the Q2Q dataset #1

Open
wael34218 opened this issue Apr 9, 2019 · 7 comments
Open

Use version 2 of the Q2Q dataset #1

wael34218 opened this issue Apr 9, 2019 · 7 comments
Assignees
Labels
enhancement New feature or request

Comments

@wael34218
Copy link

The new data can be found under this link:
https://s3-eu-west-1.amazonaws.com/nsurlworkshop/q2q_similarity_workshop_v2.tsv

This repository will be released for the public, test data shouldn't be in the repo nor its git history.

@wael34218 wael34218 added the enhancement New feature or request label Apr 9, 2019
@hseelawi
Copy link
Contributor

Done. Please check and let me know if all is good.

@hinnaweali
Copy link

Hello,

Could you please share with us the public and private test sets so we can have a fair comparison results with the group that won the competition in 2019?

@hseelawi
Copy link
Contributor

hseelawi commented Jan 8, 2021

Hello @hinnaweali,

I am attaching a zip file that contains all the files we produced for this workshop. It has both the public (train.tsv) and private (test.tsv) datasets.

nsurl-2019-task8.zip

@hinnaweali
Copy link

hinnaweali commented Jan 8, 2021

Thank you @hseelawi.

I downloaded those files from Kaggle website. You mentioned there, the private leaderboard was calculated with approximately 70% of the test data while the public leaderboard was calculated with approximately 30% of the test data.
I need exactly those splits to evaluate my model on both 30% and 70% test data sets.
Best,

@hseelawi
Copy link
Contributor

hseelawi commented Jan 8, 2021

Hello @hinnaweali again,

Actually we can't get the exact split as this is handled by kaggle itself, and is not available even for us the organizers to download. However, what matters is the private score, which you can obtain by submitting a late submission to the leaderboard directly, at the following link: https://www.kaggle.com/c/nsurl-2019-task8/submit.

[Update]

You can also find the public score after you do a late submission as well.

@hinnaweali
Copy link

hinnaweali commented Jan 8, 2021

Hi @hseelawi,

This is really a nice tip.
Thank you

@hseelawi
Copy link
Contributor

hseelawi commented Jan 8, 2021

You are most welcome. Please let us know should you need any further information or help.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants