How to prevent models from cheating on the test set? #75

Open
haixingDu opened this issue Jan 9, 2024 · 1 comment

@haixingDu

Hello, I'm intrigued by TGB and have a few questions for which I'm seeking answers.

1. In the edge prediction test set (tgbl-wiki-v2), is the data labeled?
2. If it is labeled, what measures ensure that the testing process is fair? Specifically, how are manipulations after model prediction identified and prevented, for example adjusting the position of positive samples in the MRR ranking to artificially inflate performance?

Your responses to these queries would be highly appreciated.

shenyangHuang self-assigned this Jan 9, 2024
shenyangHuang added the question label Jan 9, 2024
@shenyangHuang
Owner

Hi Haixing,

Thanks for your interest in our work and the questions. Hope the following answers address your concerns.

  1. The edge labels are provided, but only for evaluation purposes. For an example of how to use the TGB evaluator, see an example here.
  2. The TGB datasets are designed for research purposes, and we trust users to use the test set only for evaluation when submitting to the TGB leaderboard. This is similar to many open benchmark initiatives such as OGB and TDC. When submitting to the leaderboard, please fill in the Google form. We ask authors to provide a link to their paper and public code repository, so that the results can be verified by the community.
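
To make the concern in question 2 concrete, here is a minimal sketch of how MRR depends on the score of the positive edge relative to its negative samples. It is an illustration only, not the TGB evaluator: the function names `reciprocal_rank` and `mean_reciprocal_rank`, the pessimistic tie-breaking rule, and the toy scores are all assumptions made for this example.

```python
import numpy as np


def reciprocal_rank(pos_score, neg_scores):
    """Reciprocal rank of one positive edge among its negative samples.

    Assumption: ties are broken pessimistically, so a negative with the
    same score as the positive is ranked ahead of it.
    """
    neg_scores = np.asarray(neg_scores, dtype=float)
    rank = 1 + int(np.sum(neg_scores >= pos_score))
    return 1.0 / rank


def mean_reciprocal_rank(pos_scores, neg_scores_per_query):
    """Average the reciprocal rank over all positive test edges."""
    rr = [reciprocal_rank(p, n) for p, n in zip(pos_scores, neg_scores_per_query)]
    return float(np.mean(rr))


# Toy scores (hypothetical): two positive edges, three negatives each.
pos_scores = [0.9, 0.4]
neg_scores = [[0.1, 0.3, 0.5], [0.8, 0.6, 0.2]]

honest_mrr = mean_reciprocal_rank(pos_scores, neg_scores)

# If the test labels were used to push every positive to the top,
# the metric would saturate regardless of model quality.
manipulated_mrr = mean_reciprocal_rank([1.0, 1.0], neg_scores)
```

In this toy case the first positive ranks 1st and the second ranks 3rd, so the honest MRR is (1 + 1/3) / 2, while the manipulated scores give an MRR of 1.0. Nothing in the metric itself can detect such a change, which is why verification relies on the public code repository requested at submission time.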

Best,
Andy
