Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Forbid task metadata and add upload functions #1362

Merged
merged 21 commits into from
Dec 4, 2024

Conversation

Samoed
Copy link
Collaborator

@Samoed Samoed commented Oct 30, 2024

Checklist

  • Run tests locally to make sure nothing is broken using make test.
  • Run the formatter to format the code using make lint.

Some retrieval tasks need to be reuploaded because they are loaded from different repositories. I’ve created upload functions to convert these datasets into our current format. I tested each reuploaded dataset, and the scores matched, except for mFollowIR (rus). In the main branch, the main_score is -0.039465099069488106, whereas the reuploaded dataset gives -0.031187925634321677. However, this run was only for testing purposes.

Initially, I tried adding this script to the mteb folder, but it gave an error: AttributeError: module 'logging' has no attribute 'getLogger'. So, I moved it to the scripts folder.

Additionally, some tasks may not be imported successfully. For example, I tried to load IndicXnliPairClassification, but it resulted in an error.

uploaded.zip
mteb main.zip

@KennethEnevoldsen
Copy link
Contributor

@Samoed is this PR stale?

@Samoed
Copy link
Collaborator Author

Samoed commented Nov 11, 2024

Yes, some datasets (mostly from CMTEB) load data from multiple repositories on HF, so we need to convert them first to complete this PR.

@KennethEnevoldsen KennethEnevoldsen changed the base branch from main to v2.0.0 November 11, 2024 10:02
@Samoed Samoed closed this Nov 26, 2024
@Samoed Samoed reopened this Nov 26, 2024
# Conflicts:
#	mteb/abstasks/TaskMetadata.py
@Samoed Samoed changed the title feat: Forbid task metadata feat: Forbid task metadata and add upload functions Dec 1, 2024
@Samoed Samoed marked this pull request as ready for review December 1, 2024 10:51
@orionw
Copy link
Contributor

orionw commented Dec 1, 2024

I’m traveling and won’t be at a computer til the end of the week, but this looks good.

Are there any datasets that are still not converted?

And is the mFollowIR Russian still an issue? FWIW the v2 branch fixed a small bug that the current one doesn’t have, so the numbers from main and v2 will be different. The number looks reasonable and I wouldn’t worry about it.

@Samoed
Copy link
Collaborator Author

Samoed commented Dec 1, 2024

I've tested from v2.0.0 branch, but mostly this is not the issue, because I just tested how uploader would work with multilingual tasks. I didn't change mFollowIR dataset version

@Samoed
Copy link
Collaborator Author

Samoed commented Dec 3, 2024

@KennethEnevoldsen Can you review, please?

Copy link
Contributor

@KennethEnevoldsen KennethEnevoldsen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few minor things and a suggestion to move the upload utility to the class object (assuming we want to maintain it)

Generally though this looks great!

tests/test_TaskMetadata.py Show resolved Hide resolved
scripts/upload_utils.py Outdated Show resolved Hide resolved
scripts/upload_utils.py Outdated Show resolved Hide resolved
scripts/upload_utils.py Outdated Show resolved Hide resolved
Samoed and others added 4 commits December 4, 2024 23:44
…al.py

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>
* fix FilipinoHateSpeechClassification

* update tests
Copy link
Contributor

@KennethEnevoldsen KennethEnevoldsen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Loving it <3!

@Samoed Samoed merged commit dec5d6a into embeddings-benchmark:v2.0.0 Dec 4, 2024
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants