Data is Better Together

Data is Better Together is a collab between 🤗 Hugging Face, 🏓 Argilla, and the Open Source ML community. Our goal is to empower the open source community to collectively build impactful datasets.

What have we done so far?

The community has created a dataset of 10k prompts DIBT/10k_prompts_ranked ranked by quality as part of Data is Better Together.

What are currently working on?

We are working on several strands of work. Here are current active projects.

1. Prompt ranking

Our first DIBT activity is focused on ranking the quality of prompts. We have already released version 1.0 of this dataset DIBT/10k_prompts_ranked. So far over 385 people have contributed annotations to this dataset but we are continuing to collect more annotations!

Follow the progress of this effort in this dashboard
You can contribute to the ranking of prompts here

2. Multilingual Prompt Evaluation Project (MPEP)

There are not enough language-specific benchmarks for open LLMs! We want to create a leaderboard for more languages by leveraging the community! You can find more information about this project in the MPEP README.

Want to contribute translations? Currently, these translation efforts are underway:

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
kto-preference		kto-preference
prompt_translation		prompt_translation
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data is Better Together

What have we done so far?

What are currently working on?

1. Prompt ranking

2. Multilingual Prompt Evaluation Project (MPEP)

About

Releases

Packages

Languages

ignacioct/data-is-better-together

Folders and files

Latest commit

History

Repository files navigation

Data is Better Together

What have we done so far?

What are currently working on?

1. Prompt ranking

2. Multilingual Prompt Evaluation Project (MPEP)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages