-
Notifications
You must be signed in to change notification settings - Fork 739
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Argilla 2.4: Curate Hub Datasets with Human Feedback—No Code Needed #2448
base: main
Are you sure you want to change the base?
Conversation
|
||
This new feature enables a new set of use cases for building high quality datasets on the Hub: | ||
|
||
- You have just published an open dataset and want the community to contribute: import it into a public Argilla Space and share the URL with the world! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- You have just published an open dataset and want the community to contribute: import it into a public Argilla Space and share the URL with the world! | |
- If you are a dataset publisher and want the community to contribute, import it into a public Argilla Space and share the URL with the world! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't you think "dataset publisher" sounds like a job title? I'm not sure how many people would feel represented by that.
> [!NOTE] | ||
> In this first version, the Hub dataset must be public, if you are interested in support for private datasets, we’d love to hear from you on [GitHub](https://github.com/argilla-io/argilla). | ||
|
||
The dataset’s columns will be mapped to fields and questions in Argilla. Fields include the data that you want feedback on, like text, chats, or images. Questions are the feedback you want to collect, like labels, ratings, rankings, or text. If you need, you can add and configure questions or remove unnecessary fields. All of the changes that you make will be previewed in real time, so you can see how your changes affect the dataset. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The dataset’s columns will be mapped to fields and questions in Argilla. Fields include the data that you want feedback on, like text, chats, or images. Questions are the feedback you want to collect, like labels, ratings, rankings, or text. If you need, you can add and configure questions or remove unnecessary fields. All of the changes that you make will be previewed in real time, so you can see how your changes affect the dataset. | |
The goal is to map dataset columns to fields and questions in Argilla. Fields include the data you want feedback on, like text, chats, or images. Questions are the feedback you wish to collect, like labels, ratings, rankings, or text. You can add and configure questions or remove unnecessary fields if needed. You can preview all changes in real time to get a clear idea of the Argilla dataset you’re configuring. |
Co-authored-by: Daniel Vila Suero <daniel.vila@huggingface.co>
|
||
This new feature enables a new set of use cases for building high quality datasets on the Hub: | ||
|
||
- You have just published an open dataset and want the community to contribute: import it into a public Argilla Space and share the URL with the world! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't you think "dataset publisher" sounds like a job title? I'm not sure how many people would feel represented by that.
@@ -0,0 +1,55 @@ | |||
--- | |||
title: "Argilla 2.4: Easily Add Human Feedback to Hub Datasets—No Code Required" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do you think this title matches well with the thumbnail @dvsrepo? I see a big semantic difference between "creating datasets" and "adding human feedback to datasets".
Congratulations! You've made it this far! Once merged, the article will appear at https://huggingface.co/blog. Official articles
require additional reviews. Alternatively, you can write a community article following the process here.
Preparing the Article
You're not quite done yet, though. Please make sure to follow this process (as documented here):
md
file. You can also specifyguest
ororg
for the authors.Here is an example of a complete PR: #2382
Getting a Review
Please make sure to get a review from someone on your team or a co-author.
Once this is done and once all the steps above are completed, you should be able to merge.
There is no need for additional reviews if you and your co-authors are happy and meet all of the above.
Feel free to add @osanseviero or @pcuenca as reviewers if you want a final check. They'll be biased toward light reviews
(e.g., check for proper metadata) rather than content reviews unless explicitly asked.