Argilla 2.4: Curate Hub Datasets with Human Feedback—No Code Needed #2448

Draft
wants to merge 8 commits into
base: main
from

Conversation

@nataliaElv nataliaElv commented Oct 28, 2024

Congratulations! You've made it this far! Once merged, the article will appear at https://huggingface.co/blog. Official articles
require additional reviews. Alternatively, you can write a community article following the process here.

Preparing the Article

You're not quite done yet, though. Please make sure to follow this process (as documented here):

  • Add an entry to _blog.yml.
  • Add a thumbnail. There are no requirements here, but there is a template if it's helpful.
  • Check you use a short title and blog path.
  • Upload any additional assets (such as images) to the Documentation Images repo. This is to reduce bloat in the GitHub base repo when cloning and pulling. Try to have small images to avoid a slow or expensive user experience.
  • Add metadata (such as authors) to your md file. You can also specify guest or org for the authors.
  • Ensure the publication date is correct.
  • Preview the content. A quick way is to paste the markdown content in https://huggingface.co/new-blog. Do not click publish, this is just a way to do an early check.

Here is an example of a complete PR: #2382

Getting a Review

Please make sure to get a review from someone on your team or a co-author.
Once this is done and once all the steps above are completed, you should be able to merge.
There is no need for additional reviews if you and your co-authors are happy and meet all of the above.

Feel free to add @osanseviero or @pcuenca as reviewers if you want a final check. They'll be biased toward light reviews
(e.g., check for proper metadata) rather than content reviews unless explicitly asked.

argilla-ui-hub.md

This new feature enables a new set of use cases for building high-quality datasets on the Hub:

- You have just published an open dataset and want the community to contribute: import it into a public Argilla Space and share the URL with the world!
Member

Suggested change
- You have just published an open dataset and want the community to contribute: import it into a public Argilla Space and share the URL with the world!
- If you are a dataset publisher and want the community to contribute, import it into a public Argilla Space and share the URL with the world!

Author

Don't you think "dataset publisher" sounds like a job title? I'm not sure how many people would feel represented by that.

argilla-ui-hub.md
> [!NOTE]
> In this first version, the Hub dataset must be public. If you are interested in support for private datasets, we’d love to hear from you on [GitHub](https://github.com/argilla-io/argilla).

The dataset’s columns will be mapped to fields and questions in Argilla. Fields include the data that you want feedback on, like text, chats, or images. Questions are the feedback you want to collect, like labels, ratings, rankings, or text. If you need, you can add and configure questions or remove unnecessary fields. All of the changes that you make will be previewed in real time, so you can see how your changes affect the dataset.
Member

Suggested change
The dataset’s columns will be mapped to fields and questions in Argilla. Fields include the data that you want feedback on, like text, chats, or images. Questions are the feedback you want to collect, like labels, ratings, rankings, or text. If you need, you can add and configure questions or remove unnecessary fields. All of the changes that you make will be previewed in real time, so you can see how your changes affect the dataset.
The goal is to map dataset columns to fields and questions in Argilla. Fields include the data you want feedback on, like text, chats, or images. Questions are the feedback you wish to collect, like labels, ratings, rankings, or text. You can add and configure questions or remove unnecessary fields if needed. You can preview all changes in real time to get a clear idea of the Argilla dataset you’re configuring.
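
To make the column mapping described in this paragraph concrete, here is a minimal sketch in plain Python. It does not use the Argilla SDK or the UI: `Mapping`, `map_columns`, the type tags, and the example column names are all hypothetical and only illustrate the idea that each column becomes either a field (data to give feedback on) or a question (feedback to collect), after which questions can be added and unneeded fields removed.

```python
from dataclasses import dataclass, field

# Hypothetical type tags taken from the paragraph's examples; not Argilla identifiers.
FIELD_TYPES = {"text", "chat", "image"}
QUESTION_TYPES = {"label", "rating", "ranking", "text"}


@dataclass
class Mapping:
    """Illustrative container: column name -> type tag."""
    fields: dict = field(default_factory=dict)
    questions: dict = field(default_factory=dict)


def map_columns(columns, question_columns):
    """Split dataset columns into fields and questions.

    columns: dict of column name -> type tag
    question_columns: names of columns that should collect feedback
    """
    mapping = Mapping()
    for name, kind in columns.items():
        if name in question_columns:
            if kind not in QUESTION_TYPES:
                raise ValueError(f"{name!r}: {kind!r} is not a question type")
            mapping.questions[name] = kind
        else:
            if kind not in FIELD_TYPES:
                raise ValueError(f"{name!r}: {kind!r} is not a field type")
            mapping.fields[name] = kind
    return mapping


# Example: three made-up columns, one of which collects feedback.
mapping = map_columns(
    {"prompt": "text", "photo": "image", "quality": "rating"},
    question_columns={"quality"},
)
del mapping.fields["photo"]           # remove an unnecessary field
mapping.questions["topic"] = "label"  # add and configure an extra question
```

In the feature under review, these adjustments happen in the UI with a live preview; the sketch only mirrors the resulting fields/questions split.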

Co-authored-by: Daniel Vila Suero <daniel.vila@huggingface.co>

argilla-ui-hub.md
@@ -0,0 +1,55 @@
---
title: "Argilla 2.4: Easily Add Human Feedback to Hub Datasets—No Code Required"
Author

Do you think this title matches well with the thumbnail @dvsrepo? I see a big semantic difference between "creating datasets" and "adding human feedback to datasets".


2 participants