-
Notifications
You must be signed in to change notification settings - Fork 885
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
docs(datasets) Add dataset contribution guide #4601
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hey @adam-narozniak , it looks pretty good. I like the inclusion of a mini example at the bottom. Some comments below to move some text around a bit.
|
||
1. Create the dataset locally. | ||
|
||
We recommend that you do not upload custom scripts to HuggingFace Hub; instead, create the dataset locally and upload the data, which will speed up the processing time each time the data set is downloaded. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
useful info, how about having it as a :: tip:
right under the Creating a datasets locally
section?
|
||
2. Contribute to HuggingFace Hub. | ||
|
||
Each dataset in the HF Hub is a Git repository with a specific structure and readme file, and HuggingFace provides an API to push the dataset and, alternatively, a user interface directly in the website to populate the information in the readme file. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd move this as a mini introduction to the Contribution to the HuggingFace Hub
section. It's good
Co-authored-by: Javier <jafermarq@users.noreply.github.com>
No description provided.