Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Include papers citations automatically in dataset README.md #752

Closed
gabrielmbmb opened this issue Jun 24, 2024 · 0 comments · Fixed by #855
Closed

[FEATURE] Include papers citations automatically in dataset README.md #752

gabrielmbmb opened this issue Jun 24, 2024 · 0 comments · Fixed by #855
Assignees
Labels
enhancement New feature or request
Milestone

Comments

@gabrielmbmb
Copy link
Member

Is your feature request related to a problem? Please describe.
Many of the Tasks implemented in distilabel are derived from papers. We cite those papers in the docstring of the Task.

Hugging Face Hub allows to include papers citations in the README.md of a model, dataset, etc that will render nicely in the UI and also allows to trace which things have been built using that paper.

image

Describe the solution you'd like
Parsing the docstring, we should be able to extract the arXiv ID arxiv.org/abs/2406.13542 of the tasks used in a pipeline to generate a dataset and include those in the README.md. We might need to update the citations in all the Tasks to follow this convention.

@gabrielmbmb gabrielmbmb added the enhancement New feature or request label Jun 24, 2024
@gabrielmbmb gabrielmbmb changed the title [FEATURE] Include papers citations automatically in README.md [FEATURE] Include papers citations automatically in dataset README.md Jun 24, 2024
@plaguss plaguss self-assigned this Aug 6, 2024
@plaguss plaguss added this to the 1.3.0 milestone Aug 6, 2024
@plaguss plaguss closed this as completed Aug 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants