Warn if user tries to upload a parquet file to a model repo #2403

Merged
Wauplin merged 1 commit into main from warn-about-parquet-file-in-model-repo
Jul 24, 2024

Conversation

Wauplin
Contributor

@Wauplin Wauplin commented Jul 19, 2024

Small QoL improvement suggested by @coyotte508 on Slack.

Several users have reported pushing their dataset to a model repo by mistake. A model repo cannot be converted to a dataset repo on the Hub, so they are forced to re-upload everything. This PR tries to prevent this kind of mistake by raising a warning when a user uploads an arrow or parquet file to a non-dataset repo. If the user doesn't stop the process, the files are still uploaded correctly, since we don't want to forbid the upload.
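For readers skimming the thread, here is a rough sketch of the behavior described above. It is not the code merged in this PR: the helper names (`find_dataset_like_files`, `warn_if_dataset_files_in_non_dataset_repo`), the suffix list, and the warning text are all hypothetical, and the sketch assumes that a `repo_type` of `None` means a model repo.

```python
import warnings
from typing import Iterable, List, Optional

# Assumption: only these two suffixes are checked, since the PR description
# mentions arrow and parquet files. The real check may differ.
DATASET_FILE_SUFFIXES = (".arrow", ".parquet")


def find_dataset_like_files(paths: Iterable[str]) -> List[str]:
    """Return the paths that look like dataset files (hypothetical helper)."""
    return [p for p in paths if p.lower().endswith(DATASET_FILE_SUFFIXES)]


def warn_if_dataset_files_in_non_dataset_repo(
    repo_type: Optional[str], paths: Iterable[str]
) -> bool:
    """Warn, without blocking, when dataset-like files target a non-dataset repo.

    Returns True if a warning was emitted. The upload itself is never
    prevented, matching the intent stated in the PR description.
    """
    if repo_type == "dataset":
        return False
    suspicious = find_dataset_like_files(paths)
    if not suspicious:
        return False
    # Assumption: repo_type=None is treated as a model repo.
    warnings.warn(
        f"About to upload {len(suspicious)} arrow/parquet file(s) to a "
        f"'{repo_type or 'model'}' repo. If this is a dataset, stop and set "
        "repo_type='dataset' instead; a repo's type cannot be changed later."
    )
    return True
```

In practice, the way to avoid the warning is to pass `repo_type="dataset"` to the upload call (for example `upload_file` or `upload_folder`) when the target is a dataset repo.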

@Wauplin Wauplin requested a review from LysandreJik July 19, 2024 12:21

@coyotte508
Member

I don't think I'm the one that made that suggestion x)

@Wauplin
Contributor Author

Wauplin commented Jul 19, 2024

Ah yes, indeed, I misread 😄 I was the one who suggested it (here), and you suggested recommending large-upload when uploading large datasets (in here). I'll add the second recommendation in a future PR.

@julien-c
Member

image

Member

@LysandreJik LysandreJik left a comment

No problem for me, even if the issue seems a bit niche from an outsider's point of view.

@Wauplin
Copy link
Contributor Author

Wauplin commented Jul 24, 2024

It's niche yeah, but will hopefully save some support time ^^

@Wauplin Wauplin merged commit ae6986c into main Jul 24, 2024
17 checks passed
@Wauplin Wauplin deleted the warn-about-parquet-file-in-model-repo branch July 24, 2024 14:19
5 participants