Modalities not detected for some datasets using the Webdatasets format #2996

ProGamerGov · 2024-07-23T15:15:38Z

I have found two examples of the modality detection code failing to recognize modalities in text and image datasets using the WebDataset format:

I'm not sure where in the modality detection code things are failing: https://github.com/huggingface/dataset-viewer/blob/main/services/worker/src/worker/job_runners/dataset/modalities.py

severo · 2024-07-23T15:24:03Z

Thanks for opening. Note that you can force the modality: https://huggingface.co/docs/hub/datasets-cards#force-set-a-dataset-modality

github-actions · 2024-08-23T15:04:05Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.
