Allow easy uploading of image datasets to OpenML #7

Open
PGijsbers opened this issue Sep 2, 2024 · 1 comment

PGijsbers commented Sep 2, 2024

In general, image datasets currently consist of a header table with a directory of files. So a "File Dataset" may be more apt.
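As a rough illustration of the "header table plus directory of files" layout, the sketch below builds such a table from a folder tree. The function name `build_header_table`, the column names `file_path` and `label`, the file name `header.csv`, and the assumption that the first directory level encodes the label are all hypothetical; nothing here is an existing OpenML API.

```python
import csv
from pathlib import Path


def build_header_table(root: Path, table_name: str = "header.csv") -> Path:
    """Write a CSV that lists every file under `root` with a relative path."""
    rows = []
    for path in sorted(root.rglob("*")):
        if path.is_file() and path.name != table_name:
            rel = path.relative_to(root)
            # Assumption: the first directory level encodes the class label.
            label = rel.parts[0] if len(rel.parts) > 1 else ""
            # Store POSIX-style relative paths so the table is portable.
            rows.append({"file_path": rel.as_posix(), "label": label})

    table_path = root / table_name
    with table_path.open("w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=["file_path", "label"])
        writer.writeheader()
        writer.writerows(rows)
    return table_path
```

Because the table only stores paths, the same layout would work for audio, video, or any other file type, which is the argument for calling it a "File Dataset".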

PGijsbers changed the title from "BOOST: Allow easy uploading of image datasets to OpenML" to "Allow easy uploading of image datasets to OpenML" on Sep 2, 2024

PGijsbers (Author) commented

From a related item:
We currently have some prototypes for downloading bucket content with many images; however, many things are left unspecified (and unsupported by packages), e.g.:

- Dataset upload (adding auxiliary files in general).
- How should the files be zipped/unzipped, and how can we know at download time how to resolve the paths?
- How should we store metadata, and which metadata should be stored (different types of tasks, bounding boxes, segmentation masks, etc.)?
- Add the relevant documentation.

More generally, this should extend to a parquet file describing other files (images, audio, video).
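One possible answer to the zipping and path-resolution question, sketched under assumptions: the header table stores paths relative to the dataset root, the archive preserves those relative paths, and the client joins them with the extraction directory at download time. The names `pack_dataset`, `unpack_and_resolve`, `header.csv`, `file_path`, and `local_path` are hypothetical, and a CSV stands in for the parquet table to keep the example dependency-free.

```python
import csv
import zipfile
from pathlib import Path


def pack_dataset(root: Path, archive: Path, table_name: str = "header.csv") -> None:
    """Zip the header table and every file it references, keeping relative paths."""
    with (root / table_name).open(newline="") as f:
        rel_paths = [row["file_path"] for row in csv.DictReader(f)]
    # ZIP_STORED: most image formats are already compressed.
    with zipfile.ZipFile(archive, "w", zipfile.ZIP_STORED) as zf:
        zf.write(root / table_name, arcname=table_name)
        for rel in rel_paths:
            zf.write(root / rel, arcname=rel)


def unpack_and_resolve(
    archive: Path, target: Path, table_name: str = "header.csv"
) -> list[dict]:
    """Extract the archive and return header rows with absolute local paths."""
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(target)
    with (target / table_name).open(newline="") as f:
        rows = list(csv.DictReader(f))
    for row in rows:
        # Resolve each relative path against the extraction directory.
        row["local_path"] = str((target / row["file_path"]).resolve())
    return rows
```

Keeping only relative paths in the table means the archive can be extracted anywhere, and extra metadata columns (bounding boxes, mask file paths, and so on) could be added to the table without changing the packing logic.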
