Feel free to join my Discord Server to discuss this tool!
A simple tool for reproducible dataset gathering for machine learning tasks.
macOS/Linux
curl https://raw.githubusercontent.com/ex3ndr/datasets/main/install.sh | sh
The currently available datasets are listed at https://korshakov.com/datasets
The entry point is the datasets.yaml file in your project, which looks like this:
datasets:
  - cifar-100 # This downloads from central repository
  - name: some_private_dataset
    source: https://not-so-real-url.org
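The example above mixes two entry forms: a bare name (resolved from the central repository) and a mapping with `name` and `source`. The sketch below shows one way such a mixed list could be normalized into uniform records once the YAML has been parsed. It illustrates the file format only and is not the tool's actual implementation; `DatasetEntry` and `normalize_entries` are hypothetical names, and the `parsed` dictionary stands in for what a YAML parser would return.

```python
from typing import Optional, TypedDict, Union


class DatasetEntry(TypedDict):
    name: str
    source: Optional[str]  # None means "resolve from the central repository"


def normalize_entries(raw: list[Union[str, dict]]) -> list[DatasetEntry]:
    """Turn the mixed list under `datasets:` into uniform records (hypothetical helper)."""
    entries: list[DatasetEntry] = []
    for item in raw:
        if isinstance(item, str):
            # Bare name: no explicit source given.
            entries.append({"name": item, "source": None})
        elif isinstance(item, dict) and "name" in item:
            # Mapping form: keep the name and the optional source URL.
            entries.append({"name": item["name"], "source": item.get("source")})
        else:
            raise ValueError(f"Unrecognized dataset entry: {item!r}")
    return entries


# Stand-in for the parsed contents of the example datasets.yaml.
parsed = {
    "datasets": [
        "cifar-100",
        {"name": "some_private_dataset", "source": "https://not-so-real-url.org"},
    ]
}

entries = normalize_entries(parsed["datasets"])
for entry in entries:
    print(entry["name"], entry["source"] or "(central repository)")
```

Treating a missing `source` as "central repository" is an assumption drawn from the comment in the example file.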
Then sync the datasets:
datasets sync
That's all!
MIT