[`feat`] Addition of popular image retrieval benchmark datasets #724

ir2718 · 2024-10-26T21:57:21Z

Hi,

this is a PR for issue #722. I've implemented four benchmark datasets: CUB-200, Cars196, INaturalist2018, and StanfordOnlineProducts. When using any of these datasets they will be downloaded directly and saved to the root directory, similar to the PyTorch dataset handling. Each of the implemented datasets inherits torch.utils.data.Dataset and can be used with dataloaders seamlessly. I've also added docs for each of the datasets implement, and a short overview of what users need to implemented if they want to add their own custom dataset. Tests for each of the datasets are also added. I've deliberately left out the __init__.py in tests/datasets, as each of the files has to be downloaded, and these can be pretty big (up to 130Gb).

@KevinMusgrave when you have time, please take a look and tell me if something requires changing.

ir2718 added 13 commits October 20, 2024 19:08

add cub

e8a8466

convert to base and cub

98b5897

add cars with disjoint split

4939740

added datasets docs page

3f14aae

add info on creating custom dataset

c18aeb0

refactor

cdffe50

add pretty download function

8808fb2

add inaturalist

324622d

update docs

0c71360

add stanford online products

5e82379

update paths

4315b7a

add tests

e86eab0

format code

b629fff

KevinMusgrave changed the base branch from master to dev October 28, 2024 12:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[`feat`] Addition of popular image retrieval benchmark datasets #724

[`feat`] Addition of popular image retrieval benchmark datasets #724

ir2718 commented Oct 26, 2024 •

edited

Loading

[feat] Addition of popular image retrieval benchmark datasets #724

Are you sure you want to change the base?

[feat] Addition of popular image retrieval benchmark datasets #724

Conversation

ir2718 commented Oct 26, 2024 • edited Loading

[`feat`] Addition of popular image retrieval benchmark datasets #724

[`feat`] Addition of popular image retrieval benchmark datasets #724

ir2718 commented Oct 26, 2024 •

edited

Loading