Skip to content
This repository has been archived by the owner on Jun 14, 2023. It is now read-only.

Can the author provide the YFCC-100M data downloader? #13

Open
linhuixiao opened this issue Feb 24, 2022 · 5 comments
Open

Can the author provide the YFCC-100M data downloader? #13

linhuixiao opened this issue Feb 24, 2022 · 5 comments

Comments

@linhuixiao
Copy link

Can the author or someone provide the YFCC-100M data downloader?

It mentioned that the YFCC-100M data format must follow as:
'''
Download the YFCC100M dataset. Our dataloader expects the following dataset directory structure with 100 folders containing 1000 zip archives of 1000 images each. The concatenation of the folder, archive, and file names is the index of the image (i.e. image 12345678 is stored as 678.jpg within 12/345.zip):
'''

It seems not the original data collect format.

thank you.

@shugerdou
Copy link

May I know where can we download 'yfcc100m_dataset.txt'?

@normster
Copy link
Contributor

Sorry for the late reply. I did not download the data myself, so I won't be able to provide a download script. I'll look into whether it's possible for me to share the yfcc100m_dataset.txt metadata file and get back to you two.

@linhuixiao
Copy link
Author

Could provide' yfcc100m_dataset.txt' already yet? thx

@linhuixiao
Copy link
Author

Could you provide ' yfcc100m_dataset.txt' already yet? If it's convenient, please send me by email: linhui.xiao@foxmail.com, just used for academic research. thx!

@Soonhwan-Kwon
Copy link

I also failed to reproduce preprocessing of yfcc15m because yfcc100m_dataset.txt was missing.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants