downsampled_imagenet broken #4662
Comments
I also reached out to the ImageNet moderators to hear their input and will post any response here.
@Kim-Dongjun provided a good explanation and shared the location of the torrent that people use for the original data from PixelRNN. Here is Dongjun's explanation of the discrepancy (which also matches what I've heard from some authors at talks and conferences):
Here is a summary. For imagenet 32x32, some papers use an "old" version and some use a "new" version. My understanding is:
My proposals are
Thanks. I'm curious about others' take on this issue and would appreciate it if others could confirm.
Hi TFDS,
downsampled_imagenet (32x32) gives a 404 (stack trace at end of issue). This is because the imagenet link stored by tfds (https://image-net.org/small/download.php) is broken. The broken link is also featured in some papers such as Pixel Recurrent Neural Networks.
There is a different, currently working link for 32x32 ImageNet (https://image-net.org/download-images.php; if you log in, you can see a 32x32 option).
Let us refer to them as OLD (what TFDS used to host) and NEW (currently on imagenet website).
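For anyone who wants to check the two links quickly, here is a minimal sketch using only the Python standard library. It is not TFDS code: the names `probe`, `report`, `OLD_URL`, and `NEW_URL` are made up for illustration, and the only facts taken from this issue are the two URLs.

```python
import urllib.error
import urllib.request

# URLs quoted in this issue; the constant names are illustrative only.
OLD_URL = "https://image-net.org/small/download.php"
NEW_URL = "https://image-net.org/download-images.php"


def probe(url, opener=urllib.request.urlopen, timeout=10):
    """Return the HTTP status code for `url`, or None if the request cannot be made.

    `opener` is injectable so the function can be exercised without network access.
    """
    try:
        with opener(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as HTTPError; report the code itself.
        return err.code
    except urllib.error.URLError:
        # DNS failure, refused connection, and similar transport errors.
        return None


def report(urls, opener=urllib.request.urlopen):
    """Map each URL to the status code returned by `probe`."""
    return {url: probe(url, opener=opener) for url in urls}
```

Calling `report([OLD_URL, NEW_URL])` shows which link currently responds. A successful status for the NEW page only means the page loads; as noted above, the 32x32 option is visible only after logging in, so this check does not confirm that the data itself can be downloaded.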
An anonymous ICLR reviewer (see "weaknesses" under reviewer AKwV) mentioned that NEW is "too easy" and that results on it cannot be compared to older results obtained on OLD. The reviewer also mentioned that OLD circulates in the community via a torrent.
TFDS's link to OLD likely broke within the last 9 months: another Google repo shared code that uses tfds to get downsampled_imagenet (I left an issue there: google-research/vdm#8), and their datasets.py file was pushed about 9 months ago.
None of these are the same as imagenet_resized.
Purpose:
Possible solution:
Examples of research using OLD
Some ICLR publications from this year already use NEW.
Thanks!
Mark
Environment information
Operating System: Ubuntu VERSION="18.04.6 LTS (Bionic Beaver)"
Python version: 3.9.12
tensorflow-datasets/tfds-nightly version: tfds '4.7.0' and tfds '4.8.2+nightly'
tensorflow/tf-nightly version: tf '2.10.0'
Does the issue still exist with the latest tfds-nightly package (pip install --upgrade tfds-nightly)? Yes
Reproduction instructions
Link to logs