
Dataset split ids? #7

Open
cjrd opened this issue Aug 17, 2020 · 4 comments

Comments

cjrd commented Aug 17, 2020

Would it be possible to provide the dataset split ids you used for the paper, i.e. train/val/test?

akolesnikoff (Collaborator) commented:

Splits are uniquely defined in our data folder through the tfds subsplit API: https://www.tensorflow.org/datasets/splits.

The easiest solution would be to use our code to load the data (which will produce the exact splits from the paper).
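For readers unfamiliar with the tfds subsplit API: percent slices map deterministically to example index ranges, which is why the splits are uniquely defined by the split strings alone. A minimal sketch of that idea (a hypothetical helper using simple floor rounding, not tfds's own implementation; tfds documents its exact rounding behaviour separately):

```python
def percent_slice(num_examples, lo_pct, hi_pct):
    """Map a tfds-style percent slice (e.g. train[80%:90%]) to absolute
    index bounds. Floor rounding is used here purely as an illustration;
    the real tfds API has its own rounding options."""
    lo = num_examples * lo_pct // 100
    hi = num_examples * hi_pct // 100
    return lo, hi

# e.g. a 90/10 train/val carve-out of a 50000-example train set
train_bounds = percent_slice(50000, 0, 90)    # (0, 45000)
val_bounds = percent_slice(50000, 90, 100)    # (45000, 50000)
```

Because the bounds depend only on the dataset size and the split string, anyone loading the same tfds version reproduces the same examples.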


cjrd commented Aug 26, 2020

Thanks for your response; I've been able to load the data and output the train/val/test splits.
Is there a particular way to output the train splits for the 1000-example training setup?
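For anyone tracing these splits by hand, tfds split strings compose named splits with absolute or percent slices, e.g. something like `train[:800]+val[:200]` for a 1000-example subset (the exact string VTAB uses is defined in its code, so treat that one as illustrative). A hypothetical parser sketch for absolute-index split strings, not tfds's own parser:

```python
import re

def parse_split(spec):
    """Parse a tfds-style split string like 'train[:800]+val[:200]'
    into (split_name, start, stop) tuples. Handles only absolute
    indices; a sketch, not the real tfds grammar."""
    parts = []
    for piece in spec.split("+"):
        m = re.fullmatch(r"(\w+)(?:\[(\d*):(\d*)\])?", piece.strip())
        if not m:
            raise ValueError(f"bad split piece: {piece!r}")
        name, lo, hi = m.groups()
        parts.append((name, int(lo) if lo else None, int(hi) if hi else None))
    return parts

# e.g. parse_split("train[:800]+val[:200]")
# -> [("train", None, 800), ("val", None, 200)]
```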


frkl commented Nov 9, 2020

Dear VTAB team,

I’m Xiao Lin from SRI. We’ve been working on cross-domain few-shot learning solutions and find your VTAB-1000 benchmark very exciting. It’s the large-scale, fixed-split benchmark we need, compared to the existing small 5-way k-shot problems and the random-way, random-shot Meta-Dataset, so we hope to try it out.

But I ran into some difficulties downloading the dataset. After installing the pip requirements and trying to run the dataset preparation scripts, TF1.5 tells me that "the version of dataset you want to download requires TF2". When I try installing TF2 instead of TF1.5, another error pops up:
"Exporting/importing meta graphs is not supported when eager execution is enabled. No graph exists when eager execution is enabled", which looks like a code-compatibility issue. I see that you are still actively making commits to add TF2 support, so keep up the good work.

On the other hand, I mainly use PyTorch and I’m not very familiar with TensorFlow. I think a good common ground would be to share the images, image names, and your custom labels in addition to the benchmarking code. Your train/val/test protocol sounds very clear, so people would be able to reproduce it across platforms. The exceptions are the Res50v2 model architecture/weights and the fine-tuning procedure, but both of these are actively being improved in your BigTransfer work. In case there’s a follow-up challenge, would it be possible for the benchmark side to run Docker containers for some cross-platform love?

Best,
Xiao Lin

dukleryoni commented:

Hi,

Would it be possible to upload a split_ids file giving the ids of the original dataset samples?
