how to download / create single_caption_per_sample_val.json file #7

BoiAkay · 2023-03-09T17:47:22Z

Can anyone please help me generate the single_caption_per_sample_val.json file that is referenced in embeddings_generator.py, as shown below?
annotations_path = f'/home/gamir/DER-Roei/davidn/myprivate_coco/annotations/single_caption_per_sample_val.json'

DavidHuji · 2023-04-04T07:52:19Z

Hi, here are the instructions. Please let me know if you encounter any issue.

gWeiXP · 2023-11-25T13:58:25Z

Hi, I had the same problem: I couldn't obtain single_caption_per_sample_val.json. Also, what does it mean to set dataset_mode to 0.5, 1.5, 2.5, etc. in embeddings_generator.py?

gWeiXP · 2023-12-12T08:31:38Z

I gave up on this and instead ran the evaluation with other code I found on GitHub: https://github.com/jmhessel/clipscore.
Simply save the generated captions and the reference (label) captions into two lists, following clipscore.py.
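
A minimal sketch of that step, assuming the evaluation script reads one JSON file mapping each image id to a single generated caption and a second JSON file mapping each image id to a list of reference captions. The function name `save_clipscore_inputs`, its parameters, and the output file layout are hypothetical; check the clipscore README for the exact input format and invocation before relying on this.

```python
import json


def save_clipscore_inputs(image_ids, generated, references, cand_path, refs_path):
    """Write generated and reference captions to two JSON files keyed by image id.

    image_ids  : list of image identifiers
    generated  : list with one generated caption per image
    references : list with one list of reference captions per image
    The key/value layout is an assumption, not taken from clipscore itself.
    """
    if not (len(image_ids) == len(generated) == len(references)):
        raise ValueError("image_ids, generated and references must have equal length")

    # One generated caption per image id.
    candidates = {str(i): cap for i, cap in zip(image_ids, generated)}
    # A list of reference captions per image id.
    refs = {str(i): list(r) for i, r in zip(image_ids, references)}

    with open(cand_path, "w", encoding="utf-8") as f:
        json.dump(candidates, f, ensure_ascii=False, indent=2)
    with open(refs_path, "w", encoding="utf-8") as f:
        json.dump(refs, f, ensure_ascii=False, indent=2)
    return candidates, refs
```

Keying both files by the same image id keeps the generated and reference captions aligned even if the two lists are later reordered.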

DavidHuji · 2023-12-12T08:53:39Z

Hi, sorry for the confusion. The JSON (single_caption_per_sample_val) holds the caption data (per id) and is generated by the parse_karpathy script. Once you download the data from the sources I mentioned in the README, you can use the parse_karpathy script to pre-process it and generate a JSON in the format of single_caption_per_sample_val. Then you can simply use that JSON as the input for embeddings_generator. The different dataset_mode values in embeddings_generator are just something internal that was useful for me, since I wanted to have one mode per dataset (it makes it easier for me to manage the ~10 different paths), but you can definitely ignore them, create your own JSON, and assign its path to 'annotations_path'. Hope this is helpful. Once I have some free time I'll update the code to make it easier to use.
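
As a rough illustration of that pre-processing step (not the actual parse_karpathy script), here is a sketch that keeps one caption per validation image. It assumes the input follows the usual Karpathy split layout (a top-level "images" list whose entries carry "split", "filename", "cocoid" and a "sentences" list with a "raw" caption). The function name `build_single_caption_json` and the output field names ("image_id", "filename", "caption") are hypothetical; the real format expected by embeddings_generator should be taken from parse_karpathy.

```python
import json


def build_single_caption_json(karpathy_path, out_path, split="val"):
    """Write one caption per image of the requested split to out_path.

    Assumes a Karpathy-style split file; output field names are illustrative only.
    """
    with open(karpathy_path, encoding="utf-8") as f:
        data = json.load(f)

    samples = []
    for img in data["images"]:
        # Skip images from other splits and images without any caption.
        if img.get("split") != split or not img.get("sentences"):
            continue
        samples.append({
            "image_id": img.get("cocoid"),
            "filename": img["filename"],
            # Keep only the first caption so each sample has exactly one.
            "caption": img["sentences"][0]["raw"].strip(),
        })

    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(samples, f)
    return samples
```

The path of the resulting file is what would then be assigned to 'annotations_path'.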

qq123aa456 · 2023-12-12T09:18:40Z

Thank you so much for your reply. Could you please give us some instructions on how to compute scores such as BLEU and CIDEr?

qq123aa456 · 2023-12-12T09:19:47Z

@wxpqq826615304 I'll try this, thanks.

DavidHuji mentioned this issue Dec 12, 2023

how to get train.json? #13

Closed
