
where processor should i put in a training code? #13427

Closed
lycfight opened this issue Sep 5, 2021 · 5 comments

Comments


lycfight commented Sep 5, 2021

Hi @lycfight could you please open an issue with a minimal code snippet so we could take a look. Thanks :)

Originally posted by @patil-suraj in #11445 (comment)


lycfight commented Sep 5, 2021

I ran into a problem while writing training code in PyTorch. I want to create a custom Dataset for the COCO image-caption dataset, as follows:

1. Inherit from torch.utils.data.Dataset:

from torch.utils.data import Dataset

class Image_textDataset(Dataset):
    ...

2. Then override __getitem__(self, idx), where I use the processor on a single (image, text) sample.
But it seems that CLIPProcessor can't process a (image, text) sample into shapes the DataLoader can stack into a batch:

def __getitem__(self, idx):
    img_id = self.img_ids[idx]
    # randomly pick one caption from the image's captions
    text = random.choice(self.img_id_to_captions[img_id])
    img_filename = self.img_id_to_filename[img_id]
    img_path = op.join(self.img_dir, img_filename)
    img = Image.open(img_path)
    inputs = processor(text=text, images=img, return_tensors="pt",
                       padding="max_length", truncation=True)
    return inputs

3. Or have __getitem__ return a raw (image, text) pair, and apply the processor in a custom collate_fn, like:

def collate_fn(examples):
    images = [example[0] for example in examples]
    captions = [example[1] for example in examples]
    inputs = processor(
        text=captions,
        images=images,
        max_length=77,
        padding="max_length",
        truncation=True,
        return_tensors="pt",
    )

    batch = {
        "pixel_values": inputs["pixel_values"],
        "input_ids": inputs["input_ids"],
        "attention_mask": inputs["attention_mask"],
    }

    return batch

Then pass collate_fn as a parameter to the DataLoader.
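Both options above can be made to work; a minimal runnable sketch is below. It uses a hypothetical stub_processor in place of the real CLIPProcessor (so it runs without transformers installed), but the calling convention mimics the real one: the processor always returns tensors with a leading batch dimension, so option 2 needs a squeeze(0) before the default collate_fn can stack examples.

```python
import torch
from torch.utils.data import Dataset, DataLoader

# Hypothetical stub standing in for CLIPProcessor; like the real processor,
# it accepts a single example or a list and returns tensors with a leading
# batch dimension. Shapes below match CLIP's defaults (224x224, 77 tokens).
def stub_processor(text, images, max_length=77, **kwargs):
    n = len(text) if isinstance(text, list) else 1
    return {
        "pixel_values": torch.zeros(n, 3, 224, 224),
        "input_ids": torch.zeros(n, max_length, dtype=torch.long),
        "attention_mask": torch.ones(n, max_length, dtype=torch.long),
    }

# Option 2: process inside __getitem__, then squeeze out the batch dim of 1
# so the default collate_fn can stack examples into a batch.
class ProcessedDataset(Dataset):
    def __init__(self, pairs):
        self.pairs = pairs  # list of (image, caption) tuples

    def __len__(self):
        return len(self.pairs)

    def __getitem__(self, idx):
        img, text = self.pairs[idx]
        inputs = stub_processor(text=text, images=img)
        return {k: v.squeeze(0) for k, v in inputs.items()}

# Option 3: return raw pairs and batch-process in a custom collate_fn.
class RawPairDataset(Dataset):
    def __init__(self, pairs):
        self.pairs = pairs

    def __len__(self):
        return len(self.pairs)

    def __getitem__(self, idx):
        return self.pairs[idx]

def collate_fn(examples):
    images = [ex[0] for ex in examples]
    captions = [ex[1] for ex in examples]
    inputs = stub_processor(text=captions, images=images)
    return {k: inputs[k] for k in ("pixel_values", "input_ids", "attention_mask")}

pairs = [(None, "a cat sitting on a mat")] * 4

batch_a = next(iter(DataLoader(ProcessedDataset(pairs), batch_size=2)))
batch_b = next(iter(DataLoader(RawPairDataset(pairs), batch_size=2,
                               collate_fn=collate_fn)))

print(batch_a["input_ids"].shape)  # torch.Size([2, 77])
print(batch_b["input_ids"].shape)  # torch.Size([2, 77])
```

Either route yields identically shaped batches; the difference is only where the processor runs (per example vs. per batch).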

knitemblazor commented:
This issue is not relevant to the transformers repository; please post it in the PyTorch forums for quick help.


lycfight commented Sep 6, 2021

This issue is not relevant to the transformers repository; please post it in the PyTorch forums for quick help.

I think the processor is a core transformers component, so its usage should be covered in the transformers tutorials.

patil-suraj (Contributor) commented:

You can put the processor anywhere you want, either in the dataset or in the collate_fn. If processing on the fly, I would put it in the collate_fn, since it then processes the whole batch with a single call, which is usually faster than processing examples one at a time.
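The call-count difference behind this advice can be illustrated with a toy counter standing in for the processor (the process helper below is hypothetical, not the real CLIPProcessor): per-example processing in __getitem__ triggers one call per sample, while batch processing in collate_fn triggers one call per batch.

```python
# Count how many times the "processor" is invoked under each strategy.
calls = {"n": 0}

def process(texts):
    # Stand-in for a processor call; batched tokenizers amortize
    # per-call overhead across all texts passed in one call.
    calls["n"] += 1
    return [t.lower() for t in texts]

samples = ["A", "B", "C", "D", "E", "F", "G", "H"]

# Per-example (as in __getitem__): one processor call per sample.
calls["n"] = 0
for s in samples:
    process([s])
print(calls["n"])  # 8

# Per-batch (as in collate_fn): one processor call per batch of 4.
calls["n"] = 0
for i in range(0, len(samples), 4):
    process(samples[i:i + 4])
print(calls["n"])  # 2
```

With fast (Rust-backed) tokenizers the batched call is also internally parallelized, so the gap in wall-clock time is usually larger than the raw call count suggests.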


github-actions bot commented Oct 5, 2021

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.
