Convert all non-`RGB` images to `RGB` in `CLIPImageTransform` #1975

vancoykendall · 2024-11-08T19:51:57Z

In `CLIPImageTransform`, only RGBA images get converted to RGB. I don't see a reason not to convert all non-RGB images. I have a multimodal dataset with mostly RGB images, but a handful are in CMYK and L mode. I'm finetuning a Llama3.2 VLM, which uses `CLIPImageTransform`, and my first training run crashed partway through when a CMYK image was finally loaded, didn't get converted, and its size didn't match.

torchtune/torchtune/models/clip/_transform.py

Lines 161 to 163 in 96dea61

    
    # Make image torch.tensor((3, H, W), dtype=dtype), 0<=values<=1
    if hasattr(image, "mode") and image.mode == "RGBA":
        image = image.convert("RGB")

I noticed the code in the eval recipe actually does convert all non-RGB images to RGB:

torchtune/recipes/eleuther_eval.py

Lines 168 to 170 in 96dea61

    
    for image in images:
        if image.mode != "RGB":
            image = image.convert("RGB")
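
For illustration, here is a minimal sketch of the proposed behavior, assuming Pillow is installed. `to_rgb` is a hypothetical helper name used only for this example and is not a torchtune function. It converts any image whose mode is not `RGB` and prints the channel count before and after for the modes mentioned in this issue.

```python
from PIL import Image


def to_rgb(image: Image.Image) -> Image.Image:
    """Return the image in RGB mode, converting only when needed.

    Hypothetical helper for illustration; not part of torchtune.
    """
    if image.mode != "RGB":
        image = image.convert("RGB")
    return image


# Compare channel counts before and after conversion.
for mode in ("RGB", "RGBA", "CMYK", "L"):
    img = Image.new(mode, (8, 8))
    before = len(img.getbands())
    after = len(to_rgb(img).getbands())
    print(f"{mode}: {before} channel(s) -> {after}")
```

CMYK has four channels and L has one, so without the conversion neither would produce the `(3, H, W)` tensor that the comment in the transform describes.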


ebsmothers · 2024-11-08T19:57:45Z

Thanks for creating the issue. Actually this was done as a bit of a one-off so we probably can consider changing to match the eval recipe. cc @pbontrager in case you have any strong objections to this, but otherwise @vancoykendall we'd love to have you open a PR with the change if you're interested.

vancoykendall · 2024-11-08T20:05:23Z

@ebsmothers sounds good. Also, like the eval code, I think I can remove the `hasattr` check for the `mode` attribute, since all `PIL.Image` objects have it.

vancoykendall · 2024-11-08T20:40:52Z

@ebsmothers Just created a PR: #1976

ebsmothers · 2024-11-09T15:28:04Z

Closing now that the fix has landed. Thanks again!

vancoykendall mentioned this issue Nov 8, 2024

convert rgba to rgb #1678

Merged

vancoykendall mentioned this issue Nov 8, 2024

Convert all non-rgb images to rgb #1976

Merged

ebsmothers closed this as completed Nov 9, 2024
