
Add CamVid dataset for segmentation #90

Open · wants to merge 6 commits into main

Conversation

felixgwu commented Mar 8, 2017

Following the request in #60, I implemented this CamVid dataset class for people who are interested in doing image segmentation.

Since both the input and the target should go through the same transformation, I added the file joint_transforms.py, which allows us to apply the same image transformation to a list of images.
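The core idea can be sketched like this (a minimal illustration of the shared-randomness pattern, not the actual code in this PR; the class name mirrors the sample usage below):

```python
import random

class JointRandomHorizontalFlip:
    """Flip every image in the list together.

    The random draw happens once per call, so the input image and its
    target mask always receive the same decision.
    """
    def __init__(self, p=0.5):
        self.p = p

    def __call__(self, images):
        if random.random() < self.p:
            # 0 is PIL's Image.FLIP_LEFT_RIGHT transpose constant
            return [img.transpose(0) for img in images]
        return images
```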

Sample usage:

import torch
from torchvision.datasets import camvid
from torchvision import transforms, joint_transforms

normalize = transforms.Normalize(mean=camvid.mean, std=camvid.std)
train_joint_transformer = transforms.Compose([
    joint_transforms.JointRandomCrop(224),
    joint_transforms.JointRandomHorizontalFlip()
    ])
train_dataset = camvid.CamVid('path/to/CamVid', 'train',
                      joint_transform=train_joint_transformer,
                      transform=transforms.Compose([
                          transforms.ToTensor(),
                          normalize,
                      ]))
train_loader = torch.utils.data.DataLoader(train_dataset, batch_size=10, shuffle=True)

Currently, the download function is not implemented yet.
Users need to download the data from here by themselves.
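For reference, a common CamVid folder layout is the one used by the SegNet tutorial data, sketched below. I'm assuming this is roughly what the class expects; check the dataset's constructor for the exact folder names:

```
CamVid/
    train/        # input images
    trainannot/   # label maps for train
    val/
    valannot/
    test/
    testannot/
```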


@fmassa
Member

fmassa commented Mar 11, 2017

So, I'm not sure about the Joint Transforms. We were thinking about factoring out the random number generation from the transforms, so that the same random transform can be applied to different inputs (and eventually to inputs from different modalities, such as images and bounding boxes).
I'll send a PR with these transforms tomorrow.

@felixgwu
Author

OK. I'll modify the code based on the new version of transforms.

@felixgwu
Author

Hi @fmassa,
I am wondering what the conclusion on joint random transformations is.
In my opinion, making all the transform classes able to take either a single image or a list of images as input, and providing a joint_transform parameter to the dataset object as in my code, could be a solution.

Also, I made the function private. However, the other two classes should stay public so that users can use them: LabelToLongTensor can be passed as a transform to the CamVid class.
More importantly, LabelToPILImage can be used to visualize the predicted labels.
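For reference, the two public helpers could look roughly like this (a hedged sketch reconstructing the described behavior, not the exact code in this PR; the `palette` argument is a hypothetical convenience):

```python
import numpy as np
import torch
from PIL import Image

def LabelToLongTensor(pic):
    """Convert a label image (PIL Image or ndarray of class ids) to an
    H x W LongTensor, suitable as a segmentation target."""
    arr = np.asarray(pic, dtype=np.int64)
    return torch.from_numpy(arr.copy())

def LabelToPILImage(label, palette=None):
    """Turn an H x W LongTensor of class ids into a paletted PIL image,
    e.g. for visualizing predicted labels."""
    img = Image.fromarray(label.numpy().astype(np.uint8), mode="P")
    if palette is not None:
        img.putpalette(palette)  # flat [r, g, b, r, g, b, ...] list
    return img
```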

@fmassa
Member

fmassa commented Mar 19, 2017

Hi @felixgwu,

In a local branch I factored out the random parameter generation from the transforms, and I'm using it now in a project for semantic segmentation.

The drawback of this approach is that we need to pass an extra argument to the constructor of the dataset (let's call it generators). The generators are a list of objects that, when called, generate the random parameters for the transforms.
Thus, in __getitem__, if generators were provided to the constructor, we call them to generate the parameters for the transforms, which can then be used for both the inputs and the targets.

This requires passing a new argument to the constructor, but I think it's better than having joint transforms, because we might want to apply some individual transforms before/after the joint transforms, which would not be possible in that setup.
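Concretely, the setup described above might look something like this (a sketch under names I made up for illustration — RandomCropGenerator, SegmentationDataset — not the code from the local branch):

```python
import random

def crop(img, params):
    # Apply pre-drawn crop parameters as a PIL-style (left, top, right, bottom) box.
    x, y, tw, th = params
    return img.crop((x, y, x + tw, y + th))

class RandomCropGenerator:
    """Draws crop coordinates once per sample; the parameters can then be
    applied identically to the input image and the target mask."""
    def __init__(self, size):
        self.size = size

    def __call__(self, img_size):
        w, h = img_size
        x = random.randint(0, w - self.size)
        y = random.randint(0, h - self.size)
        return (x, y, self.size, self.size)

class SegmentationDataset:
    def __init__(self, samples, generators=None):
        self.samples = samples            # list of (image, target) pairs
        self.generators = generators or []

    def __getitem__(self, idx):
        img, target = self.samples[idx]
        for gen in self.generators:
            params = gen(img.size)        # draw the random parameters once...
            img = crop(img, params)       # ...apply them to the input...
            target = crop(target, params) # ...and identically to the target
        return img, target
```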

I'm not yet 100% happy with my refactoring, but it's time to send a PR to get some feedback. I'm away from my computer this weekend, but when I create the PR I'll tag you on it.

@gpleiss gpleiss mentioned this pull request Mar 29, 2017
@alykhantejani
Contributor

Hi @felixgwu, sorry for the delay on this. We are discussing a refactor of the transforms in #230 and should have something merged soon, after which sharing the random transformation parameters between transforms should be much easier.

@carlogarro

Did it succeed?

@yassineAlouini
Contributor

Hello @felixgwu and sorry for taking so long to get back to you.

As you might know, there is a new dataset API being designed and existing datasets will be ported to it. Here is a thread explaining the logic behind it: #5336.

Also, there is this thread that discusses adding the CamVid dataset: #60. You are probably aware of it.

As far as I understand, it would be better to wait a bit until the new design is stable; then it would be best to port the code here to the new design. Someone can help you do it, with proper attribution given to you, or you can do it yourself if you want to. What do you think @felixgwu?

Also, since this PR is a bit old, maybe you don't need it anymore and/or have found an alternative. Any feedback is welcome @felixgwu.

Thanks again and sorry for the long delay.

@pmeier
Collaborator

pmeier commented Jun 7, 2022

@yassineAlouini This PR adds two things:

  1. The image dataset CamVid (I erroneously thought this was a video dataset in CamVid dataset #60)
  2. Joint transformations for images and segmentation masks.

The second part is well covered by the transforms rework that is currently going on. If I'm not mistaken we already have transformations for everything proposed here in torchvision.prototype.transforms.

As for the dataset, the implementation seems pretty straightforward, and porting it to the new API should be simple. So far we have mostly looked into classification and detection datasets, but segmentation datasets will follow soon. We should wait at least until then before we have a go at this.
