Train pix2pix with my own data #309

happyday521 · 2018-06-28T08:32:02Z

I use my own data to train the pix2pix model.
I stack the A and B(a pair) as the input to my model and I want to get the C.
Besides, the B is the segmentation label for C. I use the A to provide RGB information for the generation of C.
However, in my training,the result is not what I want. The fake C is very alike to the A and not like the real C that I want.
The loss plot is as follows：

I want to know how can I decrease the influence of A and make the fake C look like the real C rather than the A? How should I change the pix2pix model(now used by default) ?Can someone give me some advice?Thanks！

phamnam95 · 2018-06-29T19:03:01Z

How did you stack A and B as a pair? Did you stack along the channel dimension. For example A has dimension (H1,W1,C1) and B is (H1,W1,C2). Did you stack along the C dimension to have (H1,W1,C1+C2)? Does your segmentation label have only label 1 and 0, or the label is between 0 and 1? How you normalize the data after stacking (I guess the range of your RGB image and segmentation result is different)?

happyday521 · 2018-06-30T02:41:34Z

Yes, I stack A and B along the channel dimension as you said. My segmentation label is also a RGB image. I think their range is similar.Do you have any advice? Thanks.

phamnam95 · 2018-06-30T02:47:51Z

I am doing similar problem. But my A and B have different range. And I am having same problem with you. I am looking for the solution as well.

happyday521 · 2018-06-30T02:50:18Z

Ok, good luck! If you have any useful idea,please tell me!

junyanz · 2018-07-04T02:57:19Z

You should stack two images as (H1, W1+W1, C1) where we assume C1=C2. For other types of data, you may consider writing your own data loader inherited from the base_dataset model.

phamnam95 · 2018-07-04T03:01:48Z

Why do we need to stack along the width dimension?

junyanz · 2018-07-04T03:09:21Z

@phamnam95 it is the current design of default data loader for pix2pix. You can run this script to concatenate input and output images. It works fine if C1=C2=3 or C1=C2=1. It might not be the best way for your datasets. Feel free to write your own data loader.

phamnam95 · 2018-07-04T03:13:48Z

Is it possible to have the input image and output image with different dimensions? If we stack along the width dimension, the dimension of input image is (H,W+W,C) and the dimension of output image is (H,W,C)?

junyanz · 2018-07-04T03:54:00Z

It's not supported by the aligned_dataset.py. Also, the code assumes that H and W are the same for both input and output.

phamnam95 · 2018-07-04T03:59:58Z

So if I stack the dataset like you suggested, the dimension of input and output will be different. For example, I have two sets of images with dimension (200,200,1) and (200,200,1). I want to create the output of dimension (200,200,1). How can I stack inputs to feed in training? If I stack along width dimension, it will be (200,400,1) for input and (200,200,1) for output?

junyanz · 2018-07-04T18:41:07Z

If you stack your inputs, the image will be (200, 400, 1).
The aligned_dataset will load the (200, 400, 1) and split it into two: one for input, and one for output.
See this line for more details.

phamnam95 · 2018-07-04T18:42:42Z

I am a little confused because my input has 2 image A, B; and my output is only image C. Thanks

junyanz · 2018-07-04T18:44:39Z

I see. In this special case, you may want to write your own data loader. It should only take 1 hour.

phamnam95 · 2018-07-04T19:00:53Z

Should I stack along the channel dimension for A and B?

junyanz · 2018-07-05T16:09:33Z

If you write your own data loader, you can load each image separately by the name image0000_A, image0000_B, image0000_C. You don't need to stack them.

phamnam95 · 2018-07-05T16:12:22Z

I mean when I train, I guess I cannot feed two input images A and B to the input tensor. I need to stack them to create one image for input.

happyday521 closed this as completed Jul 7, 2018

2110317008 mentioned this issue Aug 28, 2019

after some epochs, training stops #703

Closed

turian mentioned this issue Apr 2, 2021

16-bit or 24-bit color channels? #1264

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train pix2pix with my own data #309

Train pix2pix with my own data #309

happyday521 commented Jun 28, 2018 •

edited

Loading

phamnam95 commented Jun 29, 2018

happyday521 commented Jun 30, 2018

phamnam95 commented Jun 30, 2018

happyday521 commented Jun 30, 2018

junyanz commented Jul 4, 2018

phamnam95 commented Jul 4, 2018

junyanz commented Jul 4, 2018 •

edited

Loading

phamnam95 commented Jul 4, 2018

junyanz commented Jul 4, 2018

phamnam95 commented Jul 4, 2018

junyanz commented Jul 4, 2018

phamnam95 commented Jul 4, 2018

junyanz commented Jul 4, 2018

phamnam95 commented Jul 4, 2018

junyanz commented Jul 5, 2018

phamnam95 commented Jul 5, 2018

Train pix2pix with my own data #309

Train pix2pix with my own data #309

Comments

happyday521 commented Jun 28, 2018 • edited Loading

phamnam95 commented Jun 29, 2018

happyday521 commented Jun 30, 2018

phamnam95 commented Jun 30, 2018

happyday521 commented Jun 30, 2018

junyanz commented Jul 4, 2018

phamnam95 commented Jul 4, 2018

junyanz commented Jul 4, 2018 • edited Loading

phamnam95 commented Jul 4, 2018

junyanz commented Jul 4, 2018

phamnam95 commented Jul 4, 2018

junyanz commented Jul 4, 2018

phamnam95 commented Jul 4, 2018

junyanz commented Jul 4, 2018

phamnam95 commented Jul 4, 2018

junyanz commented Jul 5, 2018

phamnam95 commented Jul 5, 2018

happyday521 commented Jun 28, 2018 •

edited

Loading

junyanz commented Jul 4, 2018 •

edited

Loading