
Two questions that I hope to consult with you #5

Open
luobendewugong opened this issue May 20, 2024 · 3 comments

Comments

@luobendewugong

Thank you very much for sharing your work. I have two questions I hope to consult you about.

1. If I wanted to incorporate a validation set into your code, how should the code be modified?

2. When I change BATCH_SIZE to 2 or 3, I get the following error. How should I modify the code to avoid it?

Traceback (most recent call last):
  File "c:/Users/PC/Documents/Code/Sam_LoRA-main/train.py", line 59, in <module>
    stk_gt, stk_out = utils.stacking_batch(batch, outputs)
  File "c:\Users\PC\Documents\Code\Sam_LoRA-main\src\utils.py", line 101, in stacking_batch
    stk_gt = torch.stack([b["ground_truth_mask"] for b in batch], dim=0)
RuntimeError: stack expects each tensor to be equal size, but got [300, 450] at entry 0 and [500, 600] at entry 1

Thank you very much!

@MathieuNlp
Owner

Hi, I'm happy that you are interested.

  1. One approach is to split the training dataset into a train set and a validation set. You could then write a function that evaluates on the validation set, like in ./inference_eval.py (lines 35 to 51), with whatever score you want. Finally, print/plot the train loss and validation loss and save the best checkpoint.

  2. You will need to add padding (zero padding will work). I used a batch size of 1, so stacking tensors wasn't a problem, but with a larger batch size stacking requires the height and width dimensions to be equal. The labels in the dataset are not all the same size, so you will either need to crop the original images, or add a transformation in the dataloading process that crops/centers all the images so that they can simply be stacked. You can add the transformation to the DatasetSegmentation class in src/dataloader.py.
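Both points can be sketched in a few lines. This is a minimal sketch, not code from the repo: `pad_collate`, the 80/20 split ratio, and the DataLoader arguments are assumptions; only `DatasetSegmentation`, `src/dataloader.py`, and the batch structure implied by `stacking_batch` (a list of dicts with a `"ground_truth_mask"` key) come from this thread.

```python
import torch
import torch.nn.functional as F
from torch.utils.data import DataLoader, random_split

def pad_collate(batch):
    """Hypothetical collate helper: zero-pad every ground-truth mask in
    the batch to the largest height/width so torch.stack no longer fails
    on mismatched sizes."""
    max_h = max(item["ground_truth_mask"].shape[-2] for item in batch)
    max_w = max(item["ground_truth_mask"].shape[-1] for item in batch)
    for item in batch:
        h, w = item["ground_truth_mask"].shape[-2:]
        # F.pad pads the last two dims as (left, right, top, bottom)
        item["ground_truth_mask"] = F.pad(
            item["ground_truth_mask"], (0, max_w - w, 0, max_h - h), value=0
        )
    return batch  # keep the list-of-dicts structure train.py expects

# Hypothetical train/validation split (assumed 80/20 ratio):
# dataset = DatasetSegmentation(...)  # from src/dataloader.py
# n_val = int(0.2 * len(dataset))
# train_set, val_set = random_split(dataset, [len(dataset) - n_val, n_val])
# train_loader = DataLoader(train_set, batch_size=2, collate_fn=pad_collate)
# val_loader = DataLoader(val_set, batch_size=1, collate_fn=pad_collate)
```

With this collate function, the two masks from the traceback ([300, 450] and [500, 600]) would both come out as [500, 600] and stack into a single [2, 500, 600] tensor.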

Hope it helps.

@luobendewugong
Author


Thank you very much for your reply; I will try it.
Additionally, I would like to ask: have you tried SAM's sam_vit_l_0b3195 or sam_vit_h_4b8939 checkpoints? I tried to train with these two checkpoints and got a shape error. Is there a part of the code that needs to be modified?

@MathieuNlp
Owner

MathieuNlp commented Jun 7, 2024

Hello,

Could you show me the shape error you got? I have answered a question about loading other ViT sizes here: #7
