how to adjust learning rates or something to be better when using a pretrained model to train my dataset?please give me some advice #820

J-LINC · 2023-03-22T15:18:59Z

I'm trying to train my own data set using pretrained weights, BDD100K. Can you give me some advice? (All parameters unchanged, e.g. learning rate and step of learning rate decay) Below is the result of my 12 epoch training. I think map is a little too low, but the values of each loss function shown in the diagram are very small. I am worried that if I continue without any hyper parameter changes, the model will be overfitting. I currently set the number of training rounds to 250 epoch, which is evaluated every 10 rounds, and every 50 epochs saves the model.The following image shows the losses I have trained for 10 epochs and the evaluation results on the validation set

Flova · 2023-04-04T17:53:38Z

What pretrained weights did you use? I would not worry about overfitting that much on this dataset scale. I am preparing a PR with better hyperparameters in the near future, but for now I would just recommend longer training and maybe a slightly higher lr.

J-LINC · 2023-04-05T15:32:20Z

What pretrained weights did you use? I would not worry about overfitting that much on this dataset scale. I am preparing a PR with better hyperparameters in the near future, but for now I would just recommend longer training and maybe a slightly higher lr.

I did not use the pre-training model because I think BDD100k is big enough. I forgot whether the pre-training model was used when I trained COCO in the original paper. Now I am wondering whether to change the learning rate

J-LINC · 2023-04-05T15:34:58Z

What pretrained weights did you use? I would not worry about overfitting that much on this dataset scale. I am preparing a PR with better hyperparameters in the near future, but for now I would just recommend longer training and maybe a slightly higher lr.

you mean learning rate > 0.001 will be better? i think the initial learning rate in the yolov3-custom.cfg is too low.

J-LINC · 2023-04-05T15:46:30Z

What pretrained weights did you use? I would not worry about overfitting that much on this dataset scale. I am preparing a PR with better hyperparameters in the near future, but for now I would just recommend longer training and maybe a slightly higher lr.

Before your reply, I have modified ，which is not to use the pre-training model. As for the two messages I just replied to you, I plan to look at the training results tomorrow and reply to you

J-LINC · 2023-05-02T06:24:38Z

@Flova hi ,i think most issues about map low is because they just use one optimiser ,adam, i checked your code long time ago and i found sgd optimiser emmm,it 's better than adam in training ,at least in some aspects

Flova mentioned this issue Apr 12, 2023

Tune for COCO #826

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how to adjust learning rates or something to be better when using a pretrained model to train my dataset?please give me some advice #820

how to adjust learning rates or something to be better when using a pretrained model to train my dataset?please give me some advice #820

J-LINC commented Mar 22, 2023

Flova commented Apr 4, 2023

J-LINC commented Apr 5, 2023

J-LINC commented Apr 5, 2023

J-LINC commented Apr 5, 2023 •

edited

Loading

J-LINC commented May 2, 2023

how to adjust learning rates or something to be better when using a pretrained model to train my dataset?please give me some advice #820

how to adjust learning rates or something to be better when using a pretrained model to train my dataset?please give me some advice #820

Comments

J-LINC commented Mar 22, 2023

Flova commented Apr 4, 2023

J-LINC commented Apr 5, 2023

J-LINC commented Apr 5, 2023

J-LINC commented Apr 5, 2023 • edited Loading

J-LINC commented May 2, 2023

J-LINC commented Apr 5, 2023 •

edited

Loading