
Optimal lr used for fine-tuning with larger LR on ImageNet #120

Closed
Godofnothing opened this issue Oct 17, 2021 · 2 comments

Comments

@Godofnothing

Hello. I wonder what order of magnitude of learning rate one should use for fine-tuning on ImageNet at input resolution 384, starting from the 224 DeiT-Tiny pretrained model.

There are discussions in this repo about transfer learning on other datasets (CIFAR-10, iNaturalist): #105, #45.

Would a learning rate on the order of 5e-6 to 1e-5 be the optimal choice for fine-tuning on ImageNet at higher resolution, assuming all other training settings are kept at their defaults (mixup, cutmix, AdamW as the optimizer, etc.)?

Thanks in advance

@TouvronHugo
Contributor

Hi @Godofnothing ,
Thanks for your question.
You can keep the same settings and change only the learning rate for the fine-tuning.
The optimal lr depends on the model and the number of fine-tuning epochs; I think 1e-5 is a good start.
Best,
Hugo
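
For reference, a minimal sketch of this setup in PyTorch, assuming the timm library's `deit_tiny_patch16_224` checkpoint and its support for overriding `img_size` at load time; the weight-decay value is an assumption (the DeiT default), not something stated in this thread:

```python
# Minimal sketch (assumptions noted above): fine-tune DeiT-Tiny at 384x384
# with a small learning rate, keeping the rest of the recipe unchanged.
import timm
import torch

# Load the 224-pretrained DeiT-Tiny; passing img_size=384 makes timm
# interpolate the position embeddings for the larger input resolution.
model = timm.create_model("deit_tiny_patch16_224", pretrained=True, img_size=384)

# Change only the learning rate, per the suggestion above.
# weight_decay=0.05 is the DeiT default and an assumption here.
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5, weight_decay=0.05)
```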

@Godofnothing
Author

@TouvronHugo thanks a lot for your response!
