After reading the tutorial at http://caffe.berkeleyvision.org/tutorial/layers.html, I understand that 'lr_mult' is the learning rate multiplier for the weights or the biases. But there are 3 'lr_mult' entries in deep_lstm_short.prototxt — what does the third lr_mult mean?
I am still a novice in caffe, sorry to disturb you.
There are three learnable parameter blobs in the lstm layer:
The first corresponds to the input-to-hidden weight.
The second corresponds to the hidden-to-hidden weight.
The third corresponds to the bias.
So, the third lr_mult is the learning rate multiplier for the bias.
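For illustration, a minimal sketch of how the three `param` blocks line up with those three blobs in a Caffe prototxt layer definition (the layer name, type string, and lr_mult values here are assumptions for the example, not copied from deep_lstm_short.prototxt):

```protobuf
layer {
  name: "lstm1"
  type: "LSTM"            # type string may differ in this fork
  bottom: "data"
  top: "lstm1"
  param { lr_mult: 1 }    # 1st blob: input-to-hidden weight
  param { lr_mult: 1 }    # 2nd blob: hidden-to-hidden weight
  param { lr_mult: 2 }    # 3rd blob: bias
}
```

The `param` blocks are matched to the layer's parameter blobs in order, which is why the position of each lr_mult determines which weight or bias it scales.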
I found that there are 3 lstm layers, but only the first one has lr_mult set. Does that mean the latter two lstm layers' params won't get updated? BTW, for a conv layer the lr_mult is 1 for the weight and 2 for the bias. Here, the lr_mult is 1, 1, 2 for the three blobs mentioned above. Does 1, 1, 2 make sense? Thanks! @junhyukoh