Hi there,

Thanks for the contribution! After reading the code, I'm somewhat confused about the attention regularization part. Please correct me if I've misunderstood anything.

From the code, my understanding of the center loss is that for every class (label) you maintain one center for the features, and those same features are also used for softmax classification after being multiplied by a scale of 100. However, what the paper claims is that the center loss is used for attention regularization, which assigns each attention feature in the feature matrix its own center; the paper's center-loss equation sums the distances between the attention features and their centers (with a distinct M indexing the attention features in the equation).
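For concreteness, here is a minimal sketch of the two readings as I understand them; the shapes and names (`class_centers`, `part_centers`, etc.) are my own assumptions for illustration, not identifiers from this repo:

```python
import torch

def class_center_loss(features, labels, class_centers):
    # Reading 1 (what the code looks like to me): one center per class.
    # features: (B, D), labels: (B,), class_centers: (num_classes, D)
    centers = class_centers[labels]                     # (B, D)
    return ((features - centers) ** 2).sum(dim=1).mean()

def attention_center_loss(feature_matrix, part_centers):
    # Reading 2 (what the paper's equation suggests): one center per
    # attention feature, i.e. L_A = sum_{k=1}^{M} ||f_k - c_k||^2.
    # feature_matrix: (B, M, D), part_centers: (M, D)
    diff = feature_matrix - part_centers.unsqueeze(0)   # (B, M, D)
    return (diff ** 2).sum(dim=(1, 2)).mean()
```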
Is there an explanation for this discrepancy between the code and the paper?