@Darwin2011 The Caffe maintainers said they want layers to be smaller building blocks, so EA should be kept separate. EA could be implemented, very inelegantly, with a DummyDataLayer and other existing layers, so my version was dropped.
However, I am going to PR a bias layer and a scale layer, which together would implement EA, and the maintainers are OK with merging them when ready.
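For illustration, the split described above would look roughly like this in a prototxt. This is only a sketch: the layer names are made up, and it assumes the Scale layer exposes a `bias_term` option so it can also apply the shift.

```protobuf
layer {
  name: "conv1_bn"        # normalization only: subtract mean, divide by std
  type: "BatchNorm"
  bottom: "conv1"
  top: "conv1"
}
layer {
  name: "conv1_scale"     # learned elementwise affine: gamma * x + beta
  type: "Scale"
  bottom: "conv1"
  top: "conv1"
  scale_param { bias_term: true }  # bias_term adds the beta (shift) parameter
}
```

With this split, BatchNorm stays a small building block, and the affine part can be reused independently of normalization.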
Hi, @ducha-aiki
I am trying to figure out the BN implementations. In the PR you tested, there is no bias or shift implemented.
I also notice from your experiments that BN + Affine does not seem to improve performance much over plain BN in the initial training stages.
And in your Caffe fork, https://github.com/ducha-aiki/caffe, there is another BN implementation, from Caffe PR 1965, which does implement shift and bias.
So may I ask why these two operations were dropped in the Caffe upstream version you tested? Do they actually hurt performance? Or which version should I choose?
Thanks a lot.
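To make the distinction in the question concrete, here is a minimal NumPy sketch (function names are mine, not from any of the PRs): the upstream BatchNorm layer computes only the normalization, while the dropped shift and bias correspond to the learned elementwise affine step applied afterwards.

```python
import numpy as np

rng = np.random.default_rng(0)

def bn_normalize(x, eps=1e-5):
    """Normalization only -- what the upstream Caffe BatchNorm layer computes."""
    return (x - x.mean(axis=0)) / np.sqrt(x.var(axis=0) + eps)

def bn_affine(x, gamma, beta, eps=1e-5):
    """Full BN: normalize, then learned elementwise scale (gamma) and shift (beta)."""
    return gamma * bn_normalize(x, eps) + beta

# Toy batch of 8 samples with 4 channels, offset and scaled away from (0, 1).
x = rng.normal(loc=2.0, scale=3.0, size=(8, 4))
y = bn_normalize(x)
# After normalization the per-channel mean is ~0 and variance is ~1.
print(np.allclose(y.mean(axis=0), 0.0, atol=1e-6))  # True
```

With `gamma = 1` and `beta = 0`, `bn_affine` reduces to `bn_normalize`, which is why the affine part can live in separate Scale/Bias layers without changing the normalization itself.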