Expose Net.copy_trained_layers_from and Net.share_trained_layers_with in pycaffe #1195

longjon · 2014-10-01T02:12:49Z

This allows finetuning from Python, for example.

Misgivings: copy_trained_layers_from and share_trained_layers_with are very long names, while pycaffe mostly has pretty succinct naming. Also, the layers don't have to be trained at all, so maybe the simpler copy_from and share_with would be better. But the long names mirror the C++ interface (Net::CopyTrainedLayersFrom and Net::ShareTrainedLayersWith).
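To make the fine-tuning use case concrete, here is a minimal sketch of the copy-by-layer-name behavior that a call like copy_from relies on. It is not pycaffe: `ToyNet` and its `params` dict are hypothetical stand-ins, and the source here is another in-memory net, whereas the pycaffe call takes a path to a saved weights file. The assumption being illustrated is that layers are matched by name, so a renamed final layer keeps its fresh initialization.

```python
class ToyNet:
    """Hypothetical stand-in for a net: layer name -> list of weights."""

    def __init__(self, layers):
        # Store a private copy of each weight list.
        self.params = {name: list(w) for name, w in layers.items()}

    def copy_from(self, source):
        """Copy weights for layers whose names match; return the names copied."""
        copied = []
        for name, weights in source.params.items():
            if name not in self.params:
                continue  # source layer absent from this net: skipped
            if len(weights) != len(self.params[name]):
                raise ValueError("shape mismatch for layer %r" % name)
            self.params[name] = list(weights)  # independent copy, not an alias
            copied.append(name)
        return copied


# A "pretrained" net and a fine-tuning net whose last layer was renamed.
pretrained = ToyNet({"conv1": [0.5, -0.2], "fc8": [1.0, 2.0, 3.0]})
finetune = ToyNet({"conv1": [0.0, 0.0], "fc8_new": [0.0, 0.0]})

copied = finetune.copy_from(pretrained)

# Later changes to the source do not affect the copy.
pretrained.params["conv1"][0] = 99.0
```

After the call, `conv1` in `finetune` carries the pretrained values while `fc8_new` is untouched, which is the usual starting point for fine-tuning.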

I don't know what will happen if you call share_trained_layers_with on a net and then delete that net. It should behave the same as doing so in C++; does anyone know how that case is handled? For the use case I have in mind (the solver), the issue does not arise, because the solver holds all the nets.
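The lifetime question can be illustrated with a toy model of sharing. This sketch is not pycaffe and does not settle how the C++ code behaves: `ToyBlob` and `ToyNet` are hypothetical, and the key assumption is that shared storage is reference-counted, so the data outlives the net it came from.

```python
class ToyBlob:
    """Hypothetical blob: holds a reference to a list of values."""

    def __init__(self, values):
        self.data = list(values)


class ToyNet:
    """Hypothetical net: layer name -> ToyBlob."""

    def __init__(self, layers):
        self.blobs = {name: ToyBlob(v) for name, v in layers.items()}

    def share_with(self, other):
        """Alias this net's blobs to other's storage for matching layer names."""
        for name, blob in other.blobs.items():
            if name in self.blobs:
                self.blobs[name].data = blob.data  # same list object, no copy


train_net = ToyNet({"conv1": [0.0, 0.0]})
test_net = ToyNet({"conv1": [9.0, 9.0]})
test_net.share_with(train_net)

# An update made through the source net is visible through the sharing net.
train_net.blobs["conv1"].data[0] = 1.5
seen_by_test = list(test_net.blobs["conv1"].data)

# Dropping the source net leaves the shared list alive, because test_net
# still holds a reference to it.
del train_net
still_alive = list(test_net.blobs["conv1"].data)
```

Under this assumption, deleting the source net is harmless; if the storage were owned solely by the source net instead, the sharing net would be left with dangling data.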

shelhamer · 2014-10-02T07:02:28Z

I like the suggested name change to copy_from and share_with. The interfaces need not be in lock-step. While a close family resemblance helps keep everything coherent, there's a lot to be said for convenience.


longjon changed the title ~~Expose Net.copy_trained_layers_from in pycaffe~~ Expose Net.copy_trained_layers_from and Net.share_trained_layers_with in pycaffe Oct 1, 2014

longjon added 2 commits October 2, 2014 12:47

[pycaffe] expose Net::CopyTrainedLayersFrom as Net.copy_from

ab5e86e

This allows finetuning from Python, for example.

[pycaffe] expose Net::ShareTrainedLayersWith as Net.share_with

0dfda37

longjon force-pushed the python-copy-from branch from f2d5930 to 0dfda37 Compare October 2, 2014 19:51

shelhamer force-pushed the dev branch from d8eb4df to 914da95 Compare October 8, 2014 16:36

longjon mentioned this pull request Oct 9, 2014

Refactor Solver to allow interactive stepping #1228

Merged

shelhamer added a commit that referenced this pull request Oct 10, 2014

Merge pull request #1195 from longjon/python-copy-from

7eecdf9

Expose Net.copy_from and Net.share_with in pycaffe

shelhamer merged commit 7eecdf9 into BVLC:dev Oct 10, 2014

shelhamer added the interface label Oct 10, 2014

mitmul pushed a commit to mitmul/caffe that referenced this pull request Oct 11, 2014

Merge pull request BVLC#1195 from longjon/python-copy-from

cf9dd50

Expose Net.copy_from and Net.share_with in pycaffe

RazvanRanca pushed a commit to RazvanRanca/caffe that referenced this pull request Nov 4, 2014

Merge pull request BVLC#1195 from longjon/python-copy-from

7dc3552

Expose Net.copy_from and Net.share_with in pycaffe

longjon deleted the python-copy-from branch December 30, 2014 05:00
