Python/net spec coordinate map and crop computation #3613

longjon · 2016-01-30T03:16:28Z

This PR provides an updated version of #1975 (see also #1976; this is the new version described there). This is meant for use along with #3570 (new ND crop layer).

This version has several advantages over #1975, which make it a better candidate for merge.

Unlike Augment layers with their induced coordinate maps #1975, layers do not need a pointer back to their owning net, and are not responsible for inspecting each other. This preserves the clean separation between Net and Layer, whereas the strategy employed by Augment layers with their induced coordinate maps #1975 allowed for layers to interact in arbitrary ways that are difficult to reason about and make certain kinds of functionality difficult to implement. (This was the primary objection blocking the merge of Augment layers with their induced coordinate maps #1975.) The alternate strategy used here is to perform the net-level analysis at net definition time (through pycaffe's net spec), and bake the results into the parameters of the net, through a much more straightforward (e.g.) crop layer (now ND Crop layer #3570).
This alternate strategy makes the computed parameters user-visible and user-modifiable, which has value for debugging and understanding.
Cropping issues appear at specification time rather than run time.
Rather than adding C++ code scattered throughout layers, this patch adds a single Python file.
The C++ Layer interface remains unchanged; layers don't gain an extra method which may be obscure to some users.

In addition, the functionality of this code is significantly expanded from that of #1975. In particular,

The assumption that common ancestors can be reached by descending through first bottoms is removed. Any two spatially-mappable blobs with a common ancestor should work for cropping or otherwise computing coordinate maps.
Layers with multiple bottoms (e.g., Eltwise or Convolution with multiple inputs/outputs) should now be supported.
Rectangular filters should now be supported.
ND de/convolution should be supported.
Dilated de/convolution should be supported.

Like #1975, the coordinate maps computed with this code can be used for other purposes in addition to cropping.

The main disadvantage of this approach compared to #1975 is that the coordinate mapping formulae are kept apart from the definitions of their corresponding layers, which have to be manually kept in sync. Given that these formulae only come in a few basic forms, this seems like a reasonable approach for now.

This PR is not complete. The latest version of this code has just been written and has not been well-tested. Work still to be done:

nudge this PR/ND Crop layer #3570 as needed for compatibility
add a test for basic functionality
add a test for rectangular filters
add a test for ND convolution
settle on names for functions (coord_map_from_to is more awkward than it needs to be)
flesh out docstrings and comments (in particular, explain how to use coord_map_from_to)
fix style issues
consider whether any currently unsupported layers should be supported

longjon · 2016-01-30T03:18:50Z

By the way (this should become evident when tests are added): basic usage is like this:

from caffe.coord_map import crop
from caffe import layers as L

data = L.Data(...)
...
output = L.Convolution(...)
cropped_output = crop(output, data)

seanbell · 2016-01-30T07:04:46Z

python/caffe/coord_map.py

+        'TanH', 'Threshold']
+
+def conv_params(fn):
+    params = fn.params.get('convolution_param', fn.params)


If fn.type_name == 'Pooling', then this should check pooling_param I think, since this can get called in coord_map for pooling layers.

Not at work here, but a reminder to one day settle #1318.

Not quite; perhaps this warrants a comment. What's going on here is that the parameters are normally read from fn.params, which contains the conv params for a conv layer and the pooling params for a pooling layer. However, for layers which share the ConvolutionParameter message type (i.e., currently only deconv layer), we need to explicitly ask for convolution_param, since Convolution is not the name of the layer.

SvenTwo · 2016-02-01T18:33:54Z

Does this replace the "crop" layer types used in some fully convolutional networks? How do you perform training of fully convolutional networks if there is no crop layer to bring the per-pixel labels and the network output into the same coordinate space? Will you have to do the cropping manually as a pre-processing step on the training data? This looks rather inefficient to me.

longjon · 2016-02-01T23:20:07Z

@SvenTwo: #3570 becomes the new Crop layer; this code replaces #1975 in order to automatically determine crop parameters. There is no regression of functionality or speed compared to any previous branches; see details above.

ahundt · 2016-02-23T08:00:45Z

@longjon will #3570 and #3613 together provide the functionality in https://github.com/longjon/caffe/? I created longjon#11 to keep things going until the requisite parts are done. Also, is this close to merging? Things seem to have stalled in both for a month or so. Apologies if this is the wrong place to post this.

shelhamer · 2016-02-27T08:07:14Z

python/caffe/coord_map.py

+import numpy as np
+from caffe import layers as L
+
+PASS_THROUGH_LAYERS = ['AbsVal', 'ReLU', 'PReLU', 'Dropout', 'LRN', 'Eltwise',


add ELU, Scale, Bias

shelhamer · 2016-02-28T19:56:07Z

@ahundt right, python coord map #3613/this and crop layer #3570 together deliver the same functionality and efficiency of longjon/caffe:future but with less intrusion into the framework and layers.

ahundt · 2016-02-29T16:02:21Z

@shelhamer thanks! Also are these two patches applied to master compatible with the exiting trained .caffemodel and .prototext of the version in https://github.com/longjon/caffe?

ahundt · 2016-02-29T19:54:05Z

The answer to my prior question is that no ~~yes~~, they are not compatible.

From the model zoo:

fcn-32s runs with expected output Update: it seems this just happens to work by chance, crop does nothing as mentioned in the next post.
fcn-8s crashes Here are the relevant logs

printed output in python:

I0229 14:37:44.157938 2077495296 layer_factory.hpp:77] Creating layer fuse
I0229 14:37:44.157945 2077495296 net.cpp:91] Creating Layer fuse
I0229 14:37:44.157949 2077495296 net.cpp:425] fuse <- upscore2_upscore2_0_split_1
I0229 14:37:44.157953 2077495296 net.cpp:425] fuse <- score-pool4c
I0229 14:37:44.157958 2077495296 net.cpp:399] fuse -> score-fused
F0229 14:37:44.157968 2077495296 eltwise_layer.cpp:34] Check failed: bottom[i]->shape() == bottom[0]->shape() 
*** Check failure stack trace: ***

key lines of stack trace:

7   libglog.0.dylib                 0x000000011fa91015 google::LogMessageFatal::~LogMessageFatal() + 15
8   libglog.0.dylib                 0x000000011fa8e363 google::LogMessageFatal::~LogMessageFatal() + 9
9   libcaffe.so.1.0.0-rc3           0x000000011acf1f4a caffe::EltwiseLayer<float>::Reshape(std::__1::vector<caffe::Blob<float>*, std::__1::allocator<caffe::Blob<float>*> > const&, std::__1::vector<caffe::Blob<float>*, std::__1::allocator<caffe::Blob<float>*> > const&) + 202
10  libcaffe.so.1.0.0-rc3           0x000000011ad466b9 caffe::Net<float>::Init(caffe::NetParameter const&) + 3449
11  libcaffe.so.1.0.0-rc3           0x000000011ad47de6 caffe::Net<float>::Net(std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&, caffe::Phase, caffe::Net<float> const*) + 454

@shelhamer is this perhaps a bug in this pull request?

longjon · 2016-02-29T21:44:40Z

@ahundt, no (unless @shelhamer has made updates I'm not aware of), existing prototxts are not compatible with these two PRs. They will probably run with Crop layers doing nothing, so the FCN-8s dimension mismatch is exactly as expected.

shelhamer · 2016-02-29T22:22:21Z

@ahundt @longjon no, existing models are not compatible but we plan to bundle up nets in a standard format once master is sorted out. It's only an architectural/configuration difference however and the same weights will be fine.

shelhamer · 2016-03-03T08:57:48Z

With this PR @ 775a5c7 and #3570 I have been able to reproduce FCN experiments. a3359a4 adds tests so this can be merged once #3570 is in.

This provides a framework for automatically aligning different layers of a net despite up/downsampling, padding, and output size rounding.

Python/net spec coordinate map and crop computation * longjon/py-coord-map: [pycaffe] test coord_map [pycaffe] align coord_map and BVLC#3570 Crop layer [pycaffe] document, style, and complete coord_map [pycaffe] add coord_map.py for computing induced coordinate transform

- document by docstring and comment - pep8 - add latest layers and alphabetize - respect default crop params - handle graphs with compositions of crops by walking only the first, cropped bottom of Crop layers - make python3 happy by replacing arg tuple unpacking

- crop -> offset - adjust crop axis by 1

- test known mappings: conv-pool-deconv stack, ReLU and 1x1 conv - test effects of padding - test rectangular/anisotropic coordinate mapping, test N-D - catch error cases: negative crop, scale mismatch, tops that are not spatially connected

Python/net spec coordinate map and crop offset computation

ahundt · 2016-03-15T20:14:28Z

@shelhamer Is it possible to post new or updated pre-trained fcn-xx models using this code in the model zoo?

weiliu620 · 2016-04-19T15:14:11Z

Does pycaffe support 'Crop' layer? I run caffe.Net to load model

net = caffe.Net(model_file, model_weights, caffe.TEST)

and got error:

F0419 10:14:49.936854 10131 layer_factory.hpp:77] Check failed: registry.count(type) == 1 (0 vs. 1) Unknown layer type: Crop (known types: AbsVal, Accuracy, ArgMax, BNLL, Concat, ContrastiveLoss, Convolution, Data, Deconvolution, Dropout, DummyData, Eltwise, EuclideanLoss, Exp, Flatten, HDF5Data, HDF5Output, HingeLoss, Im2col, ImageData, InfogainLoss, InnerProduct, LRN, MVN, MemoryData, MultinomialLogisticLoss, Pooling, Power, ReLU, Sigmoid, SigmoidCrossEntropyLoss, Silence, Slice, Softmax, SoftmaxWithLoss, Split, TanH, Threshold, WindowData)

(I used latest caffe master branch)

Python/net spec coordinate map and crop offset computation

longjon added enhancement in progress Python labels Jan 30, 2016

seanbell reviewed Jan 30, 2016
View reviewed changes

BlGene mentioned this pull request Feb 22, 2016

ND Crop layer #3570

Merged

8 tasks

shelhamer added the ES label Feb 25, 2016

shelhamer reviewed Feb 27, 2016
View reviewed changes

shelhamer added the focus label Feb 27, 2016

This was referenced Feb 27, 2016

Augment layers with their induced coordinate maps #1975

Closed

Give layers a pointer to their owning Net #1974

Closed

shelhamer force-pushed the py-coord-map branch from 846a8ea to 2ad9995 Compare February 28, 2016 08:04

ahundt mentioned this pull request Feb 29, 2016

Fixbug #3494 No to_python (by-value) converter found for C++ t… #3575

Merged

BlGene mentioned this pull request Mar 1, 2016

Rebase to latest master branch weiliu89/caffe#1

Open

shelhamer force-pushed the py-coord-map branch 3 times, most recently from 5860e05 to 775a5c7 Compare March 3, 2016 04:44

shelhamer force-pushed the py-coord-map branch 3 times, most recently from 8a46e4c to a3359a4 Compare March 4, 2016 02:05

shelhamer mentioned this pull request Mar 4, 2016

Update versions for Travis build #3771

Closed

shelhamer force-pushed the py-coord-map branch from bd8b5bd to a3359a4 Compare March 4, 2016 04:32

shelhamer closed this Mar 4, 2016

shelhamer reopened this Mar 4, 2016

shelhamer removed in progress ES labels Mar 4, 2016

shelhamer closed this Mar 4, 2016

shelhamer reopened this Mar 4, 2016

[pycaffe] add coord_map.py for computing induced coordinate transform

7a8b19f

This provides a framework for automatically aligning different layers of a net despite up/downsampling, padding, and output size rounding.

shelhamer force-pushed the py-coord-map branch from a3359a4 to 9b93099 Compare March 4, 2016 23:30

shelhamer added 3 commits March 4, 2016 19:09

[pycaffe] align coord_map and BVLC#3570 Crop layer

25b9ef9

- crop -> offset - adjust crop axis by 1

[pycaffe] test coord_map

880e147

- test known mappings: conv-pool-deconv stack, ReLU and 1x1 conv - test effects of padding - test rectangular/anisotropic coordinate mapping, test N-D - catch error cases: negative crop, scale mismatch, tops that are not spatially connected

shelhamer force-pushed the py-coord-map branch from 9b93099 to 880e147 Compare March 5, 2016 03:09

shelhamer added a commit that referenced this pull request Mar 5, 2016

Merge pull request #3613 from longjon/py-coord-map

74cc497

Python/net spec coordinate map and crop offset computation

shelhamer merged commit 74cc497 into BVLC:master Mar 5, 2016

shelhamer mentioned this pull request Apr 10, 2016

Add FCN example for semantic segmentation #3890

Closed

ahundt mentioned this pull request Apr 16, 2016

update the crfasrnn with upstream caffe future version torrvision/crfasrnn#42

Merged

shelhamer deleted the py-coord-map branch April 19, 2016 18:05

fxbit pushed a commit to Yodigram/caffe that referenced this pull request Sep 1, 2016

Merge pull request BVLC#3613 from longjon/py-coord-map

f58e39a

Python/net spec coordinate map and crop offset computation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python/net spec coordinate map and crop computation #3613

Python/net spec coordinate map and crop computation #3613

longjon commented Jan 30, 2016

longjon commented Jan 30, 2016

seanbell Jan 30, 2016

shelhamer Feb 1, 2016

longjon Feb 1, 2016

SvenTwo commented Feb 1, 2016

longjon commented Feb 1, 2016

ahundt commented Feb 23, 2016

shelhamer Feb 27, 2016

shelhamer commented Feb 28, 2016

ahundt commented Feb 29, 2016

ahundt commented Feb 29, 2016

longjon commented Feb 29, 2016

shelhamer commented Feb 29, 2016

shelhamer commented Mar 3, 2016

ahundt commented Mar 15, 2016

weiliu620 commented Apr 19, 2016 •

edited

Loading

Python/net spec coordinate map and crop computation #3613

Python/net spec coordinate map and crop computation #3613

Conversation

longjon commented Jan 30, 2016

longjon commented Jan 30, 2016

seanbell Jan 30, 2016

Choose a reason for hiding this comment

shelhamer Feb 1, 2016

Choose a reason for hiding this comment

longjon Feb 1, 2016

Choose a reason for hiding this comment

SvenTwo commented Feb 1, 2016

longjon commented Feb 1, 2016

ahundt commented Feb 23, 2016

shelhamer Feb 27, 2016

Choose a reason for hiding this comment

shelhamer commented Feb 28, 2016

ahundt commented Feb 29, 2016

ahundt commented Feb 29, 2016

longjon commented Feb 29, 2016

shelhamer commented Feb 29, 2016

shelhamer commented Mar 3, 2016

ahundt commented Mar 15, 2016

weiliu620 commented Apr 19, 2016 • edited Loading

weiliu620 commented Apr 19, 2016 •

edited

Loading