
Caffe support by pranv #368

Closed
wants to merge 35 commits into from

Conversation

@fchollet (Member) commented Jul 8, 2015

Creating a PR for easier reviewing

@pranv (Contributor) commented Jul 10, 2015

I've been thinking of adding tests, but since the model files are too huge to include in Keras, I think we have only two options:

  • Fetch them on demand
  • Eliminate them, and just test the model-definition conversion code with a complicated model.

Any suggestions?

@phreeza (Contributor) commented Jul 10, 2015

+1 for tests, and +1 for fetch on demand, just as it is done with the datasets.

@fchollet (Member, Author) commented:

If you've uploaded your model files somewhere (e.g. S3), then it's just one line of code:

from keras.datasets.data_utils import get_file
local_path = get_file('local_name.ext', origin="https://s3.amazonaws.com/some_path.ext")
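
A conversion test could follow the same fetch-on-demand pattern. Below is a minimal sketch of the cache-then-fetch logic behind such a helper; the helper name, paths, and file names are illustrative, not the actual Keras internals:

```python
import os
try:
    from urllib.request import urlretrieve  # Python 3
except ImportError:
    from six.moves.urllib.request import urlretrieve  # Python 2 fallback

def get_file_cached(fname, origin, cache_dir=None):
    """Download `origin` into a local cache once; reuse the cached copy afterwards."""
    if cache_dir is None:
        cache_dir = os.path.expanduser(os.path.join('~', '.keras', 'datasets'))
    if not os.path.exists(cache_dir):
        os.makedirs(cache_dir)
    fpath = os.path.join(cache_dir, fname)
    if not os.path.exists(fpath):  # fetch on demand, only on first use
        urlretrieve(origin, fpath)
    return fpath
```

A test would call this with the model's URL and run the converter on the cached file, so the download cost is only paid once.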

@pranv (Contributor) commented Jul 10, 2015

The model files will be available on the page of the researcher who trained the model, or in the Caffe Model Zoo.
Will add it ASAP.

@pranv (Contributor) commented Jul 10, 2015

Has anyone tried it out on a few models yet?
Any results?

input_layer_names.append(layers[input_layer].name)

if layer_nb in ends:
name = 'output_' + name # outputs nodes are marked with 'output_' prefix from which output is derived later in 'add_output'
@fchollet (Member, Author) commented on the diff:
To avoid very long lines, I would recommend putting comments before the line (possibly over several lines).
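
Concretely, the snippet above could be rewritten with the comment moved before the line it describes; the surrounding values here are made up for illustration:

```python
ends = {12}                # indices of the network's terminal layers (illustrative)
layer_nb, name = 12, 'fc7'

# Output nodes are marked with an 'output_' prefix, from which the
# actual output is derived later in add_output.
if layer_nb in ends:
    name = 'output_' + name
```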

@fchollet (Member, Author) commented:

The protobuf issue is fixed in a clean way by adding the following to setup.py:

import os
from six.moves.urllib.request import urlretrieve

# First, compile Caffe protobuf Python file
datadir = os.path.expanduser(os.path.join('~', '.keras', 'data'))
if not os.path.exists(datadir):
    os.makedirs(datadir)

caffe_source = os.path.join(datadir, 'caffe.proto')
caffe_destination = os.path.join(os.path.dirname(os.path.realpath(__file__)), 'keras', 'caffe')
urlretrieve('https://raw.githubusercontent.com/BVLC/caffe/master/src/caffe/proto/caffe.proto', caffe_source)
os.system('protoc --proto_path="' + datadir + '" --python_out="' + caffe_destination + '" "' + caffe_source + '"')

This should be Windows compatible as well.

The only potential issue is that this requires Protobuf (and the protoc compiler) to be installed before running setup.py; otherwise the Caffe import won't work (Keras can still be installed, though).
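
To soften that requirement, the protoc step could be made fail-soft: probe for the compiler first and skip Caffe support when it is absent. A sketch under that assumption (the function name and messages are mine, not the PR's):

```python
import os
import shutil
import subprocess

def compile_caffe_proto(proto_path, out_dir, protoc='protoc'):
    """Run protoc on caffe.proto if available; otherwise skip Caffe
    support without aborting the install."""
    exe = shutil.which(protoc)
    if exe is None:
        print('protoc not found: skipping Caffe support (Keras itself still installs).')
        return False
    subprocess.check_call([exe,
                           '--proto_path=' + os.path.dirname(proto_path),
                           '--python_out=' + out_dir,
                           proto_path])
    return True
```

setup.py would call this and continue either way, matching the "Keras can still be installed" behavior.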

@fchollet (Member, Author) commented:

We can now remove the pre-compiled protobuf file in keras/caffe as well, since we can generate it at install time. Please update the PR.

@pranv (Contributor) commented Jul 15, 2015

Thanks for the feedback!

This idea will help remove caffe_pb2.py, which currently sits in the repo unnecessarily, as discussed in chainer/chainer#166.

I will update the PR based on your suggestions ASAP.

@pranv (Contributor) commented Jul 15, 2015

Only potential issue is that this requires Protobuf to be installed before running setup.py. Or else Caffe import won't work (Keras can still be installed though).

Can we make Google Protocol Buffers an optional dependency, like h5py was before?

@pranv (Contributor) commented Jul 15, 2015

move test_caffe_conversion.py to tests/auto. You can put the auxiliary file on S3 and fetch it with get_file (if you want I can put it on S3 for you).

I don't have any S3 storage, so please do it.

@pranv (Contributor) commented Jul 15, 2015

@fchollet have you tried out a few models?

Any results, feedback, bugs in that regard?

@fchollet (Member, Author) commented:

Can we make Google Protocol Buffers an optional dependency, like h5py was before?

Using the code above, it is already de facto an optional dependency, because you can install and use Keras without it. You just won't be able to load Caffe models.
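
The import-time guard that makes this work could look roughly like the following; this is a sketch of the pattern, not the PR's actual code, and the error message is invented:

```python
try:
    import caffe_pb2  # generated by protoc at install time; absent without protobuf
except ImportError:
    caffe_pb2 = None

def caffe_to_keras(prototxt, caffemodel):
    """Converter entry point; fails with an actionable message when the
    protobuf bindings were never generated."""
    if caffe_pb2 is None:
        raise ImportError('Caffe support requires protobuf: install protoc '
                          'and reinstall Keras to generate caffe_pb2.')
    # ... actual conversion would go here ...
```

Everything else in Keras imports normally, so the dependency stays optional in practice.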

I don't have any S3 storage, so please do it.

Sure. In that case just remove every model file, I'll set up S3 storage & fetching.

Any results, feedback, bugs in that regard?

Not yet.

@asampat3090 commented:
I believe in the current form, the CaffeToKeras class creates the network graph based on the caffemodel file instead of the prototxt. That is, if the user wants to plug in a subset of weights from the caffemodel into a new model (as defined in the prototxt) they currently cannot. Since I assume many will try to do transfer learning using weights from models in the Model Zoo plugged into new models, this could be a big issue.

@pranv (Contributor) commented Jul 16, 2015

I think I know the problem. When a caffemodel is provided, my code constructs the Keras model entirely from it, disregarding the prototxt, so changing the prototxt will not change your model. This is a bad idea. My initial approach was to create the model from the prototxt and then copy the weights over; I reverted to the current behavior because I hadn't written the convert_weights function at the time, and this way I could test the conversion simply (the older PR did everything from the prototxt and then copied the weights).

I think this will be fixed when I complete it, along with the other changes mentioned here, by tomorrow.

Thanks for pointing it out!
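
The prototxt-first flow described above amounts to matching weights to the architecture by layer name; here is a toy sketch of that matching step (the function, layer names, and weight values are all made up):

```python
def copy_weights(prototxt_layers, caffemodel_weights):
    """Match weights to a model built from the prototxt by layer name,
    so editing the prototxt (e.g. truncating at fc7) controls the result."""
    copied, skipped = {}, []
    for name in prototxt_layers:
        if name in caffemodel_weights:
            copied[name] = caffemodel_weights[name]
        else:
            skipped.append(name)  # e.g. layers renamed for fine-tuning
    return copied, skipped

arch = ['conv1_1', 'fc7', 'fc8_new']                    # from the (edited) prototxt
weights = {'conv1_1': 'W1', 'fc7': 'W7', 'fc8': 'W8'}   # from the .caffemodel
copied, skipped = copy_weights(arch, weights)
# copied: conv1_1 and fc7; skipped: fc8_new (left at its random initialization)
```

Skipped layers are exactly the ones a transfer-learning user would retrain from scratch.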

@asampat3090 commented:
@pranv Just wanted to see if you were able to make any progress. If you are swamped with work and already know what needs to change let me know - I can try to work on some changes as well.

@fchollet (Member, Author) commented:

What's the status on this PR? We'd like to merge it asap. If you don't have time for it, do you want me to take over?

@pranv (Contributor) commented Jul 23, 2015

Hey,
Sorry for the delay. I'll have it ready by this time tomorrow.

layer_output_dim = layer_input_dim

else:
raise RuntimeError("one or many layers used int this model is not currently supported")
A contributor commented on the diff:

Typo fix and clarification:

raise RuntimeError("One or more layers used in this model are not currently supported")

@pranv (Contributor) commented Jul 24, 2015

@fchollet I think I've made the changes.
The file fetching still has to be set up.

@fchollet (Member, Author) commented:

Cool, thank you. I'll take it from here.

@fchollet fchollet closed this Jul 25, 2015
@llcao (Contributor) commented Aug 17, 2015

I could not find the Caffe converter in the official Keras repository. Where shall I look?

@fchollet (Member, Author) commented:

In the caffe branch: https://github.com/fchollet/keras/tree/caffe

It is still being tested and debugged.


@asampat3090 commented:
I've attached my testing code if someone would like to try it out as well. I've loaded an example image the same way in both frameworks and just used Caffe as the reference; please see the code below. You can try any image ('exampleimg.jpg'); I have used the 16-layer caffemodel file, and my 16-layer prototxt is shown below the code. The Caffe output claims it's from the fc7 layer, but given that I'm getting a lot of zeros, I'm pretty sure the ReLU is being applied. Either way, the results from the two aren't matching up. Please let me know if I made any egregious errors below.

import sys
import numpy as np
from scipy.misc import imread, imresize
import pdb

import caffe
from keras.caffe import convert

# model files used
cnn_model_def = 'cnn_params/VGG_ILSVRC_16_layers_deploy_features.prototxt'
cnn_model_params = 'cnn_params/VGG_ILSVRC_16_layers.caffemodel'

C = 3
H = 224
W = 224

def format_img_for_input(image, H, W):
    """
    Helper function to convert image read from imread to caffe input

    Input:
    image - numpy array describing the image
    H - height in px
    W - width in px
    """
    if len(image.shape) == 2:
        image = np.tile(image[:, :, np.newaxis], (1, 1, 3))
    # RGB -> BGR
    image = image[:, :, (2, 1, 0)]
    # mean subtraction (get mean from model file?..hardcoded for now)
    image = image - np.array([103.939, 116.779, 123.68])
    # resize
    image = imresize(image, (H, W))
    # get channel in correct dimension
    image = np.transpose(image, (2, 0, 1))
    return image


# setup caffe cnn
print "Setting up caffe CNN..."
net = caffe.Net(cnn_model_def, cnn_model_params)
net.set_mode_gpu()

net.set_phase_test()
caffe_batch = np.zeros((10, C, H, W))

# setup keras
print "Setting up keras CNN..."
model = convert.caffe_to_keras(
    prototext=cnn_model_def,
    caffemodel=cnn_model_params,
    phase='test')
graph = model
keras_batch = np.zeros((1, C, H, W))

# Load image and format for input
print "Loading example image..."
im = imread('exampleimg.jpg')
formatted_im = format_img_for_input(im, H, W)

keras_batch[0, :, :, :] = formatted_im
for i in range(10):
    caffe_batch[i] = formatted_im

# extract features using caffe
print "Extracting features from caffe Net..."
out = net.forward(**{net.inputs[0]: caffe_batch})
caffe_features = out[net.outputs[0]].squeeze(axis=(2, 3))
caffe_features = caffe_features[0]

# extract features using keras
print "Extracting features from keras Graph..."
graph.compile('rmsprop', {graph.outputs.keys()[0]: 'mse'})
keras_features = graph.predict({'conv1_1':keras_batch}, batch_size=1, verbose=1)

# compare values - print True if the features (approximately) match;
# exact float equality (==) is too strict across frameworks
print "Compare values..."
print np.allclose(caffe_features, keras_features)
pdb.set_trace()

And my prototxt here:

name: "VGG_ILSVRC_16_layers"
input: "data"
input_dim: 10
input_dim: 3
input_dim: 224
input_dim: 224
layers {
  bottom: "data"
  top: "conv1_1"
  name: "conv1_1"
  type: CONVOLUTION
  convolution_param {
    num_output: 64
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv1_1"
  top: "conv1_1"
  name: "relu1_1"
  type: RELU
}
layers {
  bottom: "conv1_1"
  top: "conv1_2"
  name: "conv1_2"
  type: CONVOLUTION
  convolution_param {
    num_output: 64
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv1_2"
  top: "conv1_2"
  name: "relu1_2"
  type: RELU
}
layers {
  bottom: "conv1_2"
  top: "pool1"
  name: "pool1"
  type: POOLING
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layers {
  bottom: "pool1"
  top: "conv2_1"
  name: "conv2_1"
  type: CONVOLUTION
  convolution_param {
    num_output: 128
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv2_1"
  top: "conv2_1"
  name: "relu2_1"
  type: RELU
}
layers {
  bottom: "conv2_1"
  top: "conv2_2"
  name: "conv2_2"
  type: CONVOLUTION
  convolution_param {
    num_output: 128
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv2_2"
  top: "conv2_2"
  name: "relu2_2"
  type: RELU
}
layers {
  bottom: "conv2_2"
  top: "pool2"
  name: "pool2"
  type: POOLING
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layers {
  bottom: "pool2"
  top: "conv3_1"
  name: "conv3_1"
  type: CONVOLUTION
  convolution_param {
    num_output: 256
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv3_1"
  top: "conv3_1"
  name: "relu3_1"
  type: RELU
}
layers {
  bottom: "conv3_1"
  top: "conv3_2"
  name: "conv3_2"
  type: CONVOLUTION
  convolution_param {
    num_output: 256
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv3_2"
  top: "conv3_2"
  name: "relu3_2"
  type: RELU
}
layers {
  bottom: "conv3_2"
  top: "conv3_3"
  name: "conv3_3"
  type: CONVOLUTION
  convolution_param {
    num_output: 256
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv3_3"
  top: "conv3_3"
  name: "relu3_3"
  type: RELU
}
layers {
  bottom: "conv3_3"
  top: "pool3"
  name: "pool3"
  type: POOLING
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layers {
  bottom: "pool3"
  top: "conv4_1"
  name: "conv4_1"
  type: CONVOLUTION
  convolution_param {
    num_output: 512
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv4_1"
  top: "conv4_1"
  name: "relu4_1"
  type: RELU
}
layers {
  bottom: "conv4_1"
  top: "conv4_2"
  name: "conv4_2"
  type: CONVOLUTION
  convolution_param {
    num_output: 512
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv4_2"
  top: "conv4_2"
  name: "relu4_2"
  type: RELU
}
layers {
  bottom: "conv4_2"
  top: "conv4_3"
  name: "conv4_3"
  type: CONVOLUTION
  convolution_param {
    num_output: 512
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv4_3"
  top: "conv4_3"
  name: "relu4_3"
  type: RELU
}
layers {
  bottom: "conv4_3"
  top: "pool4"
  name: "pool4"
  type: POOLING
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layers {
  bottom: "pool4"
  top: "conv5_1"
  name: "conv5_1"
  type: CONVOLUTION
  convolution_param {
    num_output: 512
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv5_1"
  top: "conv5_1"
  name: "relu5_1"
  type: RELU
}
layers {
  bottom: "conv5_1"
  top: "conv5_2"
  name: "conv5_2"
  type: CONVOLUTION
  convolution_param {
    num_output: 512
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv5_2"
  top: "conv5_2"
  name: "relu5_2"
  type: RELU
}
layers {
  bottom: "conv5_2"
  top: "conv5_3"
  name: "conv5_3"
  type: CONVOLUTION
  convolution_param {
    num_output: 512
    pad: 1
    kernel_size: 3
  }
}
layers {
  bottom: "conv5_3"
  top: "conv5_3"
  name: "relu5_3"
  type: RELU
}
layers {
  bottom: "conv5_3"
  top: "pool5"
  name: "pool5"
  type: POOLING
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layers {
  bottom: "pool5"
  top: "fc6"
  name: "fc6"
  type: INNER_PRODUCT
  inner_product_param {
    num_output: 4096
  }
}
layers {
  bottom: "fc6"
  top: "fc6"
  name: "relu6"
  type: RELU
}
layers {
  bottom: "fc6"
  top: "fc6"
  name: "drop6"
  type: DROPOUT
  dropout_param {
    dropout_ratio: 0.5
  }
}
layers {
  bottom: "fc6"
  top: "fc7"
  name: "fc7"
  type: INNER_PRODUCT
  inner_product_param {
    num_output: 4096
  }
}
layers {
  bottom: "fc7"
  top: "fc7"
  name: "relu7"
  type: RELU
}

@fchollet (Member, Author) commented:

Please post any further comments in the current PR thread: #442

@asampat3090 : is the caffemodel hosted somewhere? I'd like to take a look.

One initial reason why you would see different results (independently of any potential bug in the PR) is that the networks are in different phases: the Keras net is in test mode and the Caffe net is in train mode (which is why Dropout is being applied). This changes intermediate representations substantially, but should not significantly affect the last-layer probabilities (assuming the network has been trained to convergence).
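
A toy numpy illustration of why the two phases diverge at intermediate layers like fc7 (the array values are illustrative; classic Caffe-era dropout rescales at test time rather than using inverted dropout at train time):

```python
import numpy as np

rng = np.random.RandomState(0)
fc7 = np.ones((1, 4096))  # stand-in for fc7 activations after ReLU

# Train phase (the Caffe net here): dropout zeroes each unit with
# probability 0.5, as in the drop6/drop7 layers, so the intermediate
# features contain many exact zeros.
train_features = fc7 * (rng.uniform(size=fc7.shape) > 0.5)

# Test phase (the Keras net here): nothing is dropped; classic
# (non-inverted) dropout rescales the activations instead.
test_features = fc7 * 0.5

print((train_features == 0).mean())  # roughly half the units are zeroed
print((test_features == 0).any())    # False: no zeros at test time
```

This is consistent with the "lot of zeros" observation from the train-phase Caffe features.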

@asampat3090 asampat3090 mentioned this pull request Aug 18, 2015
fchollet pushed a commit that referenced this pull request Sep 22, 2023
… by : @divyashreepathihalli (#368)

* add mlp classifier example

* remove tfaddons dependency, remove GELU and AdamW and replace with keras core optimizer isnstead

* review updates applied