
Try to extract Convolution code from cuda-convnet2 #830

Closed
sguada opened this issue Jul 30, 2014 · 7 comments

Comments

@sguada (Contributor) commented Jul 30, 2014

According to some benchmarks, Caffe had the fastest convolution implementation until the recent release of cuda-convnet2:
https://github.com/soumith/convnet-benchmarks

So maybe someone would like to look into that code and try to extract the convolution routines for use in Caffe.
https://code.google.com/p/cuda-convnet2/

@bhack (Contributor) commented Jul 30, 2014

It could be interesting to see this benchmark with #544, but that approach is still CPU-only.

@kloudkl (Contributor) commented Aug 6, 2014

cuda-convnet2 has three major new features relative to cuda-convnet:

  1. Improved training times on Kepler-generation Nvidia GPUs (Geforce Titan, K20, K40).
  2. Multi-GPU training support implementing data parallelism, model parallelism, and the hybrid approach described in One weird trick for parallelizing convolutional neural networks [1].
  3. Less-polished code and incomplete (but improving) documentation.

[1] Alex Krizhevsky. One weird trick for parallelizing convolutional neural networks. arXiv:1404.5997 [cs.NE]

@kloudkl (Contributor) commented Aug 6, 2014

The benchmark page says "Caffe is fastest forward+backward". What is the "banded approach for im2col" mentioned in the comments there?
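For readers unfamiliar with the term: Caffe lowers convolution to matrix multiplication by unrolling each input window into a column with im2col, then calling a single GEMM. The thread doesn't explain the "banded" variant, but plain im2col can be sketched as follows (a single-channel, stride-1, no-padding NumPy illustration; the function name and layout are illustrative, not Caffe's actual C++/CUDA code):

```python
import numpy as np

def im2col(image, k):
    """Unroll every k x k patch of a single-channel image into a
    column, so that convolution becomes one matrix multiply.
    Illustrative sketch only: Caffe's real im2col also handles
    multiple channels, stride, and padding."""
    h, w = image.shape
    out_h, out_w = h - k + 1, w - k + 1
    cols = np.empty((k * k, out_h * out_w))
    for i in range(out_h):
        for j in range(out_w):
            cols[:, i * out_w + j] = image[i:i + k, j:j + k].ravel()
    return cols

# Convolution as GEMM: flattened filter (1 x k*k) times the columns.
image = np.arange(16, dtype=float).reshape(4, 4)
kernel = np.ones((3, 3))  # box filter, chosen for easy inspection
out = (kernel.ravel() @ im2col(image, 3)).reshape(2, 2)
print(out)  # each entry is the sum of one 3x3 window
```

The trade-off this makes is memory for speed: the unrolled matrix duplicates overlapping pixels, which is presumably what a "banded" layout would try to reduce.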

@kloudkl (Contributor) commented Aug 6, 2014

There has been a lot of interest in running Caffe on multiple GPUs (#194, #301, #423, #519, #547, #630, #653). Alex is keeping ahead of Caffe by implementing data parallelism, model parallelism, and the hybrid approach. Can something be done to catch up?
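As background on the first of those schemes: synchronous data parallelism splits each minibatch across workers (one per GPU), each worker computes gradients on its shard, and the averaged gradient drives one shared weight update. A toy single-process sketch of that idea (plain NumPy, a made-up least-squares model; nothing here is Caffe's or cuda-convnet2's actual code):

```python
import numpy as np

def shard_gradient(w, x, y):
    # Gradient of the per-sample least-squares loss
    # 0.5 * ||x @ w - y||^2 / n on one worker's shard.
    return x.T @ (x @ w - y) / len(x)

rng = np.random.default_rng(0)
x = rng.normal(size=(8, 3))          # one minibatch of 8 samples
w_true = np.array([1.0, -2.0, 0.5])  # target weights to recover
y = x @ w_true

w = np.zeros(3)
for _ in range(2000):
    # Each "GPU" sees half the batch; the gradients are averaged
    # before a single shared update, giving the same step a lone
    # worker would take on the full minibatch.
    grads = [shard_gradient(w, x[:4], y[:4]),
             shard_gradient(w, x[4:], y[4:])]
    w -= 0.1 * np.mean(grads, axis=0)
```

Model parallelism instead splits the layers (or filters) of one network across GPUs, and the hybrid approach of [1] uses data parallelism for the convolutional layers and model parallelism for the fully connected ones.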

@rodrigob (Contributor) commented Aug 6, 2014

The memory consumption aspect #852 should also be considered.

@shelhamer (Member) commented

For parallelism, please discuss at #876. It's certainly planned, but given the freedom in this direction, don't hesitate to attempt parallelism after your own fashion for comparison.

@shelhamer (Member) commented

Closing -- cuDNN and its Caffe layer integration supersede the custom cuda-convnet2 kernels.
