I want to run Caffe with a small stride on big images and ran into memory issues. I tried out PR #520, but even on fairly small images (480x640) and a moderately sized model (params and blobs take about 2 GB on that image), CPU memory consumption is at ~12 GB. I assume the difference comes from the col buffers, since that's where I get the out-of-memory error. My understanding of FFT-based convolutions is that they won't solve my memory problems either.
What do you think about adding a convolution implementation that doesn't use additional memory? cuda-convnet [1] seems to be quite fast, judging from the benchmarks at [2]. The convolution code doesn't exactly look simple, so adding it to Caffe doesn't look like a no-brainer to me. Does it make sense?
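For anyone wondering where the extra memory goes: the im2col ("col buffer") scheme unrolls every receptive field into a column so the convolution becomes a matrix multiply, which multiplies the per-layer activation footprint by roughly kernel². A rough back-of-the-envelope sketch (the layer parameters below are hypothetical, not taken from the report above):

```python
# Rough estimate of the im2col ("col buffer") memory for one convolution layer.
# Parameters here are illustrative only, not the actual model from this issue.

def col_buffer_bytes(channels, kernel, stride, pad, height, width, dtype_bytes=4):
    # Output spatial size for a standard convolution.
    h_out = (height + 2 * pad - kernel) // stride + 1
    w_out = (width + 2 * pad - kernel) // stride + 1
    # im2col unrolls each receptive field into one column:
    # (channels * kernel * kernel) rows x (h_out * w_out) columns.
    return channels * kernel * kernel * h_out * w_out * dtype_bytes

# Example: 3-channel 480x640 input, 11x11 kernel, stride 1, no padding.
input_mib = 3 * 480 * 640 * 4 / (1024 ** 2)
col_mib = col_buffer_bytes(3, 11, 1, 0, 480, 640) / (1024 ** 2)
print(f"input: {input_mib:.1f} MiB, col buffer: {col_mib:.1f} MiB")
# The col buffer is ~kernel^2 (here ~121x) larger than the input it covers,
# which is why small strides on big images blow up so quickly.
```

With many channels and multiple such layers, this overhead easily dwarfs the 2 GB of params and blobs.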
It is a little non-trivial, and cuda-convnet actually uses a different order. We are exploring alternate approaches which may achieve the same (or better) goal, so incorporating cuda-convnet may not be on our radar (at least for now).
Closing as duplicate of #830 to focus the conversation now that the memory aspect has been noted there. While we expect our alternative approach to address memory usage and speed, you are welcome to try integrating cuda-convnet2 convolution for comparison.
[1] https://code.google.com/p/cuda-convnet/
[2] https://github.com/soumith/convnet-benchmarks