Ensure consistency between memory alloc and free #3073

ronghanghu · 2015-09-16T19:15:49Z

Add a bool flag to record whether host memory was allocated with malloc or cudaMallocHost, and free it according to that flag instead of depending on Caffe::mode(), which can change at runtime.

This should fix #3053.
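A minimal sketch of the idea described above, not the code in this PR. It assumes hypothetical helpers `HostMalloc` / `HostFree`, a global `g_mode` standing in for Caffe::mode(), and `PinnedAlloc` / `PinnedFree` as CUDA-free stand-ins for cudaMallocHost / cudaFreeHost so that it builds without a GPU toolchain. The free path consults the recorded flag and never the current mode.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>

// Hypothetical stand-ins for cudaMallocHost / cudaFreeHost (assumption:
// used only so the sketch compiles without CUDA). Counters track how many
// blocks from each allocator are still live.
int g_pinned_live = 0;
int g_malloc_live = 0;

void* PinnedAlloc(std::size_t n) { ++g_pinned_live; return std::malloc(n); }
void PinnedFree(void* p) { --g_pinned_live; std::free(p); }

// Global mode that can change at runtime, standing in for Caffe::mode().
enum class Mode { CPU, GPU };
Mode g_mode = Mode::CPU;

// Allocate host memory and record in *use_cuda which allocator was used.
void HostMalloc(void** ptr, std::size_t size, bool* use_cuda) {
  if (g_mode == Mode::GPU) {
    *ptr = PinnedAlloc(size);
    *use_cuda = true;
    return;
  }
  *ptr = std::malloc(size);
  ++g_malloc_live;
  *use_cuda = false;
}

// Free with the allocator recorded at allocation time; g_mode is ignored.
void HostFree(void* ptr, bool use_cuda) {
  if (use_cuda) {
    PinnedFree(ptr);
    return;
  }
  std::free(ptr);
  --g_malloc_live;
}

// Allocate in CPU mode, flip the mode, then free. Returns true when every
// block went back to the allocator it came from.
bool AllocThenSwitchModeThenFree() {
  g_mode = Mode::CPU;
  void* p = nullptr;
  bool use_cuda = false;
  HostMalloc(&p, 64, &use_cuda);
  assert(p != nullptr);
  g_mode = Mode::GPU;     // mode changes between alloc and free
  HostFree(p, use_cuda);  // still released through the malloc/free path
  return g_malloc_live == 0 && g_pinned_live == 0;
}
```

If `HostFree` instead branched on `g_mode`, the block above would be handed to the pinned-memory free routine even though it came from malloc, which is the kind of mismatch the flag is meant to rule out.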

ronghanghu · 2015-09-23T16:21:09Z

I hope to merge this PR if no one objects. It ensures that memory is allocated and freed consistently.

shelhamer · 2015-09-23T18:05:03Z

Let's hold off a bit. Although this addresses the immediate issue, the root cause is that Nets do not know their own state / mode, and adding more state to SyncedMem isn't necessarily how we want to work around that. However, I might be convinced this is the way to go for now since it works.

ronghanghu · 2015-09-23T18:08:54Z

> Although this addresses the immediate issue, the root cause is that Nets do not know their own state / mode, and adding more state to SyncedMem isn't necessarily how we want to work around that.

I agree. In the spirit of #1500, I think it is better to have mode and device in the net (or in layers, if we plan to have an individual mode/device for each layer).

ronghanghu · 2015-09-23T18:20:56Z

However, I think that even if the net knows its state/mode/device, that is still not enough: to make SyncedMemory and Blob self-contained data structures (not depending on the net holding them), the SyncedMemory class needs to know things like how its memory was allocated, on which device, etc.

So personally I still think the bool flag in this PR is more or less necessary for SyncedMemory. But I am open to opinions and suggestions on this issue.
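As an illustration of what "self-contained" could mean here, the following is one possible alternative to a bool flag, not what this PR implements: the buffer carries its own deleter, so whoever destroys it needs no knowledge of the global mode. `HostBuffer`, `MakeHostBuffer`, `FreePlainHost`, and `FreePinnedHost` are hypothetical names, and the pinned path is simulated with malloc/free so the sketch builds without CUDA.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <memory>

// Counters so the choice of deleter can be observed.
int g_plain_frees = 0;
int g_pinned_frees = 0;

// Release path for memory obtained from malloc.
void FreePlainHost(void* p) { ++g_plain_frees; std::free(p); }

// Hypothetical stand-in for a cudaFreeHost-based release path.
void FreePinnedHost(void* p) { ++g_pinned_frees; std::free(p); }

// The deleter travels with the pointer, so the buffer itself records how
// it must be released.
using HostBuffer = std::unique_ptr<void, void (*)(void*)>;

HostBuffer MakeHostBuffer(std::size_t size, bool pinned) {
  // Assumption: a real pinned path would call cudaMallocHost here.
  void* p = std::malloc(size);
  assert(p != nullptr);
  return HostBuffer(p, pinned ? &FreePinnedHost : &FreePlainHost);
}
```

A caller would write `HostBuffer b = MakeHostBuffer(1024, true);` and let `b` go out of scope; the matching release function runs regardless of any mode change in between. Recording the device would need an extra field, which this sketch leaves out.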

longjon · 2015-09-25T22:05:01Z

src/caffe/syncedmem.cpp

@@ -154,4 +154,3 @@ void SyncedMemory::async_gpu_push(const cudaStream_t& stream) {
 #endif

 }  // namespace caffe
-


Can we remove this whitespace change from the commit?

shelhamer · 2015-09-25T22:07:11Z

Alright, this is good to merge for solving the immediate issue, and we will further plan #1500 for diffusing mode + device through Net, Layer, and Blob.

ronghanghu · 2015-09-26T18:14:17Z

This should be good now :)

Ensure consistency between memory alloc and free

ronghanghu force-pushed the consistent-malloc-free branch 9 times, most recently from 82ae14b to e8e95da · September 16, 2015 20:47

ronghanghu added the ready for review label Sep 17, 2015

longjon reviewed Sep 25, 2015

Add flag on how host memory is allocated

bd5f154

Add a bool flag to record whether a host memory is allocated using malloc or cudaMallocHost, and free correspondingly using this flag, instead of depending on Caffe::mode(), which is mutable during runtime.

ronghanghu force-pushed the consistent-malloc-free branch from e8e95da to bd5f154 · September 25, 2015 22:52

ronghanghu added a commit that referenced this pull request Sep 26, 2015

Merge pull request #3073 from ronghanghu/consistent-malloc-free

ff16f6e

Ensure consistency between memory alloc and free

ronghanghu merged commit ff16f6e into BVLC:master Sep 26, 2015

ronghanghu deleted the consistent-malloc-free branch September 26, 2015 18:14
