Improve data prefetching efficiency #1535
cc: @mtamburrano @sguada
@futurely it would be good to separate prefetching and data transformation using a thread pool and a shared buffer, so a PR along those lines would be welcome.
A concurrent buffer introduces another dependency, Intel TBB. Is that acceptable?
@futurely take a look at https://gist.github.com/sguada/1e1d474a25f4ddcc7ba8 for a draft of a concurrent buffer.
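For concreteness, here is a minimal sketch of such a bounded blocking queue built on C++11 primitives (the `BlockingQueue` name and its capacity are illustrative, not part of Caffe or the gist). Producers block when the buffer is full, which naturally bounds the memory spent on prefetched batches.

```cpp
#include <condition_variable>
#include <deque>
#include <mutex>
#include <utility>

// Minimal bounded blocking queue: producers block when full,
// consumers block when empty.
template <typename T>
class BlockingQueue {
 public:
  explicit BlockingQueue(size_t capacity) : capacity_(capacity) {}

  void Push(T item) {
    std::unique_lock<std::mutex> lock(mutex_);
    not_full_.wait(lock, [this] { return queue_.size() < capacity_; });
    queue_.push_back(std::move(item));
    not_empty_.notify_one();
  }

  T Pop() {
    std::unique_lock<std::mutex> lock(mutex_);
    not_empty_.wait(lock, [this] { return !queue_.empty(); });
    T item = std::move(queue_.front());
    queue_.pop_front();
    not_full_.notify_one();
    return item;
  }

 private:
  const size_t capacity_;
  std::deque<T> queue_;
  std::mutex mutex_;
  std::condition_variable not_full_, not_empty_;
};
```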
Some time ago, I tried to implement a similar one in a project but ultimately found that it is hard to make a concurrent data structure both correct and efficient. Thorough unit tests of the blocking queue need to be added to build confidence before it is used in production. TBB's concurrent containers are largely lock-free, so multiple threads can access them simultaneously; in this use case, though, blocking may not be a performance bottleneck.
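If the TBB dependency were accepted, its bounded queue already provides this blocking behavior out of the box. A minimal usage sketch follows; the `Batch` type is a hypothetical placeholder for the prefetched data.

```cpp
#include <tbb/concurrent_queue.h>

struct Batch { /* data and label blobs */ };  // hypothetical placeholder

int main() {
  tbb::concurrent_bounded_queue<Batch*> buffer;
  buffer.set_capacity(4);   // bounds the memory held by prefetched batches

  buffer.push(new Batch);   // producer side: blocks when at capacity

  Batch* batch = nullptr;
  buffer.pop(batch);        // consumer side: blocks until an item arrives
  delete batch;
  return 0;
}
```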
The current implementation prefetches only a single batch, and only after the previous batch has been consumed by computation. The initial motivation for this design was probably to avoid thread synchronization, but as a result computation and data IO do not overlap maximally. This becomes a serious bottleneck when multiple devices train on the data simultaneously (#1148).
IO efficiency can be increased by continuously prefetching multiple batches, without waiting for the computation thread, and storing them in a thread-safe buffer of limited capacity.
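As a sketch of that design, a single prefetch thread could keep a bounded buffer full while the computation thread drains it. This reuses the `BlockingQueue` sketch above; `Batch` and `LoadNextBatch` are hypothetical stand-ins for the data layer's blobs and reading code, not Caffe API.

```cpp
#include <thread>

// Assumes the BlockingQueue sketch above.
struct Batch { /* data and label blobs */ };

Batch LoadNextBatch() {
  // Placeholder: disk IO plus data transformation would happen here.
  return Batch{};
}

void PrefetchLoop(BlockingQueue<Batch>* buffer) {
  for (;;) {
    buffer->Push(LoadNextBatch());  // blocks only when the buffer is full
  }
}

int main() {
  BlockingQueue<Batch> buffer(4);  // capacity > 1 lets IO run ahead of compute
  std::thread prefetcher(PrefetchLoop, &buffer);
  for (;;) {
    Batch batch = buffer.Pop();    // blocks only when the buffer is empty
    // ... run forward/backward on batch ...
  }
}
```

With a capacity greater than one, the prefetcher keeps reading while a batch is being computed on, so IO stalls only when the buffer is genuinely full or empty.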