Data Reader #2386

cypof · 2015-04-29T00:52:17Z

Part of #2351, but this version distributes data to solvers round-robin instead of through the shared queue, which was not deterministic. Combined with the per-thread random seed initialization in #2367, it should make parallel training reproducible.
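
To illustrate why round-robin distribution is deterministic, here is a minimal Python sketch (not the PR's C++ code). It assumes record `i` goes to solver `i % num_solvers`; the function name `round_robin_partition` and the record names are made up for the example. With a shared queue, which solver gets which record depends on thread timing; here it depends only on the record's position in the database.

```python
from typing import List, Sequence


def round_robin_partition(records: Sequence[str], num_solvers: int) -> List[List[str]]:
    """Assign record i to solver i % num_solvers, preserving database order."""
    if num_solvers <= 0:
        raise ValueError("num_solvers must be positive")
    shards: List[List[str]] = [[] for _ in range(num_solvers)]
    for index, record in enumerate(records):
        shards[index % num_solvers].append(record)
    return shards


# Hypothetical records standing in for database entries.
records = [f"rec{i}" for i in range(7)]
shards = round_robin_partition(records, 3)
```

Running the partition twice on the same input gives the same shards, which is the property the shared queue could not guarantee.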

The data reader sits between a database and each solver's prefetch thread. It makes sure each solver processes a different subset of the database. It also prefetches data, but into host memory, and the amount is configurable. The solvers' prefetch threads, by contrast, prefetch only a small, fixed amount of data, since that data is stored in GPU memory.
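
The reader-to-solver hand-off described above could look roughly like the following Python sketch. It is an illustration under assumptions, not the PR's implementation: one reader thread reads sequentially and deals records into one bounded queue per solver, and the queue bound (`prefetch`, a hypothetical parameter name) plays the role of the configurable host-memory prefetch amount. `start_reader`, `consume_all`, and the `_END` sentinel are invented for the example.

```python
import queue
import threading
from typing import Iterable, List, Tuple

_END = object()  # hypothetical end-of-stream marker, not part of the PR


def start_reader(records: Iterable[str], num_solvers: int,
                 prefetch: int) -> Tuple[List[queue.Queue], threading.Thread]:
    """Read sequentially and deal records round-robin into bounded queues."""
    queues: List[queue.Queue] = [queue.Queue(maxsize=prefetch) for _ in range(num_solvers)]

    def run() -> None:
        for index, record in enumerate(records):
            # put() blocks while the target queue is full, capping host memory use.
            queues[index % num_solvers].put(record)
        for q in queues:
            q.put(_END)

    thread = threading.Thread(target=run, daemon=True)
    thread.start()
    return queues, thread


def consume_all(queues: List[queue.Queue]) -> List[List[str]]:
    """Drain every queue on its own thread, as each solver would."""
    results: List[List[str]] = [[] for _ in queues]

    def drain(i: int) -> None:
        while True:
            item = queues[i].get()
            if item is _END:
                return
            results[i].append(item)

    consumers = [threading.Thread(target=drain, args=(i,)) for i in range(len(queues))]
    for c in consumers:
        c.start()
    for c in consumers:
        c.join()
    return results


queues, reader = start_reader([f"rec{i}" for i in range(10)], num_solvers=3, prefetch=2)
batches = consume_all(queues)
reader.join()
```

In this sketch every queue needs its own consumer: because the reader deals strictly in order, a queue that is never drained eventually blocks the reader for all solvers.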

- A single reading thread is created per source, even if multiple solvers are running in parallel.
- Sources are identified by layer name + source path, in case a net has multiple data layers on the same DB.
- Databases are read sequentially, for better performance.
- Each solver sees a different subset of the database.
- Data is distributed to solvers round-robin to keep parallel training deterministic.


- Makes sure each solver accesses a different subset of the data
- Sequential reading of DB for performance
- Prefetches a configurable amount of data to host memory
- Distributes data to solvers in round-robin way for determinism

shelhamer · 2015-09-26T00:09:45Z

Merged with revisions in #2903, thanks.

cypof force-pushed the data_reader branch from fb2bd58 to 0954181 on April 29, 2015 21:00

cypof mentioned this pull request Apr 29, 2015

Multi-GPU #2114

Closed

cypof force-pushed the data_reader branch from 0954181 to 4f4ee49 on April 30, 2015 22:42

cypof added 5 commits May 18, 2015 17:24

Added BlockingQueue for inter-thread communication.

6108b25
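
For readers unfamiliar with the idea, a blocking queue for inter-thread communication can be sketched in Python as below. This is a generic illustration built on `threading.Condition`, not the class added in this commit; the method names `push`, `pop`, and `try_pop` are assumptions chosen for the example.

```python
import threading
from collections import deque
from typing import Any, Deque, Optional


class BlockingQueue:
    """Unbounded FIFO whose pop() waits until an item is available."""

    def __init__(self) -> None:
        self._items: Deque[Any] = deque()
        self._cond = threading.Condition()

    def push(self, item: Any) -> None:
        with self._cond:
            self._items.append(item)
            self._cond.notify()  # wake one waiting consumer

    def pop(self) -> Any:
        with self._cond:
            while not self._items:  # loop guards against spurious wakeups
                self._cond.wait()
            return self._items.popleft()

    def try_pop(self) -> Optional[Any]:
        """Non-blocking pop; returns None when the queue is empty."""
        with self._cond:
            return self._items.popleft() if self._items else None


q = BlockingQueue()
received = []
consumer = threading.Thread(target=lambda: received.append(q.pop()))
consumer.start()   # blocks inside pop() until something is pushed
q.push("batch0")
consumer.join()
```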

Thread-local Caffe

b8b5a52

Changed the way threads are started and stopped

59bb3ad

- Interrupt the thread before waiting on join
- Provide a method for looping threads to exit on demand
- CHECK if start and stop succeed instead of returning an error
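
The three points above can be sketched in Python as follows. This is only an analogy: Python threads cannot be interrupted, so a `threading.Event` stands in for the interrupt, and `assert` stands in for CHECK. The class and method names (`LoopingThread`, `must_stop`, `is_running`) are hypothetical.

```python
import threading
import time
from typing import Optional


class LoopingThread:
    """Worker that loops until asked to stop (illustrative only)."""

    def __init__(self) -> None:
        self._stop_requested = threading.Event()
        self._thread: Optional[threading.Thread] = None
        self.iterations = 0

    def must_stop(self) -> bool:
        """Polled by the loop so it can exit on demand."""
        return self._stop_requested.is_set()

    def is_running(self) -> bool:
        return self._thread is not None and self._thread.is_alive()

    def start(self) -> None:
        # Fail loudly instead of returning an error code.
        assert self._thread is None, "thread already started"
        self._stop_requested.clear()
        self._thread = threading.Thread(target=self._run)
        self._thread.start()

    def stop(self) -> None:
        assert self._thread is not None, "thread not started"
        self._stop_requested.set()  # signal first, so join cannot wait forever
        self._thread.join()
        self._thread = None

    def _run(self) -> None:
        while not self.must_stop():
            self.iterations += 1
            # wait() returns early once stop is requested, unlike a plain sleep
            self._stop_requested.wait(0.001)


worker = LoopingThread()
worker.start()
while worker.iterations == 0:  # let the loop run at least once
    time.sleep(0.001)
worker.stop()
```

Signalling before joining matters because a loop blocked in a wait would otherwise never notice that it should exit.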

Persistent prefetch thread

01cbda5

cypof force-pushed the data_reader branch from 4f4ee49 to 0bd8238 on May 19, 2015 01:06

shelhamer mentioned this pull request Jun 30, 2015

Improve data prefetching efficiency #1535

Closed

shelhamer closed this Sep 26, 2015
