A C++ implementation of RDD (Resilient Distributed Datasets).

Requirements:
- clang or gcc/g++ 4.8+ or Intel Compiler
- CMake 2.8+
- jubatus-msgpack-rpc
- Intel Threading Building Blocks
- google-sparsehash

Features:
- MapReduce

Build:
$ mkdir build
$ cd build
$ cmake ..
$ make

Usage:
- Start worker(s)
$ ./bin/RDDWorker [job_port]
- Start your master application (e.g. WordCount)
$ ./bin/WordCount [path_to_workers.conf] [path_to_text_file] [path_to_Mapper.so] [path_to_Reducer.so]
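
For readers unfamiliar with the WordCount example, the following is a minimal single-process sketch of the map and reduce steps such a job performs. It does not use this project's plugin interface: `map_line`, `reduce_counts`, `word_count`, and the `KeyValue` alias are hypothetical names chosen for illustration, and nothing here is distributed across workers.

```cpp
// Hypothetical single-process sketch of the WordCount map and reduce steps.
// None of these names come from this project's Mapper/Reducer interface.
#include <map>
#include <sstream>
#include <string>
#include <utility>
#include <vector>

using KeyValue = std::pair<std::string, int>;

// Map step: emit (word, 1) for every whitespace-separated word in a line.
std::vector<KeyValue> map_line(const std::string& line) {
    std::vector<KeyValue> out;
    std::istringstream in(line);
    std::string word;
    while (in >> word) {
        out.emplace_back(word, 1);
    }
    return out;
}

// Reduce step: sum the counts emitted for each distinct word.
std::map<std::string, int> reduce_counts(const std::vector<KeyValue>& pairs) {
    std::map<std::string, int> totals;
    for (const auto& kv : pairs) {
        totals[kv.first] += kv.second;
    }
    return totals;
}

// Driver: run the map step over every line, then reduce all emitted pairs.
std::map<std::string, int> word_count(const std::vector<std::string>& lines) {
    std::vector<KeyValue> emitted;
    for (const auto& line : lines) {
        std::vector<KeyValue> pairs = map_line(line);
        emitted.insert(emitted.end(), pairs.begin(), pairs.end());
    }
    return reduce_counts(emitted);
}
```

In the distributed setting, the two steps would presumably live in the separate Mapper and Reducer shared objects passed on the command line, with the master application playing the role of the `word_count` driver; the exact interface those shared objects must expose is not shown here.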