Skip to content

Latest commit

 

History

History
19 lines (11 loc) · 869 Bytes

README.md

File metadata and controls

19 lines (11 loc) · 869 Bytes

PySpark + development version of Mongo Hadoop.

Usage

You can modify the virsion of SPARK_VERSION to get the newest Spark. If you want to get newest version of Mongo-Hadoop, you have to update the ENV of MONGO_HADOOP_VERSION and MONGO_HADOOP_COMMIT.

Build image

sudo docker build -t zero323/mongo-spark --build-arg IP={YOUR-IP} .

Run image

sudo docker run -t -i --net=host --env SPARK_LOCAL_IP=$DOCKER_HOSTNAME zero323/mongo-spark /bin/bash

For details see: Getting Spark, Python, and MongoDB to work together

Reference