A basic Dockerfile for messing around with PySpark in standalone mode.
- Spark version: 2.1.0
- Python version: 3.5
docker-compose run spark bin/spark-shell
docker-compose run -e PYSPARK_PYTHON=ipython spark bin/pyspark
docker-compose run spark bin/spark-submit scripts/word_count.py data/words.txt
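
The `spark-submit` command above runs `scripts/word_count.py` against `data/words.txt`; the script itself is not shown here. As a rough sketch only, a word count script of that kind might look like the following. The function names `tokenize` and `main`, the whitespace-based splitting, and the output format are assumptions for illustration, not the contents of the actual script. It sticks to `str.format` rather than f-strings so that it runs on Python 3.5.

```python
import sys
from operator import add


def tokenize(line):
    """Split a line into lowercase words on whitespace (hypothetical helper)."""
    return line.lower().split()


def main(path):
    # Imported here so the helper above can be used without Spark installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("word_count").getOrCreate()

    # Read the file as an RDD of lines, emit (word, 1) pairs, sum per word.
    counts = (
        spark.sparkContext.textFile(path)
        .flatMap(tokenize)
        .map(lambda word: (word, 1))
        .reduceByKey(add)
    )

    # collect() pulls all results to the driver; fine for small test files.
    for word, count in counts.collect():
        print("{}: {}".format(word, count))

    spark.stop()


# Only run the Spark job when exactly one path argument is supplied.
if __name__ == "__main__" and len(sys.argv) == 2:
    main(sys.argv[1])
```

Under these assumptions the input path is taken from the first command-line argument, which matches how `data/words.txt` is passed in the command above.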