Skip to content

drj42/docker-spark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

docker-spark

A basic dockerfile for messing around with pyspark standalone.

  • Spark version: 2.1.0
  • Python version: 3.5

Launch spark shell

docker-compose run spark bin/spark-shell

Launch pyspark shell with ipython

docker-compose run -e PYSPARK_PYTHON=ipython spark bin/pyspark

Run spark job

docker-compose run spark bin/spark-submit scripts/word_count.py data/words.txt

About

Apache Spark Standalone & Docker

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published