Skip to content

Minimum viable example repo for running Pants with PySpark libraries

License

Notifications You must be signed in to change notification settings

jzmnd/pants-pyspark-mve

Repository files navigation

pants-pyspark-mve

Background

Minimum viable example repo for running Pants with PySpark libraries. Motivation was to provide a proof of concept for running PySpark unit tests in Pants, particularly when using various UDFs. Also includes a Pants plugin that uses git describe to automatically generate package versions in the python_distribution target.

How to

To run tests with PySpark, JAVA_HOME must be set before running any Pants commands.

MacOS example with openjdk 11 installed via homebrew:

export JAVA_HOME=/opt/homebrew/opt/openjdk@11/libexec/openjdk.jdk/Contents/Home/
pants lint ::
pants check ::
pants test ::
pants package ::

About

Minimum viable example repo for running Pants with PySpark libraries

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published