Minimal example repo for running Pants with PySpark libraries.
It serves as a proof of concept for running PySpark unit tests in Pants, particularly tests that exercise UDFs.
Also includes a Pants plugin that uses git describe to automatically generate package versions in the python_distribution target.
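The general idea of deriving a version from git describe can be sketched as follows. This is not the plugin's code: the helper names `git_describe` and `version_from_describe`, the assumed tag format (`v1.2.3` or `1.2.3`), and the choice of a PEP 440 `.postN+g<sha>` suffix are all assumptions made for illustration.

```python
import re
import subprocess

# Matches output such as "v1.2.3", "v1.2.3-4-gabc1234", or "v1.2.3-4-gabc1234-dirty".
_DESCRIBE_RE = re.compile(
    r"^v?(?P<tag>\d+(?:\.\d+)*)"
    r"(?:-(?P<distance>\d+)-g(?P<sha>[0-9a-f]+))?"
    r"(?P<dirty>-dirty)?$"
)


def version_from_describe(describe: str) -> str:
    """Convert a git describe string into a PEP 440 style version (hypothetical scheme)."""
    match = _DESCRIBE_RE.match(describe.strip())
    if match is None:
        raise ValueError(f"unrecognized git describe output: {describe!r}")

    version = match["tag"]
    local_parts = []
    distance = int(match["distance"] or 0)
    if distance > 0:
        # Commits after the tag become a post-release with the short SHA attached.
        version += f".post{distance}"
        local_parts.append(f"g{match['sha']}")
    if match["dirty"]:
        local_parts.append("dirty")
    if local_parts:
        version += "+" + ".".join(local_parts)
    return version


def git_describe(repo_dir: str = ".") -> str:
    """Run git describe in repo_dir and return its raw output."""
    result = subprocess.run(
        ["git", "describe", "--tags", "--dirty"],
        cwd=repo_dir,
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout.strip()
```

Under these assumptions, `version_from_describe(git_describe())` would produce the string to use as the package version; a repo with a different tag convention would need a different pattern.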
To run tests with PySpark, JAVA_HOME must be set before running any Pants commands.
macOS example with OpenJDK 11 installed via Homebrew:
export JAVA_HOME=/opt/homebrew/opt/openjdk@11/libexec/openjdk.jdk/Contents/Home/
pants lint ::
pants check ::
pants test ::
pants package ::
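Because the PySpark tests depend on JAVA_HOME, a small preflight check run before the Pants commands can surface a missing or wrong value early. The sketch below is not part of this repo; the function name `java_home_problem` is hypothetical, and it assumes a Unix-style layout with the executable at `bin/java`.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def java_home_problem(env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Return a description of what is wrong with JAVA_HOME, or None if it looks usable."""
    env = os.environ if env is None else env
    java_home = env.get("JAVA_HOME")
    if not java_home:
        return "JAVA_HOME is not set"
    # Assumes a Unix-style JDK layout; Windows would use bin/java.exe instead.
    java_bin = Path(java_home) / "bin" / "java"
    if not java_bin.is_file():
        return f"no java executable found at {java_bin}"
    return None
```

A wrapper script could call `java_home_problem()` and exit with the returned message when it is not None, before invoking any Pants goal.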