[SPARK-20102] Fix nightly packaging and RC packaging scripts w/ two minor build fixes #17437
'pyspark.ml',
'pyspark.ml.linalg',
'pyspark.ml.param',
'pyspark.ml.stat',
@holdenk @viirya, quick question here: I see that this `pyspark.ml.stat` line was added in #16465. The problem with this line is that it seems to cause failures during `python setup.py sdist` packaging because that process searches for a non-existent `pyspark/ml/stat` file / directory.
Was this `ml.stat` just a typo?
I think it was. There is `pyspark.mllib.stat` but no `pyspark.ml.stat`. It looks like the line was added by mistake, overlooking that the `ml` and `mllib` sub-modules are not at parity.
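A check along the following lines could catch this class of mistake before packaging. It is only a minimal sketch and not part of this patch: the `missing_packages` helper and the example package list are hypothetical, and it assumes each dotted package name maps to a directory of the same path under a source root.

```python
import os
import tempfile


def missing_packages(packages, root="."):
    """Return the package names whose directory does not exist under root.

    Hypothetical helper: assumes 'a.b.c' maps to the directory root/a/b/c.
    """
    missing = []
    for name in packages:
        path = os.path.join(root, *name.split("."))
        if not os.path.isdir(path):
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Build a throwaway tree that has pyspark/ml/linalg and pyspark/ml/param
    # but no pyspark/ml/stat, mirroring the situation discussed above.
    with tempfile.TemporaryDirectory() as root:
        for sub in ("linalg", "param"):
            os.makedirs(os.path.join(root, "pyspark", "ml", sub))
        listed = [
            "pyspark.ml",
            "pyspark.ml.linalg",
            "pyspark.ml.param",
            "pyspark.ml.stat",
        ]
        print(missing_packages(listed, root))  # ['pyspark.ml.stat']
```

Running such a check in CI before `python setup.py sdist` would turn a late packaging failure into an early, clearly named one.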
Test build #75246 has finished for PR 17437 at commit
I'm going to merge this into master and branch-2.1, then manually kick off the packaging jobs so that we can get back to publishing nightly package snapshots.
…inor build fixes

## What changes were proposed in this pull request?

The master snapshot publisher builds are currently broken due to two minor build issues:

1. For unknown reasons, the LFTP `mkdir -p` command began throwing errors when the remote directory already exists. This change of behavior might have been caused by configuration changes in the ASF's SFTP server, but I'm not entirely sure of that. To work around this problem, this patch updates the script to ignore errors from the `lftp mkdir -p` commands.
2. The PySpark `setup.py` file references a non-existent `pyspark.ml.stat` module, causing Python packaging to fail by complaining about a missing directory. The fix is to simply drop that line from the setup script.

## How was this patch tested?

The LFTP fix was tested by manually running the failing commands on AMPLab Jenkins against the ASF SFTP server. The PySpark fix was tested locally.

Author: Josh Rosen <joshrosen@databricks.com>

Closes #17437 from JoshRosen/spark-20102.

(cherry picked from commit 314cf51)
Signed-off-by: Josh Rosen <joshrosen@databricks.com>
What changes were proposed in this pull request?
The master snapshot publisher builds are currently broken due to two minor build issues:
1. For unknown reasons, the LFTP `mkdir -p` command began throwing errors when the remote directory already exists. This change of behavior might have been caused by configuration changes in the ASF's SFTP server, but I'm not entirely sure of that. To work around this problem, this patch updates the script to ignore errors from the `lftp mkdir -p` commands.
2. The PySpark `setup.py` file references a non-existent `pyspark.ml.stat` module, causing Python packaging to fail by complaining about a missing directory. The fix is to simply drop that line from the setup script.

How was this patch tested?
The LFTP fix was tested by manually running the failing commands on AMPLab Jenkins against the ASF SFTP server. The PySpark fix was tested locally.
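The LFTP workaround amounts to treating the directory-creation step as best-effort while still failing on everything else. The sketch below illustrates that pattern in Python; it is not the release script itself, and `run_step` and the stand-in commands are hypothetical names used only for illustration.

```python
import subprocess
import sys


def run_step(cmd, ignore_errors=False):
    """Run cmd and return its exit code.

    With ignore_errors=True a non-zero exit is reported but tolerated;
    otherwise it raises subprocess.CalledProcessError.
    """
    result = subprocess.run(cmd)
    if result.returncode != 0:
        if not ignore_errors:
            raise subprocess.CalledProcessError(result.returncode, cmd)
        print(f"ignoring failure (exit {result.returncode}): {cmd}", file=sys.stderr)
    return result.returncode


if __name__ == "__main__":
    # Stand-ins for the real commands (assumptions, not the actual script):
    # the first mimics a mkdir that fails because the remote directory
    # already exists, the second mimics the upload that follows it.
    failing_mkdir = [sys.executable, "-c", "import sys; sys.exit(1)"]
    upload = [sys.executable, "-c", "print('uploading')"]

    run_step(failing_mkdir, ignore_errors=True)  # failure is tolerated
    run_step(upload)                             # failure here would raise
```

The trade-off of this pattern is that a genuine directory-creation failure, such as a permissions problem, is also swallowed and only surfaces when a later step fails.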