Contributor

@JoshRosen JoshRosen commented Mar 26, 2017

What changes were proposed in this pull request?

The master snapshot publisher builds are currently broken due to two minor build issues:

  1. For unknown reasons, the LFTP mkdir -p command began throwing errors when the remote directory already exists. This change of behavior might have been caused by configuration changes in the ASF's SFTP server, but I'm not entirely sure of that. To work around this problem, this patch updates the script to ignore errors from the lftp mkdir -p commands.
  2. The PySpark setup.py file references a non-existent pyspark.ml.stat module, causing Python packaging to fail by complaining about a missing directory. The fix is to simply drop that line from the setup script.
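As a rough illustration of the first workaround, here is a minimal Python sketch of running a command while tolerating a non-zero exit status. This is not the actual change to the release script (which is a shell script); the helper name `run_ignoring_errors` is hypothetical and only shows the general idea of ignoring a failing `mkdir -p` step.

```python
import subprocess
import sys


def run_ignoring_errors(cmd):
    """Run cmd (a list of arguments) and return its exit code.

    Hypothetical helper: a non-zero exit status is logged and ignored
    instead of aborting the surrounding build.
    """
    # check=False means a failing command does not raise CalledProcessError.
    result = subprocess.run(cmd, check=False)
    if result.returncode != 0:
        print(
            f"ignoring exit status {result.returncode} from: {' '.join(cmd)}",
            file=sys.stderr,
        )
    return result.returncode
```

The trade-off is that genuine failures of the directory-creation step are also swallowed, so any later upload step would be the first place a real problem shows up.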

How was this patch tested?

The LFTP fix was tested by manually running the failing commands on AMPLab Jenkins against the ASF SFTP server. The PySpark fix was tested locally.

@JoshRosen
Contributor Author

/cc @marmbrus @holdenk, this patch should fix the build issues blocking Spark 2.1.1 RC1 builds.

'pyspark.ml',
'pyspark.ml.linalg',
'pyspark.ml.param',
'pyspark.ml.stat',
Contributor Author

@holdenk @viirya, quick question here: I see that this pyspark.ml.stat line was added in #16465. The problem with this line is that it seems to cause failures during python setup.py sdist packaging because that process searches for a non-existent pyspark/ml/stat file / directory.

Was this ml.stat just a typo?

Member

I think it was. There is a pyspark.mllib.stat module but no pyspark.ml.stat. It seems the line was added by mistake, on the assumption that the ml sub-modules mirror the mllib ones, which they do not.
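A check along these lines could catch this kind of mismatch before packaging. This is a minimal sketch, assuming the package names are available as a plain list of dotted names and that each one should map to a directory under the source root; the function name `missing_package_dirs` is hypothetical and not part of setup.py or setuptools.

```python
import os


def missing_package_dirs(packages, root="."):
    """Return the dotted package names that have no directory under root.

    Hypothetical helper: 'pyspark.ml.stat' is expected at
    <root>/pyspark/ml/stat.
    """
    return [
        name
        for name in packages
        if not os.path.isdir(os.path.join(root, *name.split(".")))
    ]
```

Called with the four package names shown in the diff context above and a tree that has no pyspark/ml/stat directory, it would report only 'pyspark.ml.stat'.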

@JoshRosen JoshRosen changed the title [SPARK-20102] Fix nightly snapshot and RC packaging scripts w/ two minor build fixes [SPARK-20102] Fix nightly paackaging and RC packaging scripts w/ two minor build fixes Mar 26, 2017
@JoshRosen JoshRosen changed the title [SPARK-20102] Fix nightly paackaging and RC packaging scripts w/ two minor build fixes [SPARK-20102] Fix nightly packaging and RC packaging scripts w/ two minor build fixes Mar 26, 2017

SparkQA commented Mar 27, 2017

Test build #75246 has finished for PR 17437 at commit 867ca2c.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@JoshRosen
Contributor Author

I'm going to merge this into master and branch-2.1, then manually kick off the packaging jobs so that we can get back to publishing nightly package snapshots.

asfgit pushed a commit that referenced this pull request Mar 27, 2017
…inor build fixes

Author: Josh Rosen <joshrosen@databricks.com>

Closes #17437 from JoshRosen/spark-20102.

(cherry picked from commit 314cf51)
Signed-off-by: Josh Rosen <joshrosen@databricks.com>
@asfgit asfgit closed this in 314cf51 Mar 27, 2017
@JoshRosen JoshRosen deleted the spark-20102 branch March 27, 2017 17:43