-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-16403][Examples] Cleanup to remove unused imports, consistent style, minor fixes #14081
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-16403][Examples] Cleanup to remove unused imports, consistent style, minor fixes #14081
Conversation
|
cc @MLnick @yanboliang - either of you mind taking a look? Thanks! |
|
Test build #61880 has finished for PR 14081 at commit
|
|
Test build #61883 has finished for PR 14081 at commit
|
|
Test build #61885 has finished for PR 14081 at commit
|
| spark = SparkSession\ | ||
| .builder\ | ||
| .appName("PythonGuassianMixtureExample")\ | ||
| .appName("GuassianMixtureExample")\ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Typo still here: Guassian
|
Test build #61918 has finished for PR 14081 at commit
|
|
Test build #61922 has finished for PR 14081 at commit
|
|
I'm going to merge this to master only if nobody has further comments. |
|
Thanks for the review Sean!
|
|
I realized that the "PipelineExample"s are included in the docs, while the "SimpleTextClassificationExample"s are not, so it might be better to keep those instead. I just changed the data and regularization value to that of "SimpleTextClassificationExample" which gives correct predictions (it looks like these examples were updated at one time by DB to fix this, but the change was not put into the doc example). |
|
Test build #62072 has finished for PR 14081 at commit
|
|
I noticed that in a similar vein, SimpleParamsExample was a duplicate of EstimatorTransformerParamExample in Scala, Java, and Python, and removed those. |
|
Test build #62118 has finished for PR 14081 at commit
|
|
Double checking with @jkbradley that the example removals look OK? |
|
Merged to master |
|
Thanks @srowen!
|
What changes were proposed in this pull request?
Cleanup of examples, mostly from PySpark-ML to fix minor issues: unused imports, style consistency, pipeline_example is a duplicate, use future print funciton, and a spelling error.
How was this patch tested?
local tests and run modified examples