[SPARK-18793] [SPARK-18794] [R] add spark.randomForest/spark.gbt to vignettes #16264
Conversation
LGTM. Maybe good to have one example for classification, but that's optional.

And if we are doing "(Added in Spark 2.1.0)" in https://github.com/apache/spark/pull/16222/files, perhaps we should have the same for these two? Actually, I'd vote for removing them: a vignette describes the specific version of the package you install. If you install SparkR 2.1.0, what is described there is in 2.1.0 by default.
    ```{r}
    df <- createDataFrame(longley)
    rfModel <- spark.randomForest(df, Employed ~ ., type = "regression", numTrees = 5)
    rfModel <- spark.randomForest(df, Employed ~ ., type = "regression", maxDepth = 2, numTrees = 2)
You could also do this to limit output:
    ops <- options()
    options(max.print=40)
There is another example in the vignettes.
The current length of the output seems fine (<40 lines).
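For reference, the output-limiting pattern suggested above can be sketched in base R roughly as follows. This is a minimal illustration, not the chunk used in the vignette: the printed vector is only a stand-in for model output, and the variable name `old` is chosen here for the example.

```r
# Set max.print and keep the previous value. options() returns the old
# values of the options it changes, so they can be restored later.
old <- options(max.print = 40)

# Stand-in for a long model summary: printing stops after 40 entries.
print(seq_len(100))

# Restore the previous setting.
options(old)
```

Saving only the options that were changed keeps the restore step narrow. Saving everything with `ops <- options()` and restoring with `options(ops)`, as in the comment above, should work as well.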
R/pkg/vignettes/sparkr-vignettes.Rmd
Outdated
    In the following example, we use the `longley` dataset to train a random forest and make predictions:

    ```{r}
`longley` has columns with `.` in their names, like `Armed.Forces`.
Suggest putting `warning = FALSE` like here; otherwise we will have a warning in the output vignettes.
updated

Test build #70068 has finished for PR 16264 at commit

Test build #70070 has finished for PR 16264 at commit

Test build #70069 has finished for PR 16264 at commit

Agree that it is not necessary to mention the version added here. I will send a follow-up PR after this one to rearrange the ordering of the ML algorithms. There is no logical ordering now.

Test build #70095 has finished for PR 16264 at commit

Merged into master and branch-2.1. The last commit passed Jenkins.
…nettes

## What changes were proposed in this pull request?

Mention `spark.randomForest` and `spark.gbt` in vignettes. Keep the content minimal since users can type `?spark.randomForest` to see the full doc.

cc: jkbradley

Author: Xiangrui Meng <meng@databricks.com>

Closes #16264 from mengxr/SPARK-18793.

(cherry picked from commit 594b14f)
Signed-off-by: Xiangrui Meng <meng@databricks.com>

@HyukjinKwon Could you take a look at the AppVeyor error?

That looks like a network error when accessing GitHub.

Yup, it seems to fail from time to time due to network problems. FYI, there was some discussion about this in #15686 (comment) and #15697 (comment). I have not found a good workaround to re-trigger this so far other than closing and reopening the PR, and it seems (I checked manually and asked privately whether this is true) that some Apache projects using Travis CI/AppVeyor also re-trigger builds by closing and reopening.
What changes were proposed in this pull request?
Mention `spark.randomForest` and `spark.gbt` in vignettes. Keep the content minimal since users can type `?spark.randomForest` to see the full doc.
cc: @jkbradley