[SPARK-18793] [SPARK-18794] [R] add spark.randomForest/spark.gbt to vignettes #16264

mengxr · 2016-12-13T06:59:27Z

What changes were proposed in this pull request?

Mention spark.randomForest and spark.gbt in vignettes. Keep the content minimal since users can type ?spark.randomForest to see the full doc.

cc: @jkbradley

felixcheung · 2016-12-13T07:28:53Z

LGTM. maybe good to have one example for classification but optional

felixcheung · 2016-12-13T07:35:52Z

and if we are doing "(Added in Spark 2.1.0)" in https://github.com/apache/spark/pull/16222/files perhaps we should have the same for these 2?

actually, I'd vote for removing them - vignettes is for that specific version of package you install. If you install SparkR 2.1.0 what is described there is in 2.1.0, by default

felixcheung · 2016-12-13T07:37:45Z

R/pkg/vignettes/sparkr-vignettes.Rmd

 ```{r}
 df <- createDataFrame(longley)
-rfModel <- spark.randomForest(df, Employed ~ ., type = "regression", numTrees = 5)
+rfModel <- spark.randomForest(df, Employed ~ ., type = "regression", maxDepth = 2, numTrees = 2)


you could also do this to limit output

ops <- options() options(max.print=40)

there is another example in the vignettes

The current length of the output seems fine (<40 lines).

felixcheung · 2016-12-13T07:42:09Z

R/pkg/vignettes/sparkr-vignettes.Rmd

+
+In the following example, we use the `longley` dataset to train a random forest and make predictions:
+
+```{r}


longley has columns with . like Armed.Forces
suggest putting warning = FALSE like here otherwise we will have a warning in the output vignettes

SparkQA · 2016-12-13T07:42:57Z

Test build #70068 has finished for PR 16264 at commit 292745b.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-12-13T07:56:25Z

Test build #70070 has finished for PR 16264 at commit be9c846.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-12-13T07:57:57Z

Test build #70069 has finished for PR 16264 at commit fb1933d.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

mengxr · 2016-12-13T20:20:31Z

Agree that it is not necessary to mention the version added here. I will send a follow-up PR after this one to rearrange the ordering of the ML algorithms. There is no logical ordering now.

SparkQA · 2016-12-13T21:01:42Z

Test build #70095 has finished for PR 16264 at commit b3bf19f.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

mengxr · 2016-12-14T01:00:20Z

Merged into master and branch-2.1. The last commit passed Jenkins.

…nettes ## What changes were proposed in this pull request? Mention `spark.randomForest` and `spark.gbt` in vignettes. Keep the content minimal since users can type `?spark.randomForest` to see the full doc. cc: jkbradley Author: Xiangrui Meng <meng@databricks.com> Closes #16264 from mengxr/SPARK-18793. (cherry picked from commit 594b14f) Signed-off-by: Xiangrui Meng <meng@databricks.com>

mengxr · 2016-12-14T03:02:42Z

@HyukjinKwon Could you take a look at the AppVeyor error?

felixcheung · 2016-12-14T03:08:31Z

That looks like a network error when accessing github

HyukjinKwon · 2016-12-14T03:49:42Z

Yup, it seems failed time to time due to network problem. FYI, there was some discussions about this in #15686 (comment) and #15697 (comment)

I could not find a good workaround to re-trigger this so far rather than closing and reopening and it seems (I manually checked and privately asked if it is true) some Apahce projects using Travis CI/AppVeyor are also re-triggering this via closing and opening.

…nettes ## What changes were proposed in this pull request? Mention `spark.randomForest` and `spark.gbt` in vignettes. Keep the content minimal since users can type `?spark.randomForest` to see the full doc. cc: jkbradley Author: Xiangrui Meng <meng@databricks.com> Closes apache#16264 from mengxr/SPARK-18793.

mention spark.randomForest in vignettes

292745b

mengxr changed the title ~~SPARK-18792] [R] add spark.randomForest to vignettes~~ SPARK-18793] [R] add spark.randomForest to vignettes Dec 13, 2016

mengxr changed the title ~~SPARK-18793] [R] add spark.randomForest to vignettes~~ [SPARK-18793] [R] add spark.randomForest to vignettes Dec 13, 2016

add spark.gbt as well

fb1933d

mengxr changed the title ~~[SPARK-18793] [R] add spark.randomForest to vignettes~~ [SPARK-18793] [SPARK-18794] [R] add spark.randomForest/spark.gbt to vignettes Dec 13, 2016

update params so the output is shorter

be9c846

felixcheung reviewed Dec 13, 2016

View reviewed changes

address comments

b3bf19f

asfgit closed this in 594b14f Dec 14, 2016


		In the following example, we use the `longley` dataset to train a random forest and make predictions:

		```{r}

[SPARK-18793] [SPARK-18794] [R] add spark.randomForest/spark.gbt to vignettes #16264

[SPARK-18793] [SPARK-18794] [R] add spark.randomForest/spark.gbt to vignettes #16264

Uh oh!

Conversation

mengxr commented Dec 13, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Uh oh!

felixcheung commented Dec 13, 2016

Uh oh!

felixcheung commented Dec 13, 2016

Uh oh!

felixcheung Dec 13, 2016

Choose a reason for hiding this comment

Uh oh!

mengxr Dec 13, 2016

Choose a reason for hiding this comment

Uh oh!

felixcheung Dec 13, 2016

Choose a reason for hiding this comment

Uh oh!

mengxr Dec 13, 2016

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Dec 13, 2016

Uh oh!

SparkQA commented Dec 13, 2016

Uh oh!

SparkQA commented Dec 13, 2016

Uh oh!

mengxr commented Dec 13, 2016

Uh oh!

SparkQA commented Dec 13, 2016

Uh oh!

mengxr commented Dec 14, 2016

Uh oh!

mengxr commented Dec 14, 2016

Uh oh!

felixcheung commented Dec 14, 2016 via email

Uh oh!

HyukjinKwon commented Dec 14, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mengxr commented Dec 13, 2016 •

edited

Loading

HyukjinKwon commented Dec 14, 2016 •

edited

Loading