[SPARK-18110][PYTHON][ML] add missing parameter in Python for RandomForest regression and classification #15638
  class RandomForestRegressor(JavaEstimator, HasFeaturesCol, HasLabelCol, HasPredictionCol, HasSeed,
                              RandomForestParams, TreeRegressorParams, HasCheckpointInterval,
-                             JavaMLWritable, JavaMLReadable):
+                             JavaMLWritable, JavaMLReadable, HasVarianceCol):
Would you like to group all the Has* parameters? Just a minor comment on the style.
That doesn't seem to be the existing style, so I'm adding it to the end instead.
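For context on the ordering question above, here is a minimal sketch of how appending a mixin to the end of the base list behaves. The classes are simplified, hypothetical stand-ins (not the real PySpark `Params` mixins), and `mro_names` is a helper invented for this illustration. The point is only that the existing bases keep their relative order in the method resolution order and the new mixin contributes its default.

```python
class Params:
    """Hypothetical stand-in for a params base class (not PySpark's)."""

    def __init__(self):
        self.defaults = {}


class HasPredictionCol(Params):
    def __init__(self):
        super().__init__()
        self.defaults["predictionCol"] = "prediction"


class HasSeed(Params):
    def __init__(self):
        super().__init__()
        self.defaults["seed"] = None


class HasVarianceCol(Params):
    def __init__(self):
        super().__init__()
        self.defaults["varianceCol"] = None


class RegressorBefore(HasPredictionCol, HasSeed):
    """Base list before the change."""


class RegressorAfter(HasPredictionCol, HasSeed, HasVarianceCol):
    """Base list after the change: the new mixin is appended at the end."""


def mro_names(cls):
    """Return the class names in method resolution order."""
    return [c.__name__ for c in cls.__mro__]


if __name__ == "__main__":
    print(mro_names(RegressorAfter))
    print(RegressorAfter().defaults)
```

Because every stand-in calls `super().__init__()`, each mixin in the chain runs once, so grouping the `Has*` mixins or appending the new one at the end yields the same set of defaults in this sketch.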
                   maxDepth=5, maxBins=32, minInstancesPerNode=1, minInfoGain=0.0,
                   maxMemoryInMB=256, cacheNodeIds=False, checkpointInterval=10, impurity="gini",
-                  numTrees=20, featureSubsetStrategy="auto", seed=None):
+                  numTrees=20, featureSubsetStrategy="auto", seed=None, subsamplingRate=1.0):
Add some doc string tests?
Generally Spark avoids having too many verbose tests in Python docstrings, so I think we are OK here.
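For readers wondering what a lightweight check of the new argument might cover, the sketch below is one possibility. `ForestParamsSketch` is a hypothetical stand-in that copies only the last few keyword arguments from the diff; it is not the PySpark class and does not touch Spark. It shows that appending `subsamplingRate=1.0` as a trailing keyword argument leaves existing call sites unchanged.

```python
class ForestParamsSketch:
    """Hypothetical stand-in for the classifier constructor (not PySpark's)."""

    def __init__(self, *, numTrees=20, featureSubsetStrategy="auto", seed=None,
                 subsamplingRate=1.0):
        # Store the keyword arguments as plain attributes for inspection.
        self.numTrees = numTrees
        self.featureSubsetStrategy = featureSubsetStrategy
        self.seed = seed
        self.subsamplingRate = subsamplingRate


if __name__ == "__main__":
    # A call written before the change still works and picks up the default.
    existing_call = ForestParamsSketch(numTrees=10, seed=42)
    print(existing_call.subsamplingRate)

    # A new call can override the default explicitly.
    new_call = ForestParamsSketch(subsamplingRate=0.8)
    print(new_call.subsamplingRate)
```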
Test build #67558 has finished for PR 15638 at commit
Merged to master.
…orest regression and classification

## What changes were proposed in this pull request?

Add subsamplingRate to RandomForestClassifier and varianceCol to RandomForestRegressor in Python.

## How was this patch tested?

Manual tests.

Author: Felix Cheung <felixcheung_m@hotmail.com>

Closes apache#15638 from felixcheung/pyrandomforest.
What changes were proposed in this pull request?
Add subsamplingRate to RandomForestClassifier.
Add varianceCol to RandomForestRegressor.
Both changes are in the Python API.
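As a rough illustration of what `subsamplingRate` controls (the fraction of the training data used to learn each tree), the sketch below draws an independent sample of rows per tree. It is plain Python with hypothetical helper names (`subsample_rows`, `samples_per_tree`), and it is an assumption-laden simplification, not how Spark implements sampling internally.

```python
import random


def subsample_rows(rows, subsampling_rate, seed):
    """Keep each row independently with probability `subsampling_rate`."""
    if not 0.0 < subsampling_rate <= 1.0:
        raise ValueError("subsampling_rate must be in (0, 1]")
    rng = random.Random(seed)
    # rng.random() is in [0, 1), so a rate of 1.0 keeps every row.
    return [row for row in rows if rng.random() < subsampling_rate]


def samples_per_tree(rows, num_trees, subsampling_rate, seed):
    """Draw one sample per tree, deriving a distinct seed for each tree."""
    return [
        subsample_rows(rows, subsampling_rate, seed + tree_index)
        for tree_index in range(num_trees)
    ]


if __name__ == "__main__":
    data = list(range(1000))
    for sample in samples_per_tree(data, num_trees=3, subsampling_rate=0.5, seed=7):
        print(len(sample))
```

With the default rate of 1.0 every tree sees all rows in this sketch, which is why adding the parameter with that default does not change the behavior of existing code.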
How was this patch tested?
Manual tests.