How can I use RandomForestClassifier with the sparkit-learn library? #72
Comments
Hello. It is not yet released; you need to install the latest master.
hello,
It's much better with: Do you think that I can use the same parameters?
# Search best params
param = {"n_estimators": list(range(20, 300, 40)),
digit_rf = GridSearchCV(forest, param, cv=5, n_jobs=-1)
It depends on your data, but be careful: n_estimators is misleading if you are coming from scikit-learn. The model will learn n_estimators × (number of partitions) trees, because this implementation actually trains a RandomForestClassifier on each partition and then merges them. You may therefore need to reduce n_estimators depending on your dataset.
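To make the arithmetic above concrete, here is a minimal sketch in plain Python. It does not call sparkit-learn; the helper names `effective_trees` and `per_partition_estimators` and the partition count of 8 are hypothetical, chosen only for illustration. It assumes the merged forest contains exactly n_estimators trees per partition, as described above.

```python
def effective_trees(n_estimators, num_partitions):
    """Total trees in the merged forest if each partition trains n_estimators trees."""
    return n_estimators * num_partitions


def per_partition_estimators(target_total, num_partitions):
    """n_estimators to request so the merged forest has at most target_total trees.

    Always returns at least 1, since each partition needs at least one tree.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    return max(1, target_total // num_partitions)


# The n_estimators grid from the question above.
grid = list(range(20, 300, 40))

# Hypothetical partition count; substitute the real count for your data.
num_partitions = 8

for n in grid:
    print(n, "per partition ->", effective_trees(n, num_partitions), "trees in total")
```

Under these assumptions, a grid tuned for scikit-learn would be divided by the partition count first, for example `per_partition_estimators(100, num_partitions)` in place of a plain 100.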
It works, but I have a new issue with SparkGridSearchCV.
Does anyone know whether SparkGridSearchCV offers the same parameters?
Another issue with SparkGridSearchCV in yarn-client mode:
16/10/05 17:52:04 ERROR akka.ErrorMonitor: Uncaught fatal error from thread [sparkDriver-akka.actor.default-dispatcher-2] shutting down ActorSystem [sparkDriver]