Home
Hollin Wilkins edited this page Dec 26, 2016 · 55 revisions
MLeap is here to help you deploy your Spark transformers and pipelines into production, without a Spark context.
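To make the idea of running a pipeline "without a Spark context" concrete, here is a minimal, dependency-free sketch. It is not MLeap's API or its bundle format: the stage names, the JSON layout, and the helpers `export_pipeline` and `load_pipeline` are all hypothetical, chosen only to illustrate that a fitted pipeline reduces to learned parameters plus plain functions that can be serialized once and executed anywhere.

```python
import json
import math

# NOTE: hypothetical, simplified stand-ins for fitted transformers.
# This is an illustration of the concept, not MLeap's API or bundle format.

def standard_scaler(params, row):
    # Apply the mean/std learned at training time to each feature.
    return [(x - m) / s for x, m, s in zip(row, params["mean"], params["std"])]

def binarizer(params, row):
    # Map each feature to 1.0 if it exceeds the threshold, else 0.0.
    return [1.0 if x > params["threshold"] else 0.0 for x in row]

def logistic_regression(params, row):
    # Score a feature vector with fitted coefficients and intercept.
    z = params["intercept"] + sum(w * x for w, x in zip(params["coefficients"], row))
    return 1.0 / (1.0 + math.exp(-z))

# Registry of stage implementations, keyed by the (assumed) "op" name.
STAGES = {
    "standard_scaler": standard_scaler,
    "binarizer": binarizer,
    "logistic_regression": logistic_regression,
}

def export_pipeline(stages):
    # Serialize the fitted stages (op name + learned parameters) to JSON.
    return json.dumps(stages)

def load_pipeline(payload):
    # Rebuild a callable pipeline from the serialized form; no cluster needed.
    spec = json.loads(payload)

    def transform(row):
        value = row
        for stage in spec:
            value = STAGES[stage["op"]](stage["params"], value)
        return value

    return transform

# Usage: "train" elsewhere, ship the JSON, score a single row locally.
payload = export_pipeline([
    {"op": "standard_scaler", "params": {"mean": [1.0, 2.0], "std": [1.0, 2.0]}},
    {"op": "logistic_regression",
     "params": {"coefficients": [1.0, -1.0], "intercept": 0.0}},
])
score = load_pipeline(payload)
print(score([2.0, 4.0]))  # scaled row is [1.0, 1.0], so z = 0 and the score is 0.5
```

The design choice this sketch highlights is the separation of training from serving: once the parameters are exported, scoring a row needs only the lightweight runtime that understands the serialized stages.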
- Setting up a Spark 2.0 notebook with MLeap and Toree
- Setting up a PySpark 2.0 notebook with MLeap and Toree
- Setting up Scikit-Learn with MLeap
- ML Pipelines with AirBnb data - Scala
- ML Pipelines with AirBnb data - PySpark
- ML Pipelines with Lending Club data - Scala

Transformer | Spark | Scikit-Learn | TensorFlow |
---|---|---|---|
Binarizer | x | x | |
Bucketizer | x | | |
ChiSqSelector | | | |
CountVectorizer | | | |
ElementwiseProduct | x | x | |
HashingTermFrequency | x | x | |
Imputer | x | x | |
Interaction | x | x | |
LSH | | | |
MaxAbsScaler | x | | |
MinMaxScaler | x | x | |
Ngram | x | | |
Normalizer | x | | |
OneHotEncoder | x | x | |
PCA | x | x | |
QuantileDiscretizer | x | | |
PolynomialExpansion | x | x | |
ReverseStringIndexer | x | x | |
StandardScaler | x | x | |
StopWordsRemover | x | | |
StringIndexer | x | x | |
Tokenizer | x | x | |
VectorAssembler | x | x | |

Transformer | Spark | Scikit-Learn | TensorFlow |
---|---|---|---|
DecisionTreeClassifier | x | x | |
GradientBoostedTreeClassifier | x | | |
LogisticRegression | x | x | |
LogisticRegressionCv | x | x | |
NaiveBayesClassifier | x | | |
OneVsRest | x | | |
RandomForestClassifier | x | x | |
SupportVectorMachines | x | x | |
MultiLayerPerceptron | x | | |

Transformer | Spark | Scikit-Learn | TensorFlow |
---|---|---|---|
AFTSurvivalRegression | | | |
DecisionTreeRegression | x | x | |
GeneralizedLinearRegression | | | |
GradientBoostedTreeRegression | x | | |
IsotonicRegression | | | |
LinearRegression | x | x | |
RandomForestRegression | x | x | |

Transformer | Spark | Scikit-Learn | TensorFlow |
---|---|---|---|
BisectingKMeans | | | |
GaussianMixtureModel | x | | |
KMeans | x | | |
LDA | | | |
- CholeskyDecomposition