Skip to content
Hollin Wilkins edited this page Dec 26, 2016 · 55 revisions

MLeap is here to help you deploy your Spark transformers and pipelines into production, without a Spark context.

Tutorials

Demos

Supported Transformers

Features

Transformer Spark Scikit-Learn TensorFlow
Binarizer x x
Bucketizer x
ChiSqSelector
CountVectorizer
ElementwiseProduct x x
HashingTermFrequency x x
Imputer x x
Interaction x x
LSH
MaxAbsScaler x
MinMaxScaler x x
Ngram x
Normalizer x
OneHotEncoder x x
PCA x x
QuantileDiscretizer x
PolynomialExpansion x x
ReverseStringIndexer x x
StandardScaler x x
StopWordsRemover x
StringIndexer x x
Tokenizer x x
VectorAssembler x x

Classification

Transformer Spark Scikit-Learn TensorFlow
DecisionTreeClassifier x x
GradientBoostedTreeClassifier x
LogisticRegression x x
LogisticRegressionCv x x
NaiveBayesClassifier x
OneVsRest x
RandomForestClassifier x x
SupportVectorMachines x x
MultiLayerPerceptron x

Regression

Transformer Spark Scikit-Learn TensorFlow
AFTSurvivalRegression
DecisionTreeRegression x x
GeneralizedLinearRegression
GradientBoostedTreeRegression x
IsotonicRegression
LinearRegression x x
RandomForestRegression x x

Clustering

Transformer Spark Scikit-Learn TensorFlow
BisectingKMeans
GaussianMixtureModel x
KMeans x
LDA

Linear Algebra

  • CholeskyDecomposition