diff --git a/Cats.html b/Cats.html index c9283316..2a88e5bd 100644 --- a/Cats.html +++ b/Cats.html @@ -166,7 +166,7 @@
As with Job
, note that nothing has been run yet. The effect has been properly suspended. To
run our program, we must first supply the SparkSession
to the ReaderT
layer and then
@@ -193,7 +193,7 @@
And an instance of a TypedDataset
:
We can be less verbose using the Injection.apply
function:
import frameless._
@@ -180,7 +180,7 @@ cal.setTime(new java.util.Date(l))
cal
})
-// calendarToLongInjection: Injection[Calendar, Long] = frameless.Injection$$anon$1@5ad19282
+Now we can create our TypedDataset
:
val personDS = TypedDataset.create(people)
// personDS: TypedDataset[Person] = [age: int, birthday: bigint]
@@ -214,7 +214,7 @@ And now we can create our TypedDataset
:
val personDS = TypedDataset.create(people)
// personDS: TypedDataset[Person] = [age: int, gender: int]
diff --git a/Job.html b/Job.html
index 465c6f69..a36ecc7a 100644
--- a/Job.html
+++ b/Job.html
@@ -156,7 +156,7 @@ Now we can execute this new job by specifying a group-id and a description. This allows the programmer to see this information on the Spark UI and help track, say, performance issues.
diff --git a/TypedDatasetVsSparkDataset.html b/TypedDatasetVsSparkDataset.html index bbfc8428..fa947c75 100644 --- a/TypedDatasetVsSparkDataset.html +++ b/TypedDatasetVsSparkDataset.html @@ -159,9 +159,9 @@The value ds
holds the content of the initialDs
read from a parquet file.
diff --git a/TypedEncoder.html b/TypedEncoder.html
index c62ac83a..bfce99c1 100644
--- a/TypedEncoder.html
+++ b/TypedEncoder.html
@@ -212,7 +212,7 @@
But any non-encodable in the case class hierarchy will be detected at compile time:
case class BarDate(d: Double, s: String, t: java.util.Calendar)
case class FooDate(i: Int, b: BarDate)
diff --git a/TypedML.html b/TypedML.html
index 15059060..23058784 100644
--- a/TypedML.html
+++ b/TypedML.html
@@ -176,7 +176,7 @@ case class RFInputs(price: Double, features: Vector)
val rf = TypedRandomForestRegressor[RFInputs]
-// rf: TypedRandomForestRegressor[RFInputs] = frameless.ml.regression.TypedRandomForestRegressor@49c125ff
+// rf: TypedRandomForestRegressor[RFInputs] = frameless.ml.regression.TypedRandomForestRegressor@4197f8a4
val model = rf.fit(trainingDataWithFeatures).run()
-// model: AppendTransformer[RFInputs, TypedRandomForestRegressor.Outputs, org.apache.spark.ml.regression.RandomForestRegressionModel] = frameless.ml.TypedEstimator$$anon$1@2a38506d
+TypedRandomForestRegressor[RFInputs]
compiles only if RFInputs
contains only one field of type Double (the label) and one field of type Vector (the features):
case class WrongRFInputs(labelOfWrongType: String, features: Vector)
@@ -281,7 +281,7 @@ case class RFInputs(cityIndexed: Double, features: Vector)
val rf = TypedRandomForestClassifier[RFInputs]
-// rf: TypedRandomForestClassifier[RFInputs] = frameless.ml.classification.TypedRandomForestClassifier@1919d203
+// rf: TypedRandomForestClassifier[RFInputs] = frameless.ml.classification.TypedRandomForestClassifier@6763ffd9
val model = rf.fit(indexedData).run()
-// model: AppendTransformer[RFInputs, TypedRandomForestClassifier.Outputs, org.apache.spark.ml.classification.RandomForestClassificationModel] = frameless.ml.TypedEstimator$$anon$1@14ffd6cb
+We now want to predict city
for testData
using the previously trained model. Like the Spark ML API,
@@ -342,7 +342,7 @@