[SPARK-15643][DOC][ML] Add breaking changes to ML migration guide #13924

MLnick · 2016-06-27T13:22:16Z

This PR adds the breaking changes from SPARK-14810 to the migration guide.

How was this patch tested?

Built docs locally.

MLnick · 2016-06-27T13:22:42Z

Will be merged once #13378 is merged.

SparkQA · 2016-06-27T13:43:55Z

Test build #61303 has finished for PR 13924 at commit 28e0412.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

jkbradley · 2016-06-28T18:55:26Z

I just merged #13378

MLnick · 2016-06-28T20:30:42Z

@yanboliang @jkbradley @mengxr updated.

SparkQA · 2016-06-28T20:53:52Z

Test build #61408 has finished for PR 13924 at commit 6ef09a3.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

jkbradley · 2016-06-29T00:08:20Z

docs/mllib-guide.md

+
+**Linear algebra classes for DataFrame-based APIs**
+
+Spark's linear algebra dependencies were moved to a new project, `spark-mllib-local` 


Should be "mllib-local" (no "spark-")

jkbradley · 2016-06-29T00:08:46Z

Done with review pass. Thanks for the PR!

MLnick · 2016-06-29T10:56:12Z

docs/mllib-guide.md

+
+# convert DataFrame columns
+convertedVecDF = MLUtils.convertVectorColumnsToML(vecDF)
+convertedMatrxDF = MLUtils.convertMatrixColumnsToML(matrixDF)


Note, it looks like we don't have single instance conversion methods asML / fromML in Python linalg classes (I commented on SPARK-15944.

Not sure if this is intended or we just missed them. One can do newVec = Vectors.dense(oldVec) (or vice versa for sparse) in Python directly, so if that is the expected way to do things I can add that here.

That may have just been overlooked, but that's a good point that there is already a decent way to do the conversion. Could you please just note that way here?

@jkbradley Ah sorry - I mispoke. It happens to work for dense vectors because it effectively calls np.array(DenseVector), but not for sparse. Workaround is fairly ugly: mlSV = NewVectors.sparse(mllibSV.size, zip(mllibSV.indices, mllibSV.values)), or something similar.

I'd say we should have some convenience methods like in Scala/Java?

Created SPARK-16328 and #13997.

SparkQA · 2016-06-29T11:18:47Z

Test build #61464 has finished for PR 13924 at commit ac49f31.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

jkbradley · 2016-06-29T20:50:47Z

The changes look good, so just the Python item remains. Thanks!

MLnick · 2016-06-30T14:01:53Z

@jkbradley updated Python example assuming #13997 will get merged - refer #13924 (comment).

SparkQA · 2016-06-30T14:25:09Z

Test build #61545 has finished for PR 13924 at commit c2ce7cd.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-06-30T14:29:55Z

Test build #61546 has finished for PR 13924 at commit 919bfe9.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

jkbradley · 2016-07-01T00:54:49Z

LGTM
Merging with master and branch-2.0 now that #13997 has been merged
Thank you!

This PR adds the breaking changes from [SPARK-14810](https://issues.apache.org/jira/browse/SPARK-14810) to the migration guide. ## How was this patch tested? Built docs locally. Author: Nick Pentreath <nickp@za.ibm.com> Closes #13924 from MLnick/SPARK-15643-migration-guide. (cherry picked from commit 4a981dc) Signed-off-by: Joseph K. Bradley <joseph@databricks.com>

MLnick mentioned this pull request Jun 27, 2016

[SPARK-15643] [Doc] [ML] Update spark.ml and spark.mllib migration guide from 1.6 to 2.0 #13378

Closed

Nick Pentreath added 3 commits June 28, 2016 21:11

Add breaking changes to ML migration guide

80872b3

initial work

177a9e0

Update guide for vector/matrix conversion and clean up

6ef09a3

MLnick force-pushed the SPARK-15643-migration-guide branch from 28e0412 to 6ef09a3 Compare June 28, 2016 20:29

MLnick changed the title ~~[WIP][SPARK-15643][DOC][ML] Add breaking changes to ML migration guide~~ [SPARK-15643][DOC][ML] Add breaking changes to ML migration guide Jun 28, 2016

jkbradley reviewed Jun 29, 2016
View reviewed changes

review comment and add single instance conversion details

ac49f31

MLnick reviewed Jun 29, 2016
View reviewed changes

Nick Pentreath added 3 commits June 30, 2016 15:59

Add Python asML example code

c48b894

fix comment

91fd24c

Fix java ;

c2ce7cd

Fix Java method invoc

919bfe9

asfgit closed this in 4a981dc Jul 1, 2016


		Linear algebra classes for DataFrame-based APIs

		Spark's linear algebra dependencies were moved to a new project, `spark-mllib-local`

[SPARK-15643][DOC][ML] Add breaking changes to ML migration guide #13924

[SPARK-15643][DOC][ML] Add breaking changes to ML migration guide #13924

Uh oh!

Conversation

MLnick commented Jun 27, 2016

How was this patch tested?

Uh oh!

MLnick commented Jun 27, 2016

Uh oh!

SparkQA commented Jun 27, 2016

Uh oh!

jkbradley commented Jun 28, 2016

Uh oh!

MLnick commented Jun 28, 2016

Uh oh!

SparkQA commented Jun 28, 2016

Uh oh!

jkbradley Jun 29, 2016

Choose a reason for hiding this comment

Uh oh!

jkbradley commented Jun 29, 2016

Uh oh!

MLnick Jun 29, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jkbradley Jun 29, 2016

Choose a reason for hiding this comment

Uh oh!

MLnick Jun 30, 2016

Choose a reason for hiding this comment

Uh oh!

MLnick Jun 30, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Jun 29, 2016

Uh oh!

jkbradley commented Jun 29, 2016

Uh oh!

MLnick commented Jun 30, 2016

Uh oh!

SparkQA commented Jun 30, 2016

Uh oh!

SparkQA commented Jun 30, 2016

Uh oh!

jkbradley commented Jul 1, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

MLnick Jun 29, 2016 •

edited

Loading

MLnick Jun 30, 2016 •

edited

Loading