[WIP][SPARK-1486][MLlib] Multi Model Training with Gradient Descent #2451
Conversation
QA tests have started for PR 2451 at commit
QA tests have finished for PR 2451 at commit
With some guidance, I could help you with the docs.
I think this segment merits a one-line explanation.
These are related to #2294. I can add explanations there; I realize the math is hard to understand.
@anantasty: If you could look through the code and mark the places where you think "What the heck is going on here?", it would be easier for me to write up proper comments. I'm going to add a lot today, and I can incorporate yours as well. Thanks!
Do we really need sparse versions of rand and randn? It should not be much more expensive to use the dense versions and then convert to a sparse matrix (I figure < 2x the cost). I cannot think of use cases for these either, except unit testing.
They're nice functions to have. They will be helpful for people who want to do random projections.
Sorry, I did not see the "density" argument. Sounds OK to me (but is there a use case?)
No use case in MLlib yet. Randomized SVD for big matrices (distributed) may make use of this.
OK, sounds fine then.
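For context, a sparse rand with a density argument could look roughly like the following sketch, which builds the CSC arrays (colPtrs, rowIndices, values) that MLlib's SparseMatrix uses instead of densifying first. The name, signature, and use of scala.util.Random are illustrative only, not this PR's actual code:

```scala
import scala.collection.mutable.ArrayBuffer
import scala.util.Random

// Sketch of a sparse `rand` with a `density` argument; names and signature are hypothetical.
def sprand(numRows: Int, numCols: Int, density: Double, rng: Random)
  : (Array[Int], Array[Int], Array[Double]) = {
  val colPtrs = new Array[Int](numCols + 1)
  val rowIndices = ArrayBuffer.empty[Int]
  val values = ArrayBuffer.empty[Double]
  var j = 0
  while (j < numCols) {
    var i = 0
    while (i < numRows) {
      // Keep each entry with probability `density`.
      if (rng.nextDouble() < density) {
        rowIndices += i
        values += rng.nextDouble()
      }
      i += 1
    }
    colPtrs(j + 1) = values.length
    j += 1
  }
  (colPtrs, rowIndices.toArray, values.toArray)
}
```

Generating entries with probability `density` touches only the surviving non-zeros, rather than materializing a dense matrix and converting it afterward.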
This line is a good reason to implement this in DenseMatrix: you could avoid the expensive index computation (a multiplication per element) and just iterate with a running count.
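A rough sketch of that idea, assuming a column-major DenseMatrix backed by a flat values array (the method name and signature are hypothetical):

```scala
// Hypothetical sketch: update every element of a column-major dense matrix in place,
// using a running offset rather than recomputing `j * numRows + i` for each element.
def mapInPlace(values: Array[Double], numRows: Int, numCols: Int)(f: Double => Double): Unit = {
  var offset = 0
  var j = 0
  while (j < numCols) {
    var i = 0
    while (i < numRows) {
      values(offset) = f(values(offset)) // linear scan, no per-element index multiplication
      offset += 1
      i += 1
    }
    j += 1
  }
}
```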
Add doc: "We will concatenate results (weights) to finalWeights as we iterate." (or something like that)
@brkyvz Let's try to split this PR into small ones. For example, functions like factory methods for sparse matrices should not be included in this PR. We want to keep the vector and matrix classes in MLlib simple and let users use Breeze for linear algebra operations. If Breeze has performance issues, maybe we should contribute the optimization to Breeze to centralize the effort on single-machine linear algebra computation.
miniBatchSize is inexact. We could avoid the initial count() and instead aggregate the minibatch size during the treeAggregate.
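A sketch of that alternative, assuming Breeze vectors and a per-example gradient helper (the names and tuple shape are illustrative, not this PR's code):

```scala
import breeze.linalg.{DenseVector => BDV}
import org.apache.spark.rdd.RDD

// Hypothetical sketch: aggregate (gradient sum, loss sum, mini-batch size) in a single
// treeAggregate over the sampled data, so no separate count() is needed and the
// reported miniBatchSize is exact. `gradient(w, x, y)` is an assumed helper returning
// the per-example (gradient, loss).
def aggregateMiniBatch(
    sampled: RDD[(Double, BDV[Double])],
    weights: BDV[Double],
    gradient: (BDV[Double], BDV[Double], Double) => (BDV[Double], Double))
  : (BDV[Double], Double, Long) = {
  sampled.treeAggregate((BDV.zeros[Double](weights.length), 0.0, 0L))(
    seqOp = { case ((gradSum, lossSum, count), (label, features)) =>
      val (g, l) = gradient(weights, features, label)
      (gradSum + g, lossSum + l, count + 1L)
    },
    combOp = { case ((g1, l1, c1), (g2, l2, c2)) =>
      (g1 + g2, l1 + l2, c1 + c2)
    })
}
```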
@brkyvz I've made a rough pass, and have listed all of my comments. I can make future passes as needed. Lots of work & it will be great to have!
Closing this PR, as a lot of the functionality has changed.
Note: This is still a work in progress.
This is the first of the pull requests to support multi-model training in MLlib. It batches examples and trains multiple models, with different regularization parameters and step sizes, all at once using matrix-matrix multiplication. It uses native BLAS when the data matrix is dense, and uses sparse matrices as much as possible, for both better memory utilization and performance (I will post performance results in the comments).
This is a HUGE pull request, so I'm posting it now, before it is finished, so that reviewers can comment and make suggestions along the way. The docs still need to be updated, and the code can be cleaned up somewhat for ease of understanding.
Most of the PR consists of adding additional Local Matrix operations for the calculation of gradients and losses.
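To illustrate the core idea with a minimal Breeze sketch for a squared-error loss (not this PR's actual API, which also handles sparse data and native BLAS): stacking the k models' weight vectors as columns of a single matrix lets a pair of matrix-matrix multiplications produce the gradients of all models over a batch at once.

```scala
import breeze.linalg.{DenseMatrix => BDM, DenseVector => BDV}

// Hypothetical sketch: least-squares gradients for k models in one shot.
// X is an n x numFeatures batch, y holds the n labels, and W is numFeatures x k
// with one weight column per (stepSize, regParam) combination.
def multiModelGradients(X: BDM[Double], y: BDV[Double], W: BDM[Double]): BDM[Double] = {
  val predictions = X * W                      // n x k: one prediction column per model
  val labels = y * BDV.ones[Double](W.cols).t  // n x k: labels repeated for each model
  val residuals = predictions - labels
  (X.t * residuals) * (1.0 / X.rows)           // numFeatures x k gradient matrix
}
```

A single BLAS-backed multiply over the batch then amortizes the per-example overhead across all k models, which is what makes training the whole grid of hyperparameters together attractive.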