[SPARK-15740] [MLLIB] Word2VecSuite "big model load / save" caused OOM in maven jenkins builds #13509

tmnd1991 · 2016-06-04T08:42:21Z

What changes were proposed in this pull request?

The "test big model load / save" test in Word2VecSuite has recently started failing with OOM errors.
We therefore decided to make the partitioning adaptive, so that it no longer assumes the Spark default for the "spark.kryoserializer.buffer.max" conf, and to test it with a small buffer size, which triggers partitioning without allocating too much memory for the test.
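
To illustrate the idea, here is a minimal sketch of how a partition count could adapt to the buffer limit. The helper name `partitionsFor` and the rule "integer division plus one" are assumptions made for this example; the patch itself is not shown in this conversation.

```scala
object AdaptivePartitioning {
  // Hypothetical helper (not taken from the patch): split a model of roughly
  // approxSizeBytes into enough partitions that each one fits under the
  // serializer buffer limit, assuming the data is spread evenly.
  def partitionsFor(approxSizeBytes: Long, bufferMaxBytes: Long): Int = {
    require(approxSizeBytes >= 0, "size must be non-negative")
    require(bufferMaxBytes > 0, "buffer limit must be positive")
    (approxSizeBytes / bufferMaxBytes + 1).toInt
  }

  def main(args: Array[String]): Unit = {
    val mib = 1024L * 1024L
    // With a 64 MiB limit, a tiny model stays in a single partition ...
    println(partitionsFor(1024L, 64 * mib))     // 1
    // ... and only a very large model is split.
    println(partitionsFor(200 * mib, 64 * mib)) // 4
    // With a deliberately small limit, the same tiny model is split.
    println(partitionsFor(1024L, 100L))         // 11
  }
}
```

Under this assumed rule, a test that relies on a 64 MiB limit has to build a model larger than 64 MiB before any partitioning happens, whereas lowering the limit exercises the same code path with a model of about a kilobyte.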

How was this patch tested?

It was tested running the following unit test:
org.apache.spark.mllib.feature.Word2VecSuite


srowen · 2016-06-04T12:50:54Z

(Fix the title please) https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark

tmnd1991 · 2016-06-04T13:11:00Z

I noticed a Scala style error; please wait for the new commit before triggering a Jenkins build.

tmnd1991 · 2016-06-11T09:47:38Z

Can anyone verify this?

rxin · 2016-06-15T21:22:24Z

I triggered multiple test runs.

SparkQA · 2016-06-15T22:10:07Z

Test build #3112 has finished for PR 13509 at commit dfcd850.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-06-15T22:10:46Z

Test build #3113 has finished for PR 13509 at commit dfcd850.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-06-15T22:11:28Z

Test build #3111 has finished for PR 13509 at commit dfcd850.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

tmnd1991 · 2016-06-15T22:23:03Z

The only thing I don't like is that "64m" is hard-coded, but I couldn't find where the default Spark confs are stored!

jkbradley · 2016-06-21T19:52:22Z

mllib/src/test/scala/org/apache/spark/mllib/feature/Word2VecSuite.scala

+    // est. size of this model, given the formula:
+    // (floatSize * vectorSize + 15) * numWords
+    // (4 * 10 + 15) * 10 = 550
+    // therefore it should generate 12 partitions


"12 partitions" --> "multiple partitions" (The exact number isn't important.)

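The arithmetic in the quoted comment can be checked with a short sketch. The object and method names below are hypothetical, and reading the constant 15 as a per-word allowance in bytes is an assumption; only the formula itself comes from the diff.

```scala
object ModelSizeEstimate {
  val FloatSizeBytes = 4L  // size of one Float
  val PerWordExtra = 15L   // constant from the quoted formula (meaning assumed)

  // (floatSize * vectorSize + 15) * numWords, as written in the test comment
  def approxSizeBytes(vectorSize: Int, numWords: Int): Long =
    (FloatSizeBytes * vectorSize + PerWordExtra) * numWords

  def main(args: Array[String]): Unit = {
    // The test model: 10 words with vectors of length 10
    println(approxSizeBytes(vectorSize = 10, numWords = 10)) // 550
  }
}
```

If the partition count were derived as the estimated size divided by the buffer limit plus one, any limit of 550 bytes or less would yield more than one partition for this model. The buffer value used in the test is not shown in this excerpt, which is consistent with the suggestion to say "multiple partitions" rather than an exact number.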
jkbradley · 2016-06-21T19:55:31Z

I don't think you can access the default confs in this case. The class KryoSerializer seems to store those privately.

tmnd1991 · 2016-06-24T19:37:42Z

I corrected the style errors you pointed out. If you say I cannot retrieve default values, I will leave the 64m hard coded that way.

jkbradley · 2016-07-05T23:45:40Z

I verified locally that the test creates a model file with multiple partitions, so LGTM

I'll merge once tests run again.

Thanks!

SparkQA · 2016-07-06T01:32:25Z

Test build #3164 has finished for PR 13509 at commit 909b6e1.

This patch fails Spark unit tests.
This patch does not merge cleanly.
This patch adds no public classes.

SparkQA · 2016-07-06T06:32:39Z

Test build #3166 has finished for PR 13509 at commit 909b6e1.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

jkbradley · 2016-07-06T19:55:34Z

Merging with master and branch-2.0
Thank you!

… in maven jenkins builds

## What changes were proposed in this pull request?

"test big model load / save" in Word2VecSuite, lately resulted into OOM.
Therefore we decided to make the partitioning adaptive (not based on spark default "spark.kryoserializer.buffer.max" conf) and then testing it using a small buffer size in order to trigger partitioning without allocating too much memory for the test.

## How was this patch tested?

It was tested running the following unit test:
org.apache.spark.mllib.feature.Word2VecSuite

Author: tmnd1991 <antonio.murgia2@studio.unibo.it>

Closes #13509 from tmnd1991/SPARK-15740.

(cherry picked from commit 040f6f9)
Signed-off-by: Joseph K. Bradley <joseph@databricks.com>

Make partitioning adaptive, set low memory size for kryo buffer to tr…

d5c7668

…igger partitioning

tmnd1991 changed the title ~~SPARK-15740~~ SPARK-15740 MLLIB Jun 4, 2016

tmnd1991 changed the title ~~SPARK-15740 MLLIB~~ [SPARK-15740] [MLLIB] Word2VecSuite "big model load / save" caused OOM in maven jenkins builds Jun 4, 2016

Fix minor scala style error

dfcd850

jkbradley reviewed Jun 21, 2016

[Spark-15740] Correct style errors

909b6e1

asfgit closed this in 040f6f9 Jul 6, 2016
