
Conversation

@codlife (Contributor) commented Sep 13, 2016

What changes were proposed in this pull request?

When I use sc.makeRDD as below:

```
val data3 = sc.makeRDD(Seq())
println(data3.partitions.length)
```

I got an error:
Exception in thread "main" java.lang.IllegalArgumentException: Positive number of slices required
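
For context, the exception is raised by the slicing logic behind ParallelCollectionRDD, which rejects a non-positive slice count. A paraphrased sketch of the guard (approximate, not a verbatim copy of the Spark source):

```
// Sketch: ParallelCollectionRDD.slice rejects a non-positive slice count.
def slice[T](seq: Seq[T], numSlices: Int): Seq[Seq[T]] = {
  if (numSlices < 1) {
    throw new IllegalArgumentException("Positive number of slices required")
  }
  // ... otherwise split seq into numSlices roughly equal chunks ...
  ???
}
```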

We can fix this bug by modifying the last line to check seq.size:

```
  def makeRDD[T: ClassTag](seq: Seq[(T, Seq[String])]): RDD[T] = withScope {
    assertNotStopped()
    val indexToPrefs = seq.zipWithIndex.map(t => (t._2, t._1._2)).toMap
    new ParallelCollectionRDD[T](this, seq.map(_._1), math.max(seq.size, defaultParallelism), indexToPrefs)
  }
```
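
With this change, the repro above stops throwing; a minimal check, assuming a running SparkContext named sc:

```
// Hypothetical check after the fix: the empty Seq no longer throws,
// and the partition count falls back to sc.defaultParallelism.
val empty = sc.makeRDD(Seq[(Int, Seq[String])]())
println(empty.partitions.length)
```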

How was this patch tested?

manual tests


@codlife codlife changed the title from fg to [SPARK-17521] Error when I use sparkContext.makeRDD(Seq()) Sep 13, 2016
@codlife codlife closed this Sep 13, 2016
@codlife codlife reopened this Sep 13, 2016
The inline review discussion below was attached to this diff:

```
 assertNotStopped()
 val indexToPrefs = seq.zipWithIndex.map(t => (t._2, t._1._2)).toMap
-new ParallelCollectionRDD[T](this, seq.map(_._1), seq.size, indexToPrefs)
+new ParallelCollectionRDD[T](this, seq.map(_._1), math.max(seq.size, defaultParallelism), indexToPrefs)
```
@srowen (Member)
I would say math.max(seq.size, 1). Really this method would normally just use the provided partition count (called "numSlices" in this old API) but this one doesn't have that parameter, which is more reason it's an odd man out. Still I think the most reasonable behavior is to use at least 1 partition.
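
A sketch of that alternative (what Sean suggests here, not what was ultimately merged):

```
def makeRDD[T: ClassTag](seq: Seq[(T, Seq[String])]): RDD[T] = withScope {
  assertNotStopped()
  val indexToPrefs = seq.zipWithIndex.map(t => (t._2, t._1._2)).toMap
  // Conservative fallback: at least one partition, ignoring defaultParallelism.
  new ParallelCollectionRDD[T](this, seq.map(_._1), math.max(seq.size, 1), indexToPrefs)
}
```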

@codlife (Contributor, Author) commented Sep 13, 2016

To stay consistent with sc.parallelize, I think defaultParallelism is reasonable, so that the two calls below behave the same:

```
val rdd = sc.makeRDD(Seq())
val rdd = sc.parallelize(Seq())
```

But which one to use is your decision.
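
For reference, the relevant SparkContext signatures (as in the 2.x source; worth double-checking against the branch):

```
// parallelize and the common makeRDD overload both default to defaultParallelism...
def parallelize[T: ClassTag](seq: Seq[T], numSlices: Int = defaultParallelism): RDD[T]
def makeRDD[T: ClassTag](seq: Seq[T], numSlices: Int = defaultParallelism): RDD[T]

// ...but the location-preferences overload under discussion has no numSlices parameter.
def makeRDD[T: ClassTag](seq: Seq[(T, Seq[String])]): RDD[T]
```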

@srowen (Member)

The problem is that the default is OK because it's changeable, but here someone has no way to change it. I think it might be better to stay conservative.

Really this is such a corner case that it doesn't matter much. It only showed up for you because you specified no type on your Seq. If you had, it would have chosen the other overload which works fine.
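
Concretely, annotating the element type steers overload resolution to the numSlices variant, which handles the empty case; a small illustration, assuming a running SparkContext named sc:

```
// Per the discussion above: an untyped Seq() resolves to the
// (T, Seq[String]) location-preferences overload, which used to throw.
val bad = sc.makeRDD(Seq())

// An explicit element type selects makeRDD(seq, numSlices = defaultParallelism),
// which already handles an empty collection.
val ok = sc.makeRDD(Seq[Int]())
println(ok.partitions.length)
```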

@codlife (Contributor, Author)

OK, thanks for your explanation.

@srowen (Member) commented Sep 14, 2016

Jenkins test this please

@SparkQA commented Sep 14, 2016

Test build #65362 has finished for PR 15077 at commit f454668.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@srowen (Member) commented Sep 15, 2016

Merged to master/2.0

@asfgit asfgit closed this in 647ee05 Sep 15, 2016
asfgit pushed a commit that referenced this pull request Sep 15, 2016
## What changes were proposed in this pull request?

When I use sc.makeRDD as below:
```
val data3 = sc.makeRDD(Seq())
println(data3.partitions.length)
```
I got an error:
Exception in thread "main" java.lang.IllegalArgumentException: Positive number of slices required

We can fix this bug by modifying the last line to check seq.size:
```
  def makeRDD[T: ClassTag](seq: Seq[(T, Seq[String])]): RDD[T] = withScope {
    assertNotStopped()
    val indexToPrefs = seq.zipWithIndex.map(t => (t._2, t._1._2)).toMap
    new ParallelCollectionRDD[T](this, seq.map(_._1), math.max(seq.size, defaultParallelism), indexToPrefs)
  }
```

## How was this patch tested?

 manual tests


Author: codlife <1004910847@qq.com>
Author: codlife <wangjianfei15@otcaix.iscas.ac.cn>

Closes #15077 from codlife/master.

(cherry picked from commit 647ee05)
Signed-off-by: Sean Owen <sowen@cloudera.com>
wgtmac pushed a commit to wgtmac/spark that referenced this pull request Sep 19, 2016
zzcclp added a commit to zzcclp/spark that referenced this pull request Sep 20, 2016
