[SPARK-25479][TEST] Refactor DatasetBenchmark to use main method #22488

wangyum · 2018-09-20T09:50:08Z

What changes were proposed in this pull request?

Refactor DatasetBenchmark to use main method.
Generate benchmark result:

SPARK_GENERATE_BENCHMARK_FILES=1 build/sbt "sql/test:runMain org.apache.spark.sql.DatasetBenchmark"

How was this patch tested?

manual tests

SparkQA · 2018-09-20T10:12:20Z

Test build #96342 has finished for PR 22488 at commit 21b623a.

This patch fails to generate documentation.
This patch merges cleanly.
This patch adds no public classes.

wangyum · 2018-09-20T10:38:52Z

retest this please

SparkQA · 2018-09-20T11:01:54Z

Test build #96344 has finished for PR 22488 at commit 21b623a.

This patch fails to generate documentation.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-09-20T13:12:01Z

Test build #96354 has finished for PR 22488 at commit 152c549.

This patch fails to generate documentation.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-09-20T21:50:41Z

Test build #96375 has finished for PR 22488 at commit 51ae87e.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

# Conflicts: # sql/core/src/test/scala/org/apache/spark/sql/DatasetBenchmark.scala

SparkQA · 2018-09-21T19:15:01Z

Test build #96440 has finished for PR 22488 at commit 71dfe03.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

wangyum · 2018-09-21T23:20:53Z

retest this please

SparkQA · 2018-09-22T01:24:30Z

Test build #96461 has finished for PR 22488 at commit 71dfe03.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

wangyum · 2018-09-22T01:32:00Z

retest this please

SparkQA · 2018-09-22T05:22:55Z

Test build #96464 has finished for PR 22488 at commit 71dfe03.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

wangyum · 2018-09-24T03:49:59Z

@dongjoon-hyun I think this refactor is ready to go. Thanks.

dongjoon-hyun · 2018-09-24T04:49:33Z

sql/core/src/test/scala/org/apache/spark/sql/DatasetBenchmark.scala

+  val spark = SparkSession.builder
+    .master("local[*]")
+    .appName("Dataset benchmark")
+    .getOrCreate()


Can we move this SparkSession building part into benchmark() function and before runBenchmark("Dataset Benchmark")?

SparkQA · 2018-10-01T18:21:46Z

Test build #96819 has finished for PR 22488 at commit 7990e13.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

wangyum · 2018-10-01T22:23:04Z

retest this please

wangyum · 2018-10-01T22:48:54Z

sql/core/src/test/scala/org/apache/spark/sql/DatasetBenchmark.scala


-  def main(args: Array[String]): Unit = {
-    val spark = SparkSession.builder
+  override def getSparkSession: SparkSession = {


Need override default SparkSession as default SparkSession is:

SparkSession.builder() .master("local[1]") .appName(this.getClass.getCanonicalName) .config(SQLConf.SHUFFLE_PARTITIONS.key, 1) .config(SQLConf.AUTO_BROADCASTJOIN_THRESHOLD.key, 1) .getOrCreate()

SparkQA · 2018-10-02T02:25:59Z

Test build #96832 has finished for PR 22488 at commit 7990e13.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2018-10-02T17:50:13Z

Retest this please.

SparkQA · 2018-10-02T18:03:36Z

Test build #96864 has finished for PR 22488 at commit 7990e13.

This patch fails to build.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2018-10-02T23:41:34Z

@wangyum . Could you review and merge wangyum#13 ?

dongjoon-hyun

+1, LGTM (pending Jenkins)

dongjoon-hyun · 2018-10-03T00:31:00Z

Hi, @jiangxb1987 .
Could you review (and merge) this PR?

wangyum · 2018-10-03T01:43:46Z

Congratulation, @jiangxb1987

SparkQA · 2018-10-03T02:51:42Z

Test build #96879 has finished for PR 22488 at commit d2d0a3e.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2018-10-03T03:48:17Z

Test build #96880 has finished for PR 22488 at commit 27c6493.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2018-10-04T18:57:41Z

Merged to master.

## What changes were proposed in this pull request? Refactor `DatasetBenchmark` to use main method. Generate benchmark result: ```sh SPARK_GENERATE_BENCHMARK_FILES=1 build/sbt "sql/test:runMain org.apache.spark.sql.DatasetBenchmark" ``` ## How was this patch tested? manual tests Closes apache#22488 from wangyum/SPARK-25479. Lead-authored-by: Yuming Wang <yumwang@ebay.com> Co-authored-by: Dongjoon Hyun <dongjoon@apache.org> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

Refactor DatasetBenchmark

21b623a

Remove useless import

152c549

Fix scala doc issue

51ae87e

Merge remote-tracking branch 'upstream/master' into SPARK-25479

71dfe03

# Conflicts: # sql/core/src/test/scala/org/apache/spark/sql/DatasetBenchmark.scala

dongjoon-hyun reviewed Sep 24, 2018

View reviewed changes

wangyum added 2 commits October 1, 2018 22:49

Merge remote-tracking branch 'upstream/master' into SPARK-25479

361e9f8

merge master

7990e13

wangyum commented Oct 1, 2018

View reviewed changes

wangyum added 2 commits October 3, 2018 06:57

Merge remote-tracking branch 'upstream/master' into SPARK-25479

c0ae4ef

benchmark -> runBenchmarkSuite

d2d0a3e

Update result (#13)

27c6493

dongjoon-hyun approved these changes Oct 2, 2018

View reviewed changes

asfgit closed this in 95ae209 Oct 5, 2018

wangyum deleted the SPARK-25479 branch October 5, 2018 01:40

[SPARK-25479][TEST] Refactor DatasetBenchmark to use main method #22488

[SPARK-25479][TEST] Refactor DatasetBenchmark to use main method #22488

Uh oh!

Conversation

wangyum commented Sep 20, 2018

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

SparkQA commented Sep 20, 2018

Uh oh!

wangyum commented Sep 20, 2018

Uh oh!

SparkQA commented Sep 20, 2018

Uh oh!

SparkQA commented Sep 20, 2018

Uh oh!

SparkQA commented Sep 20, 2018

Uh oh!

SparkQA commented Sep 21, 2018

Uh oh!

wangyum commented Sep 21, 2018

Uh oh!

SparkQA commented Sep 22, 2018

Uh oh!

wangyum commented Sep 22, 2018

Uh oh!

SparkQA commented Sep 22, 2018

Uh oh!

wangyum commented Sep 24, 2018

Uh oh!

dongjoon-hyun Sep 24, 2018

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Oct 1, 2018

Uh oh!

wangyum commented Oct 1, 2018

Uh oh!

wangyum Oct 1, 2018

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Oct 2, 2018

Uh oh!

dongjoon-hyun commented Oct 2, 2018

Uh oh!

SparkQA commented Oct 2, 2018

Uh oh!

dongjoon-hyun commented Oct 2, 2018

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun commented Oct 3, 2018

Uh oh!

wangyum commented Oct 3, 2018

Uh oh!

SparkQA commented Oct 3, 2018

Uh oh!

SparkQA commented Oct 3, 2018

Uh oh!

dongjoon-hyun commented Oct 4, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants