-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-25479][TEST] Refactor DatasetBenchmark to use main method #22488
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #96342 has finished for PR 22488 at commit
|
|
retest this please |
|
Test build #96344 has finished for PR 22488 at commit
|
|
Test build #96354 has finished for PR 22488 at commit
|
|
Test build #96375 has finished for PR 22488 at commit
|
# Conflicts: # sql/core/src/test/scala/org/apache/spark/sql/DatasetBenchmark.scala
|
Test build #96440 has finished for PR 22488 at commit
|
|
retest this please |
|
Test build #96461 has finished for PR 22488 at commit
|
|
retest this please |
|
Test build #96464 has finished for PR 22488 at commit
|
|
@dongjoon-hyun I think this refactor is ready to go. Thanks. |
| val spark = SparkSession.builder | ||
| .master("local[*]") | ||
| .appName("Dataset benchmark") | ||
| .getOrCreate() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we move this SparkSession building part into benchmark() function and before runBenchmark("Dataset Benchmark")?
|
Test build #96819 has finished for PR 22488 at commit
|
|
retest this please |
|
|
||
| def main(args: Array[String]): Unit = { | ||
| val spark = SparkSession.builder | ||
| override def getSparkSession: SparkSession = { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Need override default SparkSession as default SparkSession is:
SparkSession.builder()
.master("local[1]")
.appName(this.getClass.getCanonicalName)
.config(SQLConf.SHUFFLE_PARTITIONS.key, 1)
.config(SQLConf.AUTO_BROADCASTJOIN_THRESHOLD.key, 1)
.getOrCreate()|
Test build #96832 has finished for PR 22488 at commit
|
|
Retest this please. |
|
Test build #96864 has finished for PR 22488 at commit
|
|
@wangyum . Could you review and merge wangyum#13 ? |
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, LGTM (pending Jenkins)
|
Hi, @jiangxb1987 . |
|
Congratulation, @jiangxb1987 |
|
Test build #96879 has finished for PR 22488 at commit
|
|
Test build #96880 has finished for PR 22488 at commit
|
|
Merged to master. |
## What changes were proposed in this pull request? Refactor `DatasetBenchmark` to use main method. Generate benchmark result: ```sh SPARK_GENERATE_BENCHMARK_FILES=1 build/sbt "sql/test:runMain org.apache.spark.sql.DatasetBenchmark" ``` ## How was this patch tested? manual tests Closes apache#22488 from wangyum/SPARK-25479. Lead-authored-by: Yuming Wang <yumwang@ebay.com> Co-authored-by: Dongjoon Hyun <dongjoon@apache.org> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
What changes were proposed in this pull request?
Refactor
DatasetBenchmarkto use main method.Generate benchmark result:
SPARK_GENERATE_BENCHMARK_FILES=1 build/sbt "sql/test:runMain org.apache.spark.sql.DatasetBenchmark"How was this patch tested?
manual tests