@wangyum
Member

@wangyum wangyum commented Sep 20, 2018

What changes were proposed in this pull request?

Refactor MiscBenchmark to use a main method.
To generate the benchmark result, run:

SPARK_GENERATE_BENCHMARK_FILES=1 build/sbt "sql/test:runMain org.apache.spark.sql.execution.benchmark.MiscBenchmark"

How was this patch tested?

manual tests

(i.toString, (i + 1).toString, (i + 2).toString, (i + 3).toString)
})))).toDF("col", "arr")

df.selectExpr("*", "explode(arr) as arr_col")
Member Author

Function name should be explode.

generate stack:                     Best/Avg Time(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------
generate stack wholestage off           17179 / 17719          1.0        1024.0       1.0X
generate stack wholestage on            13674 / 14112          1.2         815.0       1.3X
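
The columns in the table above appear to be related by simple arithmetic, which makes it easier to judge how far the 1.3X figure is from the earlier result. The following is a minimal sketch in plain Scala, assuming that Rate(M/s) is derived from Per Row(ns) and that Relative is the baseline best time divided by each case's best time. The object and method names are hypothetical and are not part of Spark's benchmark classes.

```scala
// Sketch only: hypothetical helpers, not Spark API.
object BenchmarkColumns {

  // Millions of rows per second, derived from nanoseconds per row:
  // 1e9 ns/s divided by perRowNs gives rows/s; dividing by 1e6 gives M rows/s.
  def rateMPerSec(perRowNs: Double): Double = 1000.0 / perRowNs

  // Speed-up relative to the baseline case, using best times in milliseconds.
  def relative(baselineBestMs: Double, bestMs: Double): Double =
    baselineBestMs / bestMs

  def main(args: Array[String]): Unit = {
    // Values copied from the "generate stack" table above.
    println(rateMPerSec(1024.0))   // wholestage off
    println(rateMPerSec(815.0))    // wholestage on
    println(relative(17179, 13674)) // wholestage on vs. wholestage off
  }
}
```

Rounded to one decimal place, these values agree with the Rate and Relative columns reported for both rows.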
Member Author

@wangyum wangyum Sep 20, 2018

I will do a deep dive; this was 15.5X before.

Member Author

The performance decline is caused by this commit: "Disable generate codegen since it fails my workload".

Member Author

@rxin Do we have a plan to enable generate codegen?

Member

@dongjoon-hyun dongjoon-hyun Sep 26, 2018

Although this is not related to this refactoring, ping @rxin and @kiszk because @kiszk seemed to want to fix the root cause of the failure.

@rxin which operation in Generator makes your workload fail? (commit comment)

SparkQA commented Sep 20, 2018

Test build #96368 has finished for PR 22500 at commit 6252440.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

# Conflicts:
#	sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/MiscBenchmark.scala

SparkQA commented Oct 1, 2018

Test build #96820 has finished for PR 22500 at commit 58fbdd6.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun
Member

Retest this please.

SparkQA commented Oct 2, 2018

Test build #96863 has finished for PR 22500 at commit 58fbdd6.

  • This patch fails to build.
  • This patch merges cleanly.
  • This patch adds no public classes.

*/
class MiscBenchmark extends BenchmarkWithCodegen {

ignore("filter & aggregate without group") {
Member

This refactoring introduces a long function body in runBenchmarkSuite. In general, that is not a good direction.

Could you map each ignore function into an independent function and make runBenchmarkSuite() invoke a series of those functions?
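
The structure requested here can be sketched as follows. This is a minimal stand-in in plain Scala with no Spark dependency, so it is not the code that was merged. Only runBenchmarkSuite() and the case name "filter & aggregate without group" come from this thread. The object name, the second case, the method signatures, and the placeholder workloads are all hypothetical.

```scala
// Sketch only: illustrates "one method per former ignore(...) block".
// Names and bodies are hypothetical; real cases would build and time Spark queries.
object MiscBenchmarkSketch {

  // Formerly: ignore("filter & aggregate without group") { ... }
  // Placeholder workload: keep the even numbers below numRows and sum them.
  def filterAndAggregateWithoutGroup(numRows: Long): Long =
    (0L until numRows).filter(_ % 2 == 0).sum

  // A second, hypothetical case: expand each row's array into one row per
  // element (a plain-collections analogue of an explode-style generator).
  def generateExplode(numRows: Int): Int = {
    val rows = (0 until numRows).map(i => (i.toString, Seq(i, i + 1)))
    rows.flatMap { case (col, arr) => arr.map(a => (col, a)) }.size
  }

  // The suite entry point stays short: it only invokes the individual cases.
  def runBenchmarkSuite(): Unit = {
    println(s"filter & aggregate without group: ${filterAndAggregateWithoutGroup(1000L)}")
    println(s"generate explode: ${generateExplode(4)}")
  }

  def main(args: Array[String]): Unit = runBenchmarkSuite()
}
```

Keeping each case in its own method means a single benchmark can be read, changed, or temporarily commented out of runBenchmarkSuite() without touching the others.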

SparkQA commented Oct 3, 2018

Test build #96877 has finished for PR 22500 at commit f7a14e3.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Oct 3, 2018

Test build #96882 has finished for PR 22500 at commit c791249.

  • This patch fails PySpark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Member Author

wangyum commented Oct 3, 2018

retest this please

SparkQA commented Oct 3, 2018

Test build #96888 has finished for PR 22500 at commit c791249.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

Member Author

wangyum commented Oct 3, 2018

retest this please

SparkQA commented Oct 3, 2018

Test build #96889 has finished for PR 22500 at commit c791249.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun
Member

Retest this please.

SparkQA commented Oct 3, 2018

Test build #96903 has finished for PR 22500 at commit c791249.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Member Author

wangyum commented Oct 3, 2018

Retest this please.

SparkQA commented Oct 4, 2018

Test build #96910 has finished for PR 22500 at commit c791249.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Member Author

wangyum commented Oct 5, 2018

@dongjoon-hyun Is this ready to go?

@dongjoon-hyun
Member

Hi, @wangyum .

@wangyum wangyum changed the title from "[SPARK-25488][TEST] Refactor MiscBenchmark to use main method" to "[SPARK-25488][SQL][TEST] Refactor MiscBenchmark to use main method" on Oct 6, 2018

SparkQA commented Oct 6, 2018

Test build #97035 has finished for PR 22500 at commit 623998b.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Oct 6, 2018

Test build #97032 has finished for PR 22500 at commit 9514aee.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dilipbiswal
Contributor

retest this please

SparkQA commented Oct 6, 2018

Test build #97036 has finished for PR 22500 at commit 623998b.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Member

@dongjoon-hyun dongjoon-hyun left a comment

+1, LGTM. Thank you, @wangyum and @dilipbiswal .

Merged to master.

@asfgit asfgit closed this in edf4286 Oct 6, 2018
jackylee-ch pushed a commit to jackylee-ch/spark that referenced this pull request Feb 18, 2019
## What changes were proposed in this pull request?

Refactor `MiscBenchmark` to use main method.
Generate benchmark result:
```sh
SPARK_GENERATE_BENCHMARK_FILES=1 build/sbt "sql/test:runMain org.apache.spark.sql.execution.benchmark.MiscBenchmark"
```

## How was this patch tested?

manual tests

Closes apache#22500 from wangyum/SPARK-25488.

Lead-authored-by: Yuming Wang <yumwang@ebay.com>
Co-authored-by: Yuming Wang <wgyumg@gmail.com>
Co-authored-by: Dongjoon Hyun <dongjoon@apache.org>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>