[SPARK-21153] Use project instead of expand in tumbling windows #18364
Conversation
Test build #78293 has finished for PR 18364 at commit
Test build #78303 has started for PR 18364 at commit
I will retrigger this once Jenkins restarts.
test this please
Test build #78319 has started for PR 18364 at commit
Test build #78311 has finished for PR 18364 at commit
test this please
Test build #78336 has started for PR 18364 at commit
retest this please
Test build #78397 has finished for PR 18364 at commit
val window = windowExpressions.head

val metadata = window.timeColumn match {
  case a: Attribute => a.metadata

existing: There is a comment above that says "not correct"?
marmbrus left a comment:
Pretty big speed up!
val windows = Seq.tabulate(maxNumOverlapping + 1) { i =>
val windowId = Ceil((PreciseTimestamp(window.timeColumn) - window.startTime) /
  window.slideDuration)
def getWindow(i: Int, maxNumOverlapping: Int): Expression = {

I'm not sure I understand maxNumOverlapping as a name that we tabulate over. Isn't it like the overlapNumber or something?
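For context, the window-id arithmetic quoted above can be sketched in plain Scala (not Spark's Expression DSL). The object and method names below are illustrative assumptions, but the formula mirrors the quoted rule: build `maxNumOverlapping + 1` candidate windows from the window id, then filter to the ones that actually contain the timestamp.

```scala
// Plain-Scala sketch of the quoted logic: a timestamp falls into at most
// ceil(windowDuration / slideDuration) sliding windows. One extra candidate
// (maxNumOverlapping + 1) is generated and filtered, which also handles
// timestamps that sit exactly on a window boundary.
object SlidingWindowSketch {
  // All arguments in microseconds; returns the (start, end) pairs containing ts.
  def windowsFor(ts: Long, windowDuration: Long,
                 slideDuration: Long, startTime: Long): Seq[(Long, Long)] = {
    val maxNumOverlapping = math.ceil(windowDuration.toDouble / slideDuration).toInt
    val windowId = math.ceil((ts - startTime).toDouble / slideDuration).toLong
    (0 to maxNumOverlapping).map { i =>
      val start = (windowId + i - maxNumOverlapping) * slideDuration + startTime
      (start, start + windowDuration)
    }.filter { case (start, end) => start <= ts && ts < end }
  }
}
```

For a tumbling window (`windowDuration == slideDuration`) this always yields exactly one window, which is the special case this PR exploits.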
window.timeColumn < windowAttr.getField(WINDOW_END)
if (window.windowDuration == window.slideDuration) {
  val windowStruct = Alias(getWindow(0, 1), WINDOW_COL_NAME)(
    exprId = windowAttr.exprId, explicitMetadata = Some(metadata))

nit: Wrapping is off. Prefer to break at = and if you wrap args, wrap all of them.
// For backwards compatibility we add a filter to filter out nulls
val filterExpr = IsNotNull(window.timeColumn)

replacedPlan.withNewChildren(Filter(filterExpr,

Nit: wrapping, indent query plans like trees.
Actually, should we even be doing a projection here? If it's just a substitution / filter, perhaps we should just replace it inline?

I couldn't get inline replacing to work. It breaks the EventTimeWatermark tests.
replacedPlan.withNewChildren(Filter(filterExpr,
  Project(windowStruct +: child.output, child)) :: Nil)
} else {

nit: no blank line
override def dataType: DataType = LongType
case class PreciseTimestampConversion(
    child: Expression,
    fromType: DataType,

The from type should just come from the child, right?

expectsInputTypes does implicit casting at times.

Maybe we shouldn't be using it then? This is a purely internal expression?

It is purely internal, used for microsecond-precision access of Timestamps.
Test build #78540 has finished for PR 18364 at commit
retest this please
Test build #78544 has finished for PR 18364 at commit
retest this please
Test build #78547 has finished for PR 18364 at commit
LGTM. Merging to master.
## What changes were proposed in this pull request?
Time windowing in Spark currently performs an Expand + Filter, because in the general case there is no way to bound the number of windows a timestamp will fall in. However, for tumbling windows, a record is guaranteed to fall into a single bucket. In this case, doubling the number of records with Expand is wasteful, and can be improved by using a simple Project instead.
Benchmarks show that we get an order of magnitude performance improvement after this patch.
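The single-bucket claim above can be sketched with plain arithmetic. Assuming microsecond `Long` timestamps (Spark's internal representation of `TimestampType`), a tumbling window is a pure function of the record's timestamp, which is why a projection suffices; the names below are illustrative, not Spark internals.

```scala
// Sketch: with a tumbling window each timestamp maps to exactly one bucket,
// so the window can be computed directly instead of expanding each row into
// multiple candidate rows and filtering.
object TumblingWindowSketch {
  // ts, windowDuration, startTime in microseconds; returns (windowStart, windowEnd).
  def bucketOf(ts: Long, windowDuration: Long, startTime: Long): (Long, Long) = {
    // floorDiv keeps the math correct for timestamps before startTime.
    val start = Math.floorDiv(ts - startTime, windowDuration) * windowDuration + startTime
    (start, start + windowDuration)
  }
}
```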
## How was this patch tested?
Existing unit tests. Benchmarked using the following code:
```scala
import org.apache.spark.sql.functions._
spark.time {
spark.range(numRecords)
.select(from_unixtime((current_timestamp().cast("long") * 1000 + 'id / 1000) / 1000) as 'time)
.select(window('time, "10 seconds"))
.count()
}
```
Setup:
- 1 c3.2xlarge worker (8 cores)
1 B rows ran in 287 seconds after this optimization. I didn't wait for it to finish without the optimization. Shows about a 5x improvement for large numbers of records.
Author: Burak Yavuz <brkyvz@gmail.com>
Closes apache#18364 from brkyvz/opt-tumble.