-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-11973] [SQL] push filter through aggregation with alias and literals #9959
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
cc @cloud-fan too |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is confusing to read. Not clear why this is calling a utility in PushPredicateThroughProject. Move this into a better named place (similar to PredicateHelper)
|
Test build #46668 has finished for PR 9959 at commit
|
|
Test build #46701 has finished for PR 9959 at commit
|
|
Test build #46699 has finished for PR 9959 at commit
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why not move this to the existing trait PredicateHelper?
|
Test build #2116 has finished for PR 9959 at commit
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
a simplification: expr.find(_.isInstanceOf[AggregateExpression]).isDefined, but it's minor, may not worth another roundtrip.
|
LGTM |
|
Test build #46749 has finished for PR 9959 at commit
|
…terals Currently, filter can't be pushed through aggregation with alias or literals, this patch fix that. After this patch, the time of TPC-DS query 4 go down to 13 seconds from 141 seconds (10x improvements). cc nongli yhuai Author: Davies Liu <davies@databricks.com> Closes #9959 from davies/push_filter2. (cherry picked from commit 27d69a0) Signed-off-by: Davies Liu <davies.liu@gmail.com>
|
Merging into master and 1.6 branch. |
This is a followup for #9959. I added more documentation and rewrote some monadic code into simpler ifs. Author: Reynold Xin <rxin@databricks.com> Closes #9995 from rxin/SPARK-11973.
This is a followup for #9959. I added more documentation and rewrote some monadic code into simpler ifs. Author: Reynold Xin <rxin@databricks.com> Closes #9995 from rxin/SPARK-11973. (cherry picked from commit de28e4d) Signed-off-by: Reynold Xin <rxin@databricks.com>
Currently, filter can't be pushed through aggregation with alias or literals, this patch fix that.
After this patch, the time of TPC-DS query 4 go down to 13 seconds from 141 seconds (10x improvements).
cc @nongli @yhuai