ARROW-9678: [Rust] [DataFusion] Improve projection push down to remove unused columns #7919
FYI @andygrove and @alamb Again, this is useful mostly to the |
I reviewed this PR carefully and I think the logic looks good -- I actually tried running the new test cases without the code changes which was instructive.
I had only minor suggestions.
This commit makes the projection optimizer remove any projection or aggregation that is not used down the plan.
@andygrove, is there anything else needed to move this forward?
I didn't get a chance to review this one yet because it was taking me a while to understand the new logic, but I'm happy to approve based on @alamb's review.
I do have a nagging concern that the logic may not work if the query plan contains aliases that rename columns, but we can address that as a follow-up if/when that becomes an issue.
Thank you, @andygrove! I captured that concern in ARROW-9830, with issue type "Test". :)
…e unused columns

This PR makes the projection optimizer remove any projection or aggregation that is not used down the plan, thus improving speed and convenience.

This is worked on top of apache#7879 and only the last commit is specific to this PR.

Closes apache#7919 from jorgecarleitao/projection

Authored-by: Jorge C. Leitao <jorgecarleitao@gmail.com>
Signed-off-by: Andy Grove <andygrove73@gmail.com>
This PR makes the projection optimizer remove any projection or aggregation that is not used down the plan, thus improving speed and convenience.
This builds on top of #7879; only the last commit is specific to this PR.
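
For readers unfamiliar with the idea, here is a minimal sketch of projection push down on a toy plan. It is not the code in this PR: `Plan`, `output_columns`, `prune` and `optimize` are hypothetical names invented for illustration and do not mirror DataFusion's `LogicalPlan` API. The sketch assumes columns are identified by name only, so aliases that rename columns are not modeled.

```rust
use std::collections::BTreeSet;

/// Toy logical plan (hypothetical, NOT DataFusion's `LogicalPlan`).
#[derive(Debug, Clone, PartialEq)]
enum Plan {
    /// Reads the listed columns from a table.
    Scan { table: String, columns: Vec<String> },
    /// Selects a subset of the input's columns by name.
    Projection { columns: Vec<String>, input: Box<Plan> },
    /// Groups by `group_by`; each aggregate is (output name, input column).
    Aggregate {
        group_by: Vec<String>,
        aggregates: Vec<(String, String)>,
        input: Box<Plan>,
    },
}

/// Names of the columns a node produces.
fn output_columns(plan: &Plan) -> Vec<String> {
    match plan {
        Plan::Scan { columns, .. } => columns.clone(),
        Plan::Projection { columns, .. } => columns.clone(),
        Plan::Aggregate { group_by, aggregates, .. } => group_by
            .iter()
            .cloned()
            .chain(aggregates.iter().map(|agg| agg.0.clone()))
            .collect(),
    }
}

/// Rebuilds `plan` so that each node only produces columns in `required`,
/// then recurses with the set of columns that node needs from its input.
fn prune(plan: &Plan, required: &BTreeSet<String>) -> Plan {
    match plan {
        Plan::Scan { table, columns } => Plan::Scan {
            table: table.clone(),
            columns: columns
                .iter()
                .filter(|c| required.contains(*c))
                .cloned()
                .collect(),
        },
        Plan::Projection { columns, input } => {
            let kept: Vec<String> = columns
                .iter()
                .filter(|c| required.contains(*c))
                .cloned()
                .collect();
            let needed: BTreeSet<String> = kept.iter().cloned().collect();
            Plan::Projection { columns: kept, input: Box::new(prune(input, &needed)) }
        }
        Plan::Aggregate { group_by, aggregates, input } => {
            // Drop aggregate expressions nobody above asks for.
            let kept: Vec<(String, String)> = aggregates
                .iter()
                .filter(|agg| required.contains(&agg.0))
                .cloned()
                .collect();
            // Grouping columns are always needed to preserve the grouping.
            let mut needed: BTreeSet<String> = group_by.iter().cloned().collect();
            needed.extend(kept.iter().map(|agg| agg.1.clone()));
            Plan::Aggregate {
                group_by: group_by.clone(),
                aggregates: kept,
                input: Box::new(prune(input, &needed)),
            }
        }
    }
}

/// Entry point: the root's own output columns are, by definition, required.
fn optimize(plan: &Plan) -> Plan {
    let required: BTreeSet<String> = output_columns(plan).into_iter().collect();
    prune(plan, &required)
}
```

In this sketch, a projection of `a, sum_b` over an aggregate that also computes `max_c` over a scan of `a, b, c, d` would be rewritten so that the aggregate no longer computes `max_c` and the scan reads only `a` and `b`. The required-column set is passed top-down, which is one common way to structure such a pass; the PR's actual implementation may differ.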