Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Improvement]: Use partition filter to speed up optimizing plan #2417

Closed
2 of 3 tasks
Tracked by #2176
HuangFru opened this issue Dec 8, 2023 · 3 comments · Fixed by #2436
Closed
2 of 3 tasks
Tracked by #2176

[Improvement]: Use partition filter to speed up optimizing plan #2417

HuangFru opened this issue Dec 8, 2023 · 3 comments · Fixed by #2436

Comments

@HuangFru
Copy link
Contributor

HuangFru commented Dec 8, 2023

Search before asking

  • I have searched in the issues and found no similar issues.

What would you like to be improved?

AMS currently uses PartitionFilter to filter unnecessary files when planning the table before optimizing. But this only filters files and cannot speed up the plan because all the files have been planned. We should consider supporting partition filters on the scan.
image

How should we improve?

Support partition filter on scan to speed up plan.

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Subtasks

No response

Code of Conduct

@huyuanfeng2018
Copy link
Contributor

huyuanfeng2018 commented Dec 11, 2023

I discussed this issue with @zhoujinsong before, let’s see if he has any opinions. The partition filter conditions cannot be passed in iceberg scan, but the manifest list contains the maximum and minimum values ​​​​of the saved partitions. can use this as an entry point to obtain it maybe.

@zhoujinsong
Copy link
Contributor

zhoujinsong commented Dec 11, 2023

Yes, @wangtaohz and @HuangFru shared a feasible transformation method to me today. I think this issue is related to that. I am looking forward to the implementation of this improvement.

BTW, this issue is related to #1842.

@wangtaohz
Copy link
Contributor

Yes, @wangtaohz and @HuangFru shared a feasible transformation method to me today. I think this issue is related to that. I am looking forward to the implementation of this improvement.

We can transform partition data into a data filter expression, @HuangFru will submit a PR to support it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants