Adaptive Parquet Predicate Pushdown #5523

Open
tustvold opened this issue Mar 17, 2024 · 6 comments
Labels: enhancement (Any new improvement worthy of an entry in the changelog)

Comments

@tustvold
Contributor

Is your feature request related to a problem or challenge? Please describe what you are trying to do.

Currently, RowSelection stores a list of RowSelector values. This is optimised for large runs of skipped or selected rows, allowing the selection to be pushed down to the underlying decoding machinery. Whilst this works very well for skipping data based on the page index, where the selections are necessarily in the thousands of rows, it can degrade in the presence of more granular predicate evaluation, e.g. as performed by ArrowPredicate.
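
For illustration, a minimal standalone sketch of the trade-off (the fields mirror the parquet crate's RowSelector, but this is not the actual definition):

```rust
// Standalone illustration; the real type lives in
// parquet::arrow::arrow_reader::RowSelector.
struct RowSelector {
    row_count: usize,
    skip: bool, // true = skip these rows, false = read them
}

fn main() {
    // Page-index pruning: two large runs, trivially cheap to represent
    // and easy to push down to the decoder.
    let coarse = vec![
        RowSelector { row_count: 100_000, skip: true },
        RowSelector { row_count: 100_000, skip: false },
    ];

    // A granular predicate keeping every other row: one selector per row,
    // so the selection is as large as the data it describes.
    let fine: Vec<RowSelector> = (0..200_000)
        .map(|i| RowSelector { row_count: 1, skip: i % 2 == 0 })
        .collect();

    println!("coarse: {} selectors, fine: {} selectors", coarse.len(), fine.len());
}
```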

Describe the solution you'd like

In a similar vein to #1248, we should have different strategies based on the selectivity of the predicate. In particular, I would like RowSelection to switch between a RowSelector approach that is pushed down to the underlying readers, and a late evaluation approach where it stores a BooleanBuffer that is applied to the columns after the fact.
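
A rough sketch of what such a switch could look like (RowSelectionStrategy and the run-length threshold are illustrative assumptions, not existing APIs):

```rust
use arrow_buffer::{BooleanBuffer, BooleanBufferBuilder};

struct RowSelector {
    row_count: usize,
    skip: bool,
}

// Hypothetical adaptive representation: run-length selectors that can be
// pushed down to the decoder, or a bitmap applied to decoded columns.
enum RowSelectionStrategy {
    Selectors(Vec<RowSelector>), // pushed down: decoder skips whole runs
    Mask(BooleanBuffer),         // late evaluation: filter after decoding
}

impl RowSelectionStrategy {
    // One possible policy: fall back to a bitmap once the average run
    // length drops below a threshold (8 is an arbitrary placeholder).
    fn new(selectors: Vec<RowSelector>) -> Self {
        let total: usize = selectors.iter().map(|s| s.row_count).sum();
        if selectors.is_empty() || total / selectors.len() >= 8 {
            return RowSelectionStrategy::Selectors(selectors);
        }
        let mut builder = BooleanBufferBuilder::new(total);
        for s in &selectors {
            builder.append_n(s.row_count, !s.skip);
        }
        RowSelectionStrategy::Mask(builder.finish())
    }
}
```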

Describe alternatives you've considered

Additional context

@XiangpengHao
Contributor

take

@XiangpengHao
Contributor

I'll take a look at this. Here are some of my plans:

  • Implement a benchmark, e.g., a RowSelection with 100k selectors (as used in ClickBench q21), benchmarking and_then, from_filters and intersection; see the sketch after this list.
  • Implement a new RowSelection that uses a BooleanBuffer as the backend, and compare the performance.
  • Decide the policy for when to use which.
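
For the first bullet, a rough criterion-style sketch of such a benchmark (the alternating single-row pattern is an assumption standing in for the ClickBench q21 selection; only and_then is shown):

```rust
use criterion::{criterion_group, criterion_main, Criterion};
use parquet::arrow::arrow_reader::{RowSelection, RowSelector};

// Worst case for the run-length representation: alternating
// single-row select/skip, i.e. n selectors for n rows.
fn alternating(n: usize) -> RowSelection {
    (0..n)
        .map(|i| {
            if i % 2 == 0 {
                RowSelector::select(1)
            } else {
                RowSelector::skip(1)
            }
        })
        .collect::<Vec<_>>()
        .into()
}

fn bench_and_then(c: &mut Criterion) {
    let a = alternating(100_000); // selects 50k of 100k rows
    let b = alternating(50_000);  // applied to the 50k rows a selects
    c.bench_function("and_then 100k selectors", |bench| {
        bench.iter(|| a.and_then(&b))
    });
}

criterion_group!(benches, bench_and_then);
criterion_main!(benches);
```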

Some more context:
I had to switch to a BooleanBuffer-based row selection to reduce the selection overhead in another project, so I already have most of the implementation ready. The remaining work for me is to figure out a way to contribute it back to arrow-rs.

@alamb
Contributor

alamb commented Oct 22, 2024

An important goal of this ticket, mentioned by @tustvold in #6454 (comment), is that evaluating predicates in ArrowFilter (aka pushed-down predicates) is never worse than decoding the columns first and then filtering them with the filter kernel.

If we are able to achieve this goal, it would mean that query engines like DataFusion could always push all predicates down into the parquet reader.

At the moment, since it is sometimes faster to apply filters after reading columns than via ArrowPredicate, queries can get slower when all predicates are pushed down.

When it makes sense to push predicates down depends on their actual selectivity, which is only known for sure during evaluation.

Thus, I agree with the conclusion that implementing adaptivity in the lowest-level scan will achieve the goal.
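
As a hedged illustration of what "adaptivity in the lowest-level scan" could mean (all names here are hypothetical, not arrow-rs APIs): track the observed selectivity while evaluating, and revisit the decision per batch.

```rust
// Hypothetical policy object; nothing here is an existing arrow-rs API.
struct AdaptiveFilter {
    rows_seen: usize,
    rows_kept: usize,
}

impl AdaptiveFilter {
    // Record the outcome of evaluating the predicate on one batch.
    fn observe(&mut self, batch_rows: usize, kept: usize) {
        self.rows_seen += batch_rows;
        self.rows_kept += kept;
    }

    // Only the scan itself sees the true selectivity, so the choice between
    // skipping rows in the decoder and late filtering can be revisited as
    // data flows through. The 10% threshold is an arbitrary placeholder.
    fn prefer_pushdown(&self) -> bool {
        self.rows_seen == 0
            || (self.rows_kept as f64) / (self.rows_seen as f64) < 0.1
    }
}
```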

@XiangpengHao
Contributor

XiangpengHao commented Oct 22, 2024

is that evaluating predicates in ArrowFilter (aka pushed-down predicates) is never worse than decoding the columns first and then filtering them with the filter kernel

This is an excellent summary of the goal; it also aligns well with my current project.

Since I have gone quite far on this, I want to share some of the issues I have encountered:

  • Very fast row selection, as described in this ticket.
  • Avoid decoding the predicate columns twice, potentially at the cost of higher memory usage, as described in #6454 (comment): parquet::column::reader::GenericColumnReader::skip_records still decompresses most data.
  • Adaptively slice or filter the resulting array, i.e., if the selection is sparse we should filter/take; otherwise we should slice. See the sketch after this list.
  • Coalesce the resulting record batches. Since the filter is pushed down to ParquetExec, we won't have a FilterExec and therefore no CoalesceBatchesExec, which requires ParquetExec to emit coalesced record batches.
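
A sketch of the third bullet (the contiguity check and the thresholding are simplified, and it assumes the mask has no nulls):

```rust
use std::sync::Arc;
use arrow::array::{Array, ArrayRef, BooleanArray, Int32Array};
use arrow::compute::filter;

// If the kept rows form one contiguous run, slice (zero-copy);
// otherwise fall back to the filter kernel. Assumes `mask` has no nulls.
fn select(array: &ArrayRef, mask: &BooleanArray) -> ArrayRef {
    let kept: Vec<usize> = (0..mask.len()).filter(|&i| mask.value(i)).collect();
    let contiguous = !kept.is_empty() && kept.windows(2).all(|w| w[1] == w[0] + 1);
    if contiguous {
        array.slice(kept[0], kept.len()) // zero-copy view
    } else {
        filter(array.as_ref(), mask).expect("filter kernel failed")
    }
}

fn main() {
    let array: ArrayRef = Arc::new(Int32Array::from(vec![1, 2, 3, 4]));
    let dense = BooleanArray::from(vec![false, true, true, true]);
    let sparse = BooleanArray::from(vec![true, false, false, true]);
    println!("dense kept {}", select(&array, &dense).len());   // sliced
    println!("sparse kept {}", select(&array, &sparse).len()); // filtered
}
```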

@alamb
Contributor

alamb commented Oct 22, 2024

Coalesce the resulting record batches. Since the filter is pushed down to ParquetExec, we won't have a FilterExec and therefore no CoalesceBatchesExec, which requires ParquetExec to emit coalesced record batches.

There is a structure in DataFusion for coalescing which might help, provide inspiration, or be good to port upstream: https://github.com/apache/datafusion/blob/c22abb4ac3f1af8bbdf176ef0198988fc7b0982c/datafusion/physical-plan/src/coalesce/mod.rs#L71
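
For context, a minimal sketch in the spirit of that structure, assuming a fixed target row count (Coalescer is an illustrative name, not the ported API):

```rust
use arrow::compute::concat_batches;
use arrow::datatypes::SchemaRef;
use arrow::error::ArrowError;
use arrow::record_batch::RecordBatch;

// Buffer small filtered batches and emit one concatenated batch once a
// target row count is reached.
struct Coalescer {
    schema: SchemaRef,
    target_rows: usize,
    buffered: Vec<RecordBatch>,
    buffered_rows: usize,
}

impl Coalescer {
    fn push(&mut self, batch: RecordBatch) -> Result<Option<RecordBatch>, ArrowError> {
        self.buffered_rows += batch.num_rows();
        self.buffered.push(batch);
        if self.buffered_rows < self.target_rows {
            return Ok(None); // keep buffering
        }
        let out = concat_batches(&self.schema, &self.buffered)?;
        self.buffered.clear();
        self.buffered_rows = 0;
        Ok(Some(out))
    }
}
```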

@alamb
Contributor

alamb commented Oct 22, 2024

Adaptively slice or filter the resulting array, i.e., if the selection is sparse we should filter/take; otherwise we should slice.

I think this is what @tustvold mentioned with the filter kernels that also adaptively decide whether to take / iterate / etc. based on the actual selection.
