Support IGNORE NULLS for LAG window function #9221

comphead · 2024-02-13T19:40:45Z

Which issue does this PR close?

Closes #.
Related #9055

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

comphead · 2024-02-19T18:23:57Z

@mustafasrepo please review whenever you have time.
IGNORE NULLS is supported for the LAG function only.

comphead · 2024-02-19T18:24:42Z

datafusion/proto/src/logical_plan/from_proto.rs

@@ -1114,6 +1116,7 @@ pub fn parse_expr(
                        partition_by,
                        order_by,
                        window_frame,
+                        None


Proto support can be done as a follow-up.

comphead · 2024-02-19T18:27:11Z

datafusion/physical-expr/src/window/lead_lag.rs

            range.end as i64 - self.shift_offset - 1
        } else {
            // LEAD mode
            range.start as i64 - self.shift_offset
        };

-        if idx < 0 || idx as usize >= array.len() {
+        // Support LAG only for now, as LEAD requires some refactoring first


For the LEAD function we likely need to refactor the evaluator and how it works.
The problem is that for LEAD we have to adjust values that have already been emitted by the evaluator, which is not doable afaik. @mustafasrepo I would love to get your input on how we can solve this challenge. One solution is to emit the entire resulting array instead of a single value as we do now, which gives more control.

mustafasrepo · 2024-02-20T12:15:30Z

Thanks @comphead for this PR.
Currently we have two APIs for window function calculations:

evaluate_all
evaluate

evaluate_all takes all of the data as a single batch, so every row is available when deciding the result.
evaluate takes only the section of the data that is strictly necessary to calculate the window function result.
The difference is that WindowAggExec uses evaluate_all, whereas BoundedWindowAggExec uses the evaluate API to decrease memory usage and possibly allow incremental calculation.
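
To make the two call shapes concrete, here is a minimal sketch in Rust. ToyLag and its plain Option<i64> slices are hypothetical stand-ins, not DataFusion's actual evaluator trait or Arrow arrays. The sketch only assumes, as in the lead_lag.rs hunk above, that in evaluate the current row is the last row of the range.

```rust
use std::ops::Range;

/// Hypothetical toy evaluator for LAG (nulls respected); not DataFusion's real trait.
struct ToyLag {
    offset: usize,
}

impl ToyLag {
    /// Batch style: the whole partition is visible, one output per input row.
    fn evaluate_all(&self, values: &[Option<i64>]) -> Vec<Option<i64>> {
        (0..values.len())
            .map(|row| row.checked_sub(self.offset).and_then(|idx| values[idx]))
            .collect()
    }

    /// Incremental style: called once per row with the range of buffered rows.
    /// Assumption: the current row is `range.end - 1`.
    fn evaluate(&self, values: &[Option<i64>], range: &Range<usize>) -> Option<i64> {
        let idx = range.end.checked_sub(self.offset + 1)?;
        values.get(idx).copied().flatten()
    }
}
```

With offset 1, evaluate_all returns one value per row in a single call, while evaluate has to be called once per row and only ever returns a single value. That is why an answer already emitted for an earlier row cannot be revised later.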

This PR changes the evaluate implementation to support LAG. However, as far as I can see, the corresponding evaluate_all handling is not done.

Also, it seems that the current LAG implementation only works for a lag of 1 (which is the default). For other lag values, I don't think the current implementation will generate correct results.

For the LEAD function we likely need to refactor the evaluator and how it works.
The problem is that for LEAD we have to adjust values that have already been emitted by the evaluator, which is not doable afaik. @mustafasrepo I would love to get your input on how we can solve this challenge. One solution is to emit the entire resulting array instead of a single value as we do now, which gives more control.

I think we can generate the correct result without changing the API by keeping track of the null_count within the offset interval in a running fashion. I am not sure, though.

I think:

We should first add support for the evaluate_all API. This should be easier than the evaluate support, since the evaluate_all API has all of the data available. Both LAG and LEAD support can be implemented for this API; as far as I can presume, there won't be much difference between them.
Then, we should add support for the evaluate API.

I can work on the support for the evaluate API, if that is OK with you.

comphead · 2024-02-20T16:59:49Z

Thanks @mustafasrepo for the detailed feedback. I'll remove the leftovers not related to this PR.
My next steps:

Add tests for LAG with a non-default offset.
I'll try to use evaluate_all; you are right that it will be easier, as we have all the data in place. The only thing that concerns me is that when I ran the tests I didn't see .evaluate_all being called.
For evaluate, I appreciate your help. One idea I had is to reverse the input array before calling evaluate, for the LEAD function only. It should potentially work, but the reversing might be expensive.

mustafasrepo · 2024-02-21T06:37:08Z

The only thing that concerns me is that when I ran the tests I didn't see .evaluate_all being called.

evaluate_all is called from WindowAggExec. WindowAggExec is used when one of the window frame boundaries includes UNBOUNDED FOLLOWING. Hence, for the query below

SELECT LAG(c9, 2) OVER(ORDER BY c9), SUM(c9) OVER(order by c9 ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING)
FROM aggregate_test_100;

WindowAggExec will be used. Unfortunately, I couldn't come up with a simpler reproducer as a SQL query.

One idea I had is to reverse the input array before calling evaluate, for the LEAD function only. It should potentially work, but the reversing might be expensive.

Another approach might be to construct, for the table, a vector of the same length that tracks the non-null count. For example, for the table below

a
1
2
null
3
null
4

we would construct the vector

non_null_count
1
2
2
3
3
4

where the count is incremented each time a new non-null value is seen.
LAG and LEAD can work on this vector to determine the position of the value. For instance, for row index 5 in the original table (where a=4),
LAG(2) should produce 2. We can determine this from the non_null_count of the row (4 in our case): with an offset of 2, the non_null_count of the result should be 4 - 2 = 2. We then find the index of the first 2 in the non_null_count vector (any later 2s point to null values). That index is 1, which holds a=2 in the table.
LEAD can work similarly.
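
The lookup described above can be sketched in Rust as follows. The function names and the Option<i64> slice are hypothetical, chosen for illustration only. The comment above only walks through a non-null current row; the handling of a null current row (where the row itself does not count as one of the skipped values) is an assumption of this sketch.

```rust
/// counts[i] = number of non-null values in values[0..=i].
fn non_null_counts(values: &[Option<i64>]) -> Vec<usize> {
    let mut count = 0usize;
    values
        .iter()
        .map(|v| {
            if v.is_some() {
                count += 1;
            }
            count
        })
        .collect()
}

/// LAG(offset) IGNORE NULLS for `row`, using the running counts.
fn lag_ignore_nulls(
    values: &[Option<i64>],
    counts: &[usize],
    row: usize,
    offset: usize,
) -> Option<i64> {
    if offset == 0 {
        return values[row];
    }
    // Non-null values strictly before `row` (assumption: a null current row
    // is not counted, so it looks back from the nearest earlier non-null).
    let preceding = counts[row] - usize::from(values[row].is_some());
    // 1-based rank of the wanted non-null value; rank 0 means "none left".
    let target = (preceding + 1).checked_sub(offset)?;
    if target == 0 {
        return None;
    }
    // counts is non-decreasing, so the first index with counts == target
    // can be found by binary search; that index holds a non-null value.
    let idx = counts.partition_point(|&c| c < target);
    values[idx]
}
```

For the table above, row 5 with offset 2 gives a target rank of 2, whose first occurrence in the counts vector is index 1, matching the walkthrough.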

mustafasrepo · 2024-02-21T06:58:03Z

After thinking about the possible solutions, another approach might be for the table below

a
1
2
null
3
null
4

constructing a vector of the same length, where each entry contains the index of the previous non-null entry.
For the table above, this would be

non-null_pointers
-1
0
1
1
3
3

To find LAG(2) for row 5 (where a=4), we would follow the pointers twice (idx 5 -> 3, 3 -> 1). Hence the result would be the value at index 1, which is a=2. If we encounter -1, the result is None, because there is no earlier data. For LEAD, we might need to reverse the data and apply LAG, as you suggest.
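
A minimal Rust sketch of the pointer-following idea, with hypothetical function names and Option<usize> standing in for the -1 sentinel. It takes the definition literally, so every row (null or not) points to the nearest non-null row strictly before it.

```rust
/// prev[i] = index of the nearest non-null entry strictly before i, if any.
fn prev_non_null(values: &[Option<i64>]) -> Vec<Option<usize>> {
    let mut last: Option<usize> = None;
    values
        .iter()
        .enumerate()
        .map(|(i, v)| {
            let prev = last;
            if v.is_some() {
                last = Some(i);
            }
            prev
        })
        .collect()
}

/// LAG(offset) IGNORE NULLS: follow the pointer chain `offset` times.
fn lag_by_pointers(
    values: &[Option<i64>],
    prev: &[Option<usize>],
    row: usize,
    offset: usize,
) -> Option<i64> {
    let mut idx = row;
    for _ in 0..offset {
        // A missing pointer plays the role of -1: no earlier data.
        idx = prev[idx]?;
    }
    values[idx]
}
```

Each lookup costs `offset` pointer hops, so this variant trades the binary search of the count-based approach for a walk whose length depends on the offset.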

comphead · 2024-02-22T01:45:33Z

@mustafasrepo thanks for the suggestions. I've implemented a similar approach that tracks non-null row indexes, so it likely works for non-default offsets.
I was not able to call LAG in .evaluate_all mode, as it is just not configured that way.

https://github.com/apache/arrow-datafusion/blob/main/datafusion/core/src/physical_planner.rs#L750 is the condition that selects the preferred mode.

It will always be Bounded for LAG/LEAD, because
https://github.com/apache/arrow-datafusion/blob/cf11a700eb6a5385a6ebade2b92c684380940296/datafusion/physical-expr/src/window/built_in.rs#L276 refers to evaluator.supports_bounded_execution, which is true for LAG/LEAD (see https://github.com/apache/arrow-datafusion/blob/cf11a700eb6a5385a6ebade2b92c684380940296/datafusion/physical-expr/src/window/lead_lag.rs#L234 ), while uses_window_frame is false.
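
The selection logic described here can be paraphrased with a toy model. This is a sketch of my reading of the linked code, not a copy of it: ToyWindowExpr, Mode, choose_mode, and the boolean fields are hypothetical names, and the exact condition (bounded only if every expression supports bounded execution and either ignores the window frame or has a bounded frame end) is an assumption.

```rust
/// Hypothetical stand-in for the two executors.
#[derive(Debug, PartialEq)]
enum Mode {
    Bounded,   // BoundedWindowAggExec, which calls evaluate
    Unbounded, // WindowAggExec, which calls evaluate_all
}

/// Hypothetical flattened view of one window expression.
#[derive(Clone, Copy)]
struct ToyWindowExpr {
    supports_bounded_execution: bool,
    uses_window_frame: bool,
    end_bound_is_unbounded: bool,
}

impl ToyWindowExpr {
    /// Assumed condition for running with bounded memory.
    fn uses_bounded_memory(&self) -> bool {
        self.supports_bounded_execution
            && (!self.uses_window_frame || !self.end_bound_is_unbounded)
    }
}

/// Assumption: the bounded executor is chosen only if every expression qualifies.
fn choose_mode(exprs: &[ToyWindowExpr]) -> Mode {
    if exprs.iter().all(|e| e.uses_bounded_memory()) {
        Mode::Bounded
    } else {
        Mode::Unbounded
    }
}
```

Under this model, LAG alone always selects the bounded mode, and adding a frame-using function whose frame ends at UNBOUNDED FOLLOWING (such as the SUM in the query suggested earlier) flips the whole plan to WindowAggExec.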

mustafasrepo

Thanks @comphead for this PR. It LGTM!
I sent two commits:

One includes a test that triggers the evaluate_all call (we can work on this test in a following PR to add support for WindowAggExec), along with some minor stylistic changes.
The other makes the algorithm pruning friendly. The previous implementation relied on the tracked indices staying correct, which might not be the case after pruning. Hence, I modified the implementation so that it produces correct results when pruned.
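
To illustrate why absolute indices are fragile under pruning, here is one possible pruning-friendly formulation in Rust. It is not the code from the commit: StreamingLag is a hypothetical type that remembers the last `offset` non-null values themselves, so its answer does not depend on how many earlier rows are still buffered.

```rust
use std::collections::VecDeque;

/// Hypothetical streaming LAG(offset) IGNORE NULLS over i64 values.
struct StreamingLag {
    offset: usize,
    /// Up to `offset` most recent non-null values seen so far, oldest first.
    recent: VecDeque<i64>,
}

impl StreamingLag {
    fn new(offset: usize) -> Self {
        assert!(offset >= 1, "this sketch only handles offsets >= 1");
        Self {
            offset,
            recent: VecDeque::with_capacity(offset),
        }
    }

    /// Feed the next row and get the LAG result for that row.
    fn push(&mut self, value: Option<i64>) -> Option<i64> {
        // The answer is the oldest remembered value, once `offset` non-null
        // values have been seen before the current row.
        let out = if self.recent.len() == self.offset {
            self.recent.front().copied()
        } else {
            None
        };
        // Only non-null rows advance the window of remembered values.
        if let Some(v) = value {
            if self.recent.len() == self.offset {
                self.recent.pop_front();
            }
            self.recent.push_back(v);
        }
        out
    }
}
```

Because the state is a bounded queue of values rather than positions in the input, feeding the rows one at a time or all at once yields the same output.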

mustafasrepo · 2024-02-22T09:56:50Z

datafusion/sqllogictest/test_files/window.slt

+set datafusion.execution.batch_size = 1;
+
+query I
+SELECT LAG(c1, 2) IGNORE NULLS OVER()


This test triggers pruning internally. The previous implementation produced a different result here than in the test above, where the data is fed as a single chunk (because of the large batch size) and no pruning is done.

mustafasrepo · 2024-02-22T11:19:19Z

datafusion/sqllogictest/test_files/window.slt

+# LAG window function IGNORE/RESPECT NULLS support with descending order and nondefault offset.
+# To trigger WindowAggExec, we added a sum window function with all of the ranges.
+statement error Execution error: IGNORE NULLS mode for LAG and LEAD is not supported for WindowAggExec
+select lag(a, 2, null) ignore nulls over (order by id desc) as x1,


This test triggers the evaluate_all call.

comphead · 2024-02-22T16:39:32Z

Thanks @mustafasrepo. I'll wait a couple more hours and then merge it if no other feedback shows up.

github-actions bot added the sql (SQL Planner), logical-expr (Logical plan and expressions), physical-expr (Physical Expressions), optimizer (Optimizer rules), core (Core DataFusion crate), and substrait labels Feb 13, 2024

comphead changed the title ~~[WIP] lag/lead ignore nulls~~ [WIP] Support IGNORE NULLS for LAG window function Feb 19, 2024

github-actions bot added the sqllogictest SQL Logic Tests (.slt) label Feb 19, 2024

comphead added 2 commits February 19, 2024 08:50

WIP lag/lead ignore nulls

767f6a9

Support IGNORE NULLS for LAG function

9876849

comphead force-pushed the dev branch from 9a4f185 to 9876849 Compare February 19, 2024 16:50

fmt

341b965

comphead marked this pull request as ready for review February 19, 2024 18:23

comphead requested review from mustafasrepo and alamb February 19, 2024 18:23

comphead changed the title ~~[WIP] Support IGNORE NULLS for LAG window function~~ Support IGNORE NULLS for LAG window function Feb 19, 2024

comphead requested a review from viirya February 19, 2024 22:12

mustafasrepo reviewed Feb 20, 2024

datafusion/expr/src/expr.rs (outdated, resolved)
datafusion/physical-expr/src/window/built_in.rs (outdated, resolved)
datafusion/core/src/datasource/file_format/parquet.rs (outdated, resolved)
datafusion/physical-expr/src/window/lead_lag.rs (outdated, resolved)

comphead marked this pull request as draft February 20, 2024 18:58

comments

c82c05c

remove comments

0223836

comphead marked this pull request as ready for review February 22, 2024 01:45

comphead requested a review from mustafasrepo February 22, 2024 01:45

Add new tests, minor changes, trigger evalaute_all

5d532fd

mustafasrepo approved these changes Feb 22, 2024

Make algorithm pruning friendly

9897de8

mustafasrepo reviewed Feb 22, 2024

comphead mentioned this pull request Feb 23, 2024

LAG window function schema issue. Non null conflict. #9320

Closed

comphead merged commit a851ecf into apache:main Feb 23, 2024
23 checks passed

This was referenced Feb 27, 2024

Implement IGNORE NULLS for window functions #9055

Closed

Implement IGNORE NULLS for FIRST_VALUE #9411

Merged
