fix: CometTakeOrderedAndProjectExec native scan node should use child operator's output #896

viirya · 2024-08-30T17:20:25Z

Which issue does this PR close?

Closes #.

Rationale for this change

What changes are included in this PR?

This bug was found while debugging CI failure in #893. In CometTakeOrderedAndProjectExec, we create internal native plan to execute native limit + sort + project. The pseudo scan node created there was incorrectly using CometTakeOrderedAndProjectExec's output attributes but it should be child node's output.

Currently it doesn't cause any error, although it will have incorrect schema in the scan node. Because sort/limit simply takes input so the incorrect schema doesn't make error. For project, we already bind attributes in Spark, as the input data is correct so it has no error too.

But in #893, we need to get the number of columns from the schema of scan node. If the schema is incorrect, the scan node will create incorrect number of array/schema structures which cause error later.

How are these changes tested?

… operator's output

viirya · 2024-08-30T18:34:23Z

Thanks @andygrove

fix: CometTakeOrderedAndProjectExec native scan node should use child…

87a63ec

… operator's output

viirya requested review from andygrove and huaxingao August 30, 2024 17:26

andygrove approved these changes Aug 30, 2024

View reviewed changes

viirya mentioned this pull request Aug 30, 2024

chore: Revise batch pull approach to more follow C Data interface semantics #893

Merged

viirya merged commit bfbb0be into apache:main Aug 30, 2024
76 checks passed

viirya deleted the fix_takeordered branch August 30, 2024 18:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: CometTakeOrderedAndProjectExec native scan node should use child operator's output #896

fix: CometTakeOrderedAndProjectExec native scan node should use child operator's output #896

viirya commented Aug 30, 2024 •

edited

Loading

viirya commented Aug 30, 2024

fix: CometTakeOrderedAndProjectExec native scan node should use child operator's output #896

fix: CometTakeOrderedAndProjectExec native scan node should use child operator's output #896

Conversation

viirya commented Aug 30, 2024 • edited Loading

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

How are these changes tested?

viirya commented Aug 30, 2024

viirya commented Aug 30, 2024 •

edited

Loading