-
Notifications
You must be signed in to change notification settings - Fork 186
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Upgrade to DataFusion 40.0.0-rc1 #644
Conversation
There is a test failure:
|
I recall that count, max, min are supported as window function recently. Maybe there are some changes in DataFusion. cc @huaxingao |
maybe this PR? I need to take a closer look. |
We rely on |
@viirya @huaxingao I made a change so that we fall back to Spark for count in window aggregate for now, and I will file a follow on issue. This PR is ready for review now. edit: I filed #645 |
@@ -208,7 +208,7 @@ object QueryPlanSerde extends Logging with ShimQueryPlanSerde with CometExprShim | |||
expr match { | |||
case agg: AggregateExpression => | |||
agg.aggregateFunction match { | |||
case _: Min | _: Max | _: Count => |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This disables count
for both aggregation and window?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh, nvm, I checked the source code. This is only for window functions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we add a comment here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added
* Partial upgrade to DataFusion 40.0.0 * fix * implement more udaf * update bitwise agg * add func names * remove unused imports * remove arrow-string dep * fix copy and paste error * use 40.0.0-rc1 and temporarily ignore failing test * clippy * fall back to Spark for count windows aggregate * address feedback
Which issue does this PR close?
N/A
Rationale for this change
DataFusion is preparing to release version 40.0.0 so I would like to make sure that we are testing with the latest prior to the release.
What changes are included in this PR?
Use a very recent revision of the official DataFusion repo.
How are these changes tested?