-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-42461][CONNECT] Scala Client implement first batch of functions #40050
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
# Conflicts: # connector/connect/client/jvm/src/main/scala/org/apache/spark/sql/functions.scala # connector/connect/common/src/test/resources/query-tests/explain-results/function_max.explain # connector/connect/common/src/test/resources/query-tests/queries/function_max.json # connector/connect/common/src/test/resources/query-tests/queries/function_max.proto.bin
|
Note for the reviewer. I still want to add a couple of tests for duplicate functions. |
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Did you verify this with Scala 2.13?
To reviewers, please hold on approving this PR. This module has been broken already with Scala 2.13. The broken branch blocks other community member's development severely.
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FYI, the previous PR leaves 9 UT failures.
[info] *** 9 TESTS FAILED ***
[error] Failed tests:
[error] org.apache.spark.sql.connect.ProtoToParsedPlanTestSuite
[error] (connect / Test / test) sbt.TestsFailedException: Tests unsuccessful
[error] Total time: 41 s, completed Feb 16, 2023, 7:06:12 AM
|
@dongjoon-hyun I will fix those today. I do think we should have a discussion about this. Currently we have both maven and scala-2.13 that are not tested during CI. That seems wrong if both are apparently supported. The mental overhead of testing these manually is very high. |
|
As for blocking other community members severely, the same applies to the lack of testing of scala-2.13. |
# Conflicts: # connector/connect/client/jvm/src/test/scala/org/apache/spark/sql/ClientE2ETestSuite.scala
|
Fix for 2.13 has merged. I am going to hold this off until #40056 is merged. |
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, LGTM (Pending CIs).
I assume that this is tested with Scala 2.13 in the same way, @hvanhovell .
|
merging |
### What changes were proposed in this pull request? This PR adds the following functions to Spark Connect Scala Client: - Sort Functions - Aggregate Functions - Misc Functions - Math Functions ### Why are the changes needed? We want to the Spark Connect Scala Client to reach parity with the original functions API. ### Does this PR introduce _any_ user-facing change? Yes, it adds a lot of functions. ### How was this patch tested? Added test for all functions and their significant variations. Closes #40050 from hvanhovell/SPARK-42461. Authored-by: Herman van Hovell <herman@databricks.com> Signed-off-by: Herman van Hovell <herman@databricks.com> (cherry picked from commit e6a84fe) Signed-off-by: Herman van Hovell <herman@databricks.com>
|
thanks all |
### What changes were proposed in this pull request? This PR adds the following functions to Spark Connect Scala Client: - Sort Functions - Aggregate Functions - Misc Functions - Math Functions ### Why are the changes needed? We want to the Spark Connect Scala Client to reach parity with the original functions API. ### Does this PR introduce _any_ user-facing change? Yes, it adds a lot of functions. ### How was this patch tested? Added test for all functions and their significant variations. Closes apache#40050 from hvanhovell/SPARK-42461. Authored-by: Herman van Hovell <herman@databricks.com> Signed-off-by: Herman van Hovell <herman@databricks.com> (cherry picked from commit e6a84fe) Signed-off-by: Herman van Hovell <herman@databricks.com>
What changes were proposed in this pull request?
This PR adds the following functions to Spark Connect Scala Client:
Why are the changes needed?
We want to the Spark Connect Scala Client to reach parity with the original functions API.
Does this PR introduce any user-facing change?
Yes, it adds a lot of functions.
How was this patch tested?
Added test for all functions and their significant variations.