[SPARK-36751][SQL][PYTHON][R] Add bit/octet_length APIs to Scala, Python and R #33992
ok to test.
the change looks fine otherwise.
Kubernetes integration test unable to build dist. exiting with code: 1
Kubernetes integration test starting
Kubernetes integration test status failure
@HyukjinKwon @sarutak
Test build #143259 has finished for PR 33992 at commit
retest this please.
Test build #143249 has finished for PR 33992 at commit
Kubernetes integration test starting
Kubernetes integration test status failure
Kubernetes integration test unable to build dist. exiting with code: 1
Test build #143253 has finished for PR 33992 at commit
HyukjinKwon
LGTM otherwise. Let me leave it to @sarutak
Test build #143260 has finished for PR 33992 at commit
Kubernetes integration test starting
Kubernetes integration test status failure
Kubernetes integration test starting
Kubernetes integration test status failure
Test build #143288 has finished for PR 33992 at commit
Test build #143290 has finished for PR 33992 at commit
Merged to
Can one of the admins verify this patch?
…or Examples in docstring

### What changes were proposed in this pull request?

This PR is a minor followup of #33992 to fix the warnings during PySpark documentation build:

```
/.../spark/python/pyspark/sql/functions.py:docstring of pyspark.sql.functions.bit_length:19: WARNING: Unexpected section title or transition.
-------
/.../spark/python/pyspark/sql/functions.py:docstring of pyspark.sql.functions.octet_length:19: WARNING: Unexpected section title or transition.
-------
```

We should always have the same length of hyphens with the title.

### Why are the changes needed?

To remove warnings during the documentation build and show the HTML pages correctly.

### Does this PR introduce _any_ user-facing change?

This is not released yet, and only in master branch. So, no to end users.

### How was this patch tested?

Manually built the docs via `make clean html` at `python/docs` directory.

Closes #34196 from HyukjinKwon/SPARK-36751.

Authored-by: Hyukjin Kwon <gurwls223@apache.org>
Signed-off-by: Kousuke Saruta <sarutak@oss.nttdata.com>
What changes were proposed in this pull request?
octet_length: calculate the byte length of strings
bit_length: calculate the bit length of strings
These two string-related functions are currently implemented only in Spark SQL, not in the Scala, Python, and R APIs.
Why are the changes needed?
These functions would be useful for users who handle multi-byte characters and mainly work with Scala, Python, or R.
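To illustrate why byte and bit lengths differ from character counts for multi-byte text, here is a minimal plain-Python sketch. It assumes UTF-8 encoding, which is how Spark stores strings. The helpers `octet_length` and `bit_length` below are hypothetical stand-ins written for illustration; they only mirror the names of the `pyspark.sql.functions` APIs and are not the Spark implementation.

```python
# Hypothetical plain-Python helpers that mimic the semantics of
# octet_length/bit_length for strings, assuming UTF-8 encoding.
# They are NOT the Spark functions, just an illustration.

def octet_length(s: str) -> int:
    """Return the number of bytes in the UTF-8 encoding of s."""
    return len(s.encode("utf-8"))


def bit_length(s: str) -> int:
    """Return the number of bits in the UTF-8 encoding of s."""
    return 8 * octet_length(s)


# ASCII text, a Latin character with an accent, and Japanese text.
samples = ["Spark", "caf\u00e9", "\u65e5\u672c\u8a9e"]

for text in samples:
    # len() counts characters (code points); the helpers count bytes and bits.
    print(text, len(text), octet_length(text), bit_length(text))
```

For ASCII-only input the character count and byte count agree, while each non-ASCII character here takes two or three bytes, so the results diverge.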
Does this PR introduce any user-facing change?
Yes. Users can call the octet_length/bit_length APIs in Scala (DataFrame), Python, and R.
How was this patch tested?
unit tests