[SPARK-47411][SQL] Support StringInstr & FindInSet functions to work with collated strings #45643
Closed
53 commits
a4d3592  Add support for instr and unit test in CollationStringExpressionsSuit… (miland-db)
eb2d7c5  Correct code style (miland-db)
9340831  Remove blank line from CollationStringExpressionsSuite.scala (miland-db)
465e814  Correct comment indentation (miland-db)
f3f30d8  Add unit tests for INSTR operation (miland-db)
9cb92d3  Add doGenCode for FindInSet (miland-db)
834be70  Rewrite unit tests for INSTR and FIND_IN_SET (miland-db)
db2453a  Correct return value when substr is not found in INSTR method (miland-db)
91b648a  Update unit tests for StringInStr and FindInSet (miland-db)
1062521  Remove tests on non-explicit default collation (miland-db)
42700c7  Merge branch 'apache:master' into miland-db/substr-functions (miland-db)
427ea25  Improve signature of testInStr (miland-db)
546b3b0  Merge branch 'master' into miland-db/substr-functions (miland-db)
108d707  Remove E2E test for collation mismatch. This will be added in Implici… (miland-db)
822ecd2  Resolve merge problems with master (miland-db)
f730d05  Improve scala style (miland-db)
de7b591  Solve whitespace scala style problem (miland-db)
f0ee8fd  Add lazy val collationId (miland-db)
4ac6885  Remove repeated code (miland-db)
b931333  Improve test format (miland-db)
0fd51d5  Improve indexOf method (miland-db)
28fa7f0  Remove checks in return statement of collatedIndexOf method (miland-db)
bab96ac  Merge branch 'master' into substr-functions (miland-db)
4666aff  Add branch for collated findInSet (miland-db)
ca8a37c  Add branch for collation check in StringInstr (miland-db)
4ffab78  Improve naming of collation aware methods (miland-db)
037d6be  Improve java style (miland-db)
0a22909  Improve collationAwareIndexOf performance (miland-db)
b3be85d  Fix indentation (miland-db)
c4c0fe7  Add more tests for instr (miland-db)
5b29f76  Add more tests (miland-db)
877828e  Remove collation match type checks (miland-db)
038a071  Merge branch 'master' into substr-functions (miland-db)
b35a8ac  Merge with the latest master (miland-db)
8b06014  Remove checkInputDataTypes (miland-db)
2c454af  Merge branch 'master' into substr-functions (miland-db)
4dbc26e  Merge branch 'master' into substr-functions (miland-db)
fbd1c00  Refactor code and move it to CollationSupport (miland-db)
960af54  Improve codegen and run tests (miland-db)
05cd6c4  Unify collationAwareIndexOf for return value to have same semantics a… (miland-db)
053efa0  Break line at 100 chars (miland-db)
c65d68e  Add new version of getStringSearch (miland-db)
ae33a38  Rename StringInstr params and class in CollationSupport (miland-db)
be1b52c  Go from nullSafeCodeGen to defineCodeGen (miland-db)
5894d2f  Refactor testing (miland-db)
75dc0bd  Remove empty lines (miland-db)
c712b4b  Improve CollationAware indexOf to have the same semantics as UTF8Stri… (miland-db)
3c37f35  Add new e2e test (miland-db)
cd860b9  Revert unused import deletion (miland-db)
3fa2502  Fix codegen (miland-db)
b12f176  Merge branch 'master' into substr-functions (miland-db)
b35d718  Remove unused import (miland-db)
1ee5ad6  Add new tests (miland-db)
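
For readers unfamiliar with the two functions these commits touch, the sketch below illustrates the return-value convention that several commit messages refer to: INSTR yields the 1-based position of the first match and 0 when the substring is not found, and FIND_IN_SET yields the 1-based index of a word in a comma-delimited list, or 0 when it is absent or the word itself contains a comma. This is not the code from this pull request. The class name CollatedSearchSketch and both method bodies are hypothetical, and plain case-insensitive comparison from the Java standard library stands in for a real collation. Positions here are counted in UTF-16 code units, which is a simplification.

```java
// Hypothetical illustration only; not the implementation from this pull request.
// Case-insensitive matching is used as a stand-in for a case-insensitive collation.
final class CollatedSearchSketch {

    // Returns the 1-based position of the first case-insensitive occurrence
    // of substr in str, or 0 if substr does not occur.
    static int instr(String str, String substr) {
        int last = str.length() - substr.length();
        for (int i = 0; i <= last; i++) {
            if (str.regionMatches(true, i, substr, 0, substr.length())) {
                return i + 1;
            }
        }
        return 0;
    }

    // Returns the 1-based index of word in the comma-delimited list set,
    // or 0 if it is not present or if word contains a comma.
    static int findInSet(String word, String set) {
        if (word.indexOf(',') >= 0) {
            return 0;
        }
        String[] items = set.split(",", -1); // -1 keeps trailing empty items
        for (int i = 0; i < items.length; i++) {
            if (items[i].equalsIgnoreCase(word)) {
                return i + 1;
            }
        }
        return 0;
    }

    public static void main(String[] args) {
        System.out.println(instr("aBcDe", "CD"));
        System.out.println(instr("aBcDe", "xyz"));
        System.out.println(findInSet("AB", "abc,b,ab,c,def"));
        System.out.println(findInSet("a,b", "abc,b,ab,c,def"));
    }
}
```

A production version would presumably delegate the comparison to a collation-aware search rather than regionMatches and equalsIgnoreCase; the sketch only fixes the 1-based and 0-for-missing conventions.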