-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-42484] [SQL] UnsafeRowUtils better error message #40073
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateStore.scala
Outdated
Show resolved
Hide resolved
5623f25 to
7cb2853
Compare
|
Some tests always fail with Length of string "select id from hive. I guess that's because my folder name is https://github.com/WweiL/oss-spark/actions/runs/4235566927/jobs/7359363624 So I changed the hard-coded length here to a variable-length one. @MaxGekk @srielau @itholic I found that this is related to the PR (#39977) you pushed / reviewed. Can you guys also take a look? Thanks! |
|
Thanks for catching out, @WweiL ! |
|
@cloud-fan Can you merge this to master when you get a chance? Thank you! |
|
thanks, merging to master! |
|
Was this merged only to Spark 3.5 (master branch)? The JIRA ticket is not properly marked for fix version as well as status, and we need to make it clear to determine the version range to apply SPARK-42572. |
Yes this was only merged to master. I've updated the version in SPARK-42572. BTW is there a way to quickly decide what's the version of the current master branch? I thought it was 3.4... |
|
Sorry I seem to look at different JIRA ticket. SPARK-42484 contains the fixed version, 3.5.0. That said, SPARK-42572 only needs to be applied to master branch. |
…ateRowFormat ### What changes were proposed in this pull request? #40073 accidentally changed the relationship of the two `if` statement in `StateStoreProvider.validateStateRowFormat`. Before they were inclusive, i.e. ``` if (a) { // <code> if (b) { // <code> } } ``` It was changed to parallel, i.e. ``` if (a) { // <code> } if (b) { // <code> } ``` This PR change it back to the original behavior and add a unit test to prevent it in the future. ### Why are the changes needed? As above. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Unit test Closes #40187 from WweiL/SPARK-42572-stateStore-logic-test. Authored-by: Wei Liu <wei.liu@databricks.com> Signed-off-by: Jungtaek Lim <kabhwan.opensource@gmail.com>
What changes were proposed in this pull request?
Showing the essential information when throwing
InvalidUnsafeRowException. Including where the check failed, and status of theunsafeRowandexpctedSchemaExample output:
Why are the changes needed?
Right now if such error happens, it's hard to track where it errored, and what the misbehaved row & schema looks like. With this change these information are more clear.
Does this PR introduce any user-facing change?
No
How was this patch tested?
Unit tests