-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-32105][SQL]Refactor current ScriptTransformationExec code #27983
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
| (3, "3", 3.0, BigDecimal(3.0), new Timestamp(3), Date.valueOf("2015-05-23")) | ||
| ).toDF("a", "b", "c", "d", "e", "f") // Note column d's data type is Decimal(38, 18) | ||
| df.createTempView("v") | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here add test case of Timestamp and DateType
|
@gatorsmile @tejasapatil @wangyum I have restart implement ScriptTransform in sql/core now, Rewrite origin pr in current code. Hope for your review and suggestion |
|
Hi, @AngersZhuuuu . This is too big as a follow-up. Please create a new JIRA issue for this. |
|
ok to test |
|
Test build #120696 has finished for PR 27983 at commit
|
Seems not need to create new jira? just delete follow-up? |
|
@dongjoon-hyun Seem sometimes it won't catch the subprocess's exception. |
|
Test build #122194 has finished for PR 27983 at commit
|
|
Test build #122214 has finished for PR 27983 at commit
|
|
cc @cloud-fan Can you take a look? |
|
Test build #122637 has finished for PR 27983 at commit
|
@dongjoon-hyun For the latest failed test, Did I miss something? |
|
Retest this please |
|
Test build #123517 has finished for PR 27983 at commit
|
|
retest this please |
|
Test build #123525 has finished for PR 27983 at commit
|
|
Test build #125305 has finished for PR 27983 at commit
|
|
retest this please |
|
Test build #125324 has finished for PR 27983 at commit
|
| import org.apache.spark.sql.types.DataType | ||
| import org.apache.spark.util.{CircularBuffer, SerializableConfiguration, Utils} | ||
|
|
||
| trait BaseScriptTransform extends UnaryExecNode { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah, my bad. Probably, BaseScriptTransformationExec is better.. #27983 (comment)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah, my bad. Probably,
BaseScriptTransformationExecis better.. #27983 (comment)
Yea, done
|
Could you update the PR description, too? |
Done, sorry for always forgot to do that.... |
|
Test build #125424 has finished for PR 27983 at commit
|
|
Could you check the comment, too? #27983 (comment) |
|
185f26c to
e5cd3d8
Compare
|
Test build #125431 has finished for PR 27983 at commit
|
|
Test build #125428 has finished for PR 27983 at commit
|
|
retest this please |
|
Test build #125452 has finished for PR 27983 at commit
|
|
retest this please |
|
Test build #125461 has finished for PR 27983 at commit
|
|
retest this please |
|
Test build #125492 has started for PR 27983 at commit |
|
retest this please |
|
Test build #125537 has finished for PR 27983 at commit
|
|
retest this please |
|
Test build #125567 has finished for PR 27983 at commit
|
|
retest this please |
|
Test build #125587 has started for PR 27983 at commit |
|
retest this please |
|
Test build #125645 has finished for PR 27983 at commit
|
|
thanks, merging to master! |
What changes were proposed in this pull request?
hive/execution/ScriptTransformationExectohive/execution/HiveScriptTransformationExec(don't rename file)BaseScriptTransformationExecabout common code used acrossSparkScriptTransformationExec(next pr add this)andHiveScriptTransformationExecBaseScriptTransformationWriterThreadof writing data thread acrossSparkScriptTransformationWriterThread(added next for support transform in sql/core )andHiveScriptTransformationWriterThread,HiveScriptTransformationWriterThreadadditionally supports Hive serde formatScriptstrategies in hive module toHiveScript, in next pr will addSparkScriptstrategies for support transform in sql/core.Todo List;
BaseScriptTransformationExec, which would run script operator in SQL mode (without Hive).The output of script would be read as a string and column values are extracted by using a delimiter (default : tab character)
- Solve string value error about Date/Timestamp in ScriptTransform
- support use transform with aggregation
- support array/map as transform's input
Why are the changes needed?
Support run transform in SQL mode without hive
Does this PR introduce any user-facing change?
Yes
How was this patch tested?
Added UT