[SPARK-27340][SS][TESTS][FOLLOW-UP] Rephrase API comments and simplify tests #28390

xuanyuanking · 2020-04-28T09:39:04Z

What changes were proposed in this pull request?

- Rephrase the API doc for `Column.as`
- Simplify the UTs

Why are the changes needed?

Address comments in #28326

Does this PR introduce any user-facing change?

No

How was this patch tested?

New UT added.

xuanyuanking · 2020-04-28T09:41:25Z

cc @cloud-fan @dongjoon-hyun @HeartSaVioR

SparkQA · 2020-04-28T15:37:05Z

Test build #121987 has finished for PR 28390 at commit 6e98444.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile

As @HeartSaVioR pointed out, this PR actually changes the metadata propagation behavior of these public Column APIs. Shall we add it to the migration guide?

cloud-fan · 2020-04-29T16:52:27Z

docs/sql-migration-guide.md


  - In Spark 3.1, SQL UI data adopts the `formatted` mode for the query plan explain results. To restore the behavior before Spark 3.0, you can set `spark.sql.ui.explainMode` to `extended`.

+  - In Spark 3.1, the column metadata will always be propagated in the API `name` and `as`. In Spark version 3.0 and earlier, the metadata of `NamedExpression` is set as the `explicitMetadata` for the new column. To restore the behavior before Spark 3.0, you can use the API `as(alias: String, metadata: Metadata)` with explicit metadata.


I think the patch has been merged to 3.0?

in the API `name` and `as` -> in the API `Column#name` and `Column#as`

> In Spark version 3.0 and earlier, the metadata of `NamedExpression` is set as the `explicitMetadata` for the new column.

I think the old behavior is using the metadata of NamedExpression at the time the API was called. The metadata won't change even if the underlying NamedExpression changes metadata.

Thanks, let me emphasize the usage of explicitMetadata.
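
For readers following the migration note, here is a minimal sketch of the `as(alias: String, metadata: Metadata)` overload it mentions. It assumes a local `SparkSession`; the object name, column names, and metadata key are hypothetical and chosen only for illustration. It shows that metadata passed explicitly at alias time is what ends up in the resulting schema, and it does not attempt to reproduce the exact pre-change behavior.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.col
import org.apache.spark.sql.types.MetadataBuilder

// Hypothetical example object; not part of the PR.
object ExplicitMetadataSketch {
  def main(args: Array[String]): Unit = {
    // Assumption: a local, single-threaded session is enough for the sketch.
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("explicit-metadata-sketch")
      .getOrCreate()
    import spark.implicits._

    // Metadata fixed by the caller at the time the alias is created.
    val pinned = new MetadataBuilder()
      .putString("comment", "pinned at alias time") // hypothetical key and value
      .build()

    val df = Seq(1, 2, 3).toDF("value")

    // Column#as with explicit metadata: the alias carries `pinned`
    // instead of deriving metadata from the underlying expression.
    val aliased = df.select(col("value").as("renamed", pinned))

    // The schema of the aliased column reports the explicit metadata.
    assert(aliased.schema("renamed").metadata == pinned)

    spark.stop()
  }
}
```

Passing the metadata explicitly keeps the alias independent of later metadata changes in the underlying expression, which is the property the old behavior discussion above is concerned with.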

SparkQA · 2020-04-29T19:11:00Z

Test build #122067 has finished for PR 28390 at commit 1101d05.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2020-04-30T06:08:10Z

Test build #122098 has finished for PR 28390 at commit d9baf7a.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

cloud-fan · 2020-04-30T06:24:13Z

thanks, merging to master/3.0!

…y tests

### What changes were proposed in this pull request?

- Rephrase the API doc for `Column.as`
- Simplify the UTs

### Why are the changes needed?

Address comments in #28326

### Does this PR introduce any user-facing change?

No

### How was this patch tested?

New UT added.

Closes #28390 from xuanyuanking/SPARK-27340-follow.

Authored-by: Yuanjian Li <xyliyuanjian@gmail.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
(cherry picked from commit 7195a18)
Signed-off-by: Wenchen Fan <wenchen@databricks.com>

xuanyuanking · 2020-04-30T07:04:35Z

Thanks for the review.

dongjoon-hyun · 2020-05-03T23:33:59Z

Thank you all.

address comments

6e98444

probot-autolabeler bot added SQL STRUCTURED STREAMING labels Apr 28, 2020

xuanyuanking mentioned this pull request Apr 28, 2020

[SPARK-27340][SS] Alias on TimeWindow expression cause watermark metadata lost #28326

Closed

cloud-fan approved these changes Apr 28, 2020

View reviewed changes

gatorsmile requested changes Apr 28, 2020

View reviewed changes

address comment

1101d05

probot-autolabeler bot added the DOCS label Apr 29, 2020

cloud-fan reviewed Apr 29, 2020

View reviewed changes

rephrase

d9baf7a

cloud-fan closed this in 7195a18 Apr 30, 2020

xuanyuanking deleted the SPARK-27340-follow branch April 30, 2020 07:04


