[SPARK-31956][SQL] Do not fail if there is no ambiguous self join #28783

cloud-fan · 2020-06-10T12:00:33Z

What changes were proposed in this pull request?

This is a follow-up to #28695 that fixes the problem completely.

The root cause is that `df("col").as("name")` is no longer a column reference, so it should not carry the special column metadata. However, this behavior was broken in ba7adc4#diff-ac415c903887e49486ba542a65eec980L1050-L1053.

This PR fixes the regression by stripping the special column metadata in `Column.name`, which restores the behavior before #28326.
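
To make the idea concrete, here is a minimal sketch in plain Python (not Spark code) of why an aliased column should stop looking like a column reference. Everything in it is a hypothetical stand-in: the `Column` class, the helpers `column_of` and `is_column_reference`, the `strip_special` flag, and the metadata key names are assumptions chosen for illustration, and keeping non-special metadata on the alias is also an assumption of the model rather than something stated in this PR.

```python
from dataclasses import dataclass, field
from typing import Any, Dict

# Assumed key names for the "special column metadata"; illustrative only.
DATASET_ID_KEY = "__dataset_id"
COL_POS_KEY = "__col_position"
SPECIAL_KEYS = {DATASET_ID_KEY, COL_POS_KEY}


@dataclass
class Column:
    """Toy stand-in for a column expression with attached metadata."""
    name: str
    metadata: Dict[str, Any] = field(default_factory=dict)

    def alias(self, new_name: str, strip_special: bool = True) -> "Column":
        """Return a renamed column.

        strip_special=True models the fixed behavior: the alias drops the
        special keys and keeps any other metadata. strip_special=False
        models the regression, where the alias still looks like a reference.
        """
        if strip_special:
            kept = {k: v for k, v in self.metadata.items() if k not in SPECIAL_KEYS}
        else:
            kept = dict(self.metadata)
        return Column(new_name, kept)


def column_of(dataset_id: int, name: str, position: int) -> Column:
    """Model df("col"): a reference tagged with its originating dataset."""
    return Column(name, {DATASET_ID_KEY: dataset_id, COL_POS_KEY: position})


def is_column_reference(col: Column) -> bool:
    """A self-join check would only inspect columns that carry the tag."""
    return DATASET_ID_KEY in col.metadata


ref = column_of(dataset_id=1, name="col", position=0)
fixed = ref.alias("name")                           # behavior after this PR
regressed = ref.alias("name", strip_special=False)  # behavior during the regression
print(is_column_reference(ref), is_column_reference(fixed), is_column_reference(regressed))
# prints: True False True
```

In this model, a check that only looks at tagged columns would skip `fixed` but would still examine `regressed`, which is the kind of false positive the PR description says should not cause a failure.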

Why are the changes needed?

Fix a regression. We shouldn't fail if there is no ambiguous self-join.

Does this PR introduce any user-facing change?

Yes, the query in the test can run now.

How was this patch tested?

Updated the existing test.

cloud-fan · 2020-06-10T12:01:03Z

cc @HyukjinKwon @xuanyuanking

HyukjinKwon

LGTM

HyukjinKwon · 2020-06-10T12:45:45Z

Shall we file a new JIRA instead of reusing SPARK-28344? RC3 will likely pass, so the fix version would conflict.

xuanyuanking

LGTM

SparkQA · 2020-06-10T17:29:17Z

Test build #123761 has finished for PR 28783 at commit 38c0508.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2020-06-10T19:22:21Z

Thank you for fixing this, @cloud-fan !

dongjoon-hyun · 2020-06-10T19:22:41Z

Retest this please.

dongjoon-hyun

+1, LGTM. I verified Python UT locally.
Merged to master/3.0

### What changes were proposed in this pull request?

This is a follow-up to #28695 that fixes the problem completely.

The root cause is that `df("col").as("name")` is no longer a column reference, so it should not carry the special column metadata. However, this behavior was broken in ba7adc4#diff-ac415c903887e49486ba542a65eec980L1050-L1053.

This PR fixes the regression by stripping the special column metadata in `Column.name`, which restores the behavior before #28326.

### Why are the changes needed?

Fix a regression. We shouldn't fail if there is no ambiguous self-join.

### Does this PR introduce _any_ user-facing change?

Yes, the query in the test can run now.

### How was this patch tested?

Updated test.

Closes #28783 from cloud-fan/self-join.

Authored-by: Wenchen Fan <wenchen@databricks.com>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
(cherry picked from commit c400519)
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

SparkQA · 2020-06-11T01:04:42Z

Test build #123778 has finished for PR 28783 at commit 38c0508.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

don't fail if there is no ambiguous self join

38c0508

probot-autolabeler bot added the SQL label Jun 10, 2020

HyukjinKwon changed the title ~~[SPARK-28344][SQL] don't fail if there is no ambiguous self join~~ [SPARK-28344][SQL][FOLLOW-UP] Do not fail if there is no ambiguous self join Jun 10, 2020

HyukjinKwon approved these changes Jun 10, 2020

xuanyuanking approved these changes Jun 10, 2020

cloud-fan changed the title ~~[SPARK-28344][SQL][FOLLOW-UP] Do not fail if there is no ambiguous self join~~ [SPARK-31956][SQL] Do not fail if there is no ambiguous self join Jun 10, 2020

dongjoon-hyun approved these changes Jun 10, 2020

dongjoon-hyun closed this in c400519 Jun 10, 2020
