[SPARK-28285][SQL][PYTHON][TESTS] Convert and port 'outer-join.sql' into UDF test base #25103

huaxingao · 2019-07-11T00:47:36Z

What changes were proposed in this pull request?

This PR adds tests converted from 'outer-join.sql' that exercise UDFs. Please see the contribution guide in the umbrella ticket, SPARK-27921.

Diff compared to 'outer-join.sql'

diff --git a/sql/core/src/test/resources/sql-tests/results/outer-join.sql.out b/sql/core/src/test/resources/sql-tests/results/udf/udf-outer-join.sql.out
index 5db3bae5d0..819f786070 100644
--- a/sql/core/src/test/resources/sql-tests/results/outer-join.sql.out
+++ b/sql/core/src/test/resources/sql-tests/results/udf/udf-outer-join.sql.out
@@ -24,17 +24,17 @@ struct<>
 
 -- !query 2
 SELECT
-  (SUM(COALESCE(t1.int_col1, t2.int_col0))),
-     ((COALESCE(t1.int_col1, t2.int_col0)) * 2)
+  (udf(SUM(udf(COALESCE(t1.int_col1, t2.int_col0))))),
+     (udf(COALESCE(t1.int_col1, t2.int_col0)) * 2)
 FROM t1
 RIGHT JOIN t2
-  ON (t2.int_col0) = (t1.int_col1)
-GROUP BY GREATEST(COALESCE(t2.int_col1, 109), COALESCE(t1.int_col1, -449)),
+  ON udf(t2.int_col0) = udf(t1.int_col1)
+GROUP BY udf(GREATEST(COALESCE(udf(t2.int_col1), 109), COALESCE(t1.int_col1, udf(-449)))),
          COALESCE(t1.int_col1, t2.int_col0)
-HAVING (SUM(COALESCE(t1.int_col1, t2.int_col0)))
-            > ((COALESCE(t1.int_col1, t2.int_col0)) * 2)
+HAVING (udf(SUM(COALESCE(udf(t1.int_col1), udf(t2.int_col0)))))
+            > (udf(COALESCE(t1.int_col1, t2.int_col0)) * 2)
 -- !query 2 schema
-struct<sum(coalesce(int_col1, int_col0)):bigint,(coalesce(int_col1, int_col0) * 2):int>
+struct<CAST(udf(cast(sum(cast(cast(udf(cast(coalesce(int_col1, int_col0) as string)) as int) as bigint)) as string)) AS BIGINT):bigint,(CAST(udf(cast(coalesce(int_col1, int_col0) as string)) AS INT) * 2):int>
 -- !query 2 output
 -367   -734
 -507   -1014
@@ -70,10 +70,10 @@ spark.sql.crossJoin.enabled true
 SELECT *
 FROM (
 SELECT
-    COALESCE(t2.int_col1, t1.int_col1) AS int_col
+    udf(COALESCE(udf(t2.int_col1), udf(t1.int_col1))) AS int_col
     FROM t1
     LEFT JOIN t2 ON false
-) t where (t.int_col) is not null
+) t where (udf(t.int_col)) is not null
 -- !query 6 schema
 struct<int_col:int>
 -- !query 6 output

How was this patch tested?

Tested as guided in SPARK-27921.
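
The schema line in the diff above shows each udf call wrapped in casts, for example CAST(udf(cast(coalesce(int_col1, int_col0) as string)) AS INT), while the query output stays the same. The following is a minimal plain-Python sketch of that round trip, not the Spark test harness itself. It assumes the test UDF simply returns its string input unchanged; the names identity_udf, udf_roundtrip_int, and coalesce are hypothetical, and the sample rows are illustrative values rather than data from the test tables.

```python
from typing import Optional


def identity_udf(text: Optional[str]) -> Optional[str]:
    """Hypothetical stand-in for the test UDF: returns its string input unchanged."""
    return text


def udf_roundtrip_int(value: Optional[int]) -> Optional[int]:
    """Mimic CAST(udf(cast(value as string)) AS INT) for an int value."""
    as_text = None if value is None else str(value)       # cast(value as string)
    returned = identity_udf(as_text)                       # udf(...)
    return None if returned is None else int(returned)    # CAST(... AS INT)


def coalesce(*values: Optional[int]) -> Optional[int]:
    """SQL-style COALESCE: first non-NULL argument, or NULL if all are NULL."""
    return next((v for v in values if v is not None), None)


# Illustrative rows only (assumption): pairs of nullable ints, as an outer join might produce.
rows = [(1, None), (None, 2), (None, None)]

# Original expression: COALESCE(a, b)
plain = [coalesce(a, b) for a, b in rows]

# Converted expression: udf(COALESCE(udf(a), udf(b)))
wrapped = [
    udf_roundtrip_int(coalesce(udf_roundtrip_int(a), udf_roundtrip_int(b)))
    for a, b in rows
]

# Under the identity assumption, wrapping changes the schema text but not the values.
assert plain == wrapped
```

Under these assumptions, NULLs pass through untouched and integers survive the string round trip, which is consistent with the converted queries producing the same rows as the originals.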

SparkQA · 2019-07-11T04:30:54Z

Test build #107493 has finished for PR 25103 at commit 91d6955.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2019-07-18T00:59:45Z

Test build #107801 has finished for PR 25103 at commit 5955d46.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2019-07-18T04:03:42Z

retest this please

HyukjinKwon · 2019-07-18T04:20:22Z

Looks good to me in general if the tests pass

SparkQA · 2019-07-18T07:05:02Z

Test build #107816 has finished for PR 25103 at commit 5955d46.

This patch fails due to an unknown error code, -9.
This patch merges cleanly.
This patch adds no public classes.

huaxingao · 2019-07-18T21:07:27Z

retest this please

SparkQA · 2019-07-19T01:07:50Z

Test build #107861 has finished for PR 25103 at commit 5955d46.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2019-07-19T02:00:05Z

@huaxingao, can you update the diff comparing to the original file? Looks a bit weird.

huaxingao · 2019-07-19T02:14:09Z

diff updated. @HyukjinKwon

HyukjinKwon · 2019-07-19T03:16:22Z

Merged to master.

huaxingao · 2019-07-19T03:22:09Z

Thanks a lot for your help! @HyukjinKwon

dongjoon-hyun added PYSPARK SQL TESTS labels Jul 11, 2019

huaxingao added 2 commits July 17, 2019 12:24

[SPARK-28285][SQL][PYTHON][TESTS] Convert and port 'outer-join.sql' i…

aefdf4f

…nto UDF test base

add a few more udf

5955d46

huaxingao force-pushed the spark-28285 branch from 91d6955 to 5955d46 Compare July 17, 2019 21:52

HyukjinKwon closed this in 20578e8 Jul 19, 2019
