[SPARK-28038][SQL][TEST] Port text.sql #24862

wangyum · 2019-06-13T12:46:02Z

What changes were proposed in this pull request?

This PR ports text.sql from the PostgreSQL regression tests: https://github.com/postgres/postgres/blob/REL_12_BETA2/src/test/regress/sql/text.sql

The expected results can be found in the link: https://github.com/postgres/postgres/blob/REL_12_BETA2/src/test/regress/expected/text.out

While porting the test cases, I found one PostgreSQL-specific feature that does not exist in Spark SQL:
SPARK-28037: Add built-in String Functions: quote_literal

I also found three inconsistent behaviors:
SPARK-27930: Spark SQL's format_string can not fully support PostgreSQL's format
SPARK-28036: Built-in udf left/right has inconsistent behavior
SPARK-28033: String concatenation should low priority than other operators

How was this patch tested?

N/A

sql/core/src/test/resources/sql-tests/inputs/pgSQL/text.sql

SparkQA · 2019-06-13T14:50:53Z

Test build #106464 has finished for PR 24862 at commit 1068a10.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2019-07-19T12:49:25Z

Test build #107897 has finished for PR 24862 at commit 4be0f9e.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2019-07-19T13:50:00Z

Test build #107899 has finished for PR 24862 at commit c9d3d16.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2019-07-30T00:38:28Z

Retest this please.

maropu · 2019-07-30T01:06:20Z

sql/core/src/test/resources/sql-tests/inputs/pgSQL/text.sql

+-- As of 8.3 we have removed most implicit casts to text, so that for example
+-- this no longer works:
+-- Spark SQL implicit cast integer to string
+select length(42);


FYI: If we strictly follow ANSI SQL, this implicit cast is not allowed, the same as in PostgreSQL.
cc: @gengliangwang

Is this casting an integer to a string? If so, I think it is allowed in ANSI SQL as an up-cast.

maropu · 2019-07-30T01:14:01Z

sql/core/src/test/resources/sql-tests/inputs/pgSQL/text.sql

+-- an unknown literal.  So these work:
+-- [SPARK-28033] String concatenation low priority than other arithmeticBinary
+select string('four: ') || 2+2;
+select string('four: ') || 2+2;


nit: duplicate test

Update it to select 'four: ' || 2+2;?

https://github.com/postgres/postgres/blob/REL_12_BETA2/src/test/regress/sql/text.sql#L25-L26
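To make the precedence issue behind SPARK-28033 concrete, here is a small Python sketch of the two ways `'four: ' || 2+2` can be grouped. The `concat` helper is hypothetical and only models `||` as text concatenation. The second grouping assumes `||` and `+` share one precedence level and associate left to right; that is an assumption for illustration, not something stated in this thread.

```python
def concat(a, b):
    """Hypothetical stand-in for SQL's || operator: render both operands as text."""
    return f"{a}{b}"


# PostgreSQL grouping: + binds tighter than ||, so 2+2 is evaluated first.
postgres_grouping = concat("four: ", 2 + 2)

# Assumed equal-precedence, left-to-right grouping: ('four: ' || 2) + 2.
# Only the intermediate string is computed here; adding 2 to it is no longer
# plain arithmetic, which is why the two groupings can disagree.
intermediate = concat("four: ", 2)

print(postgres_grouping)
print(intermediate)
```

Writing the test as `'four: ' || (2+2)` would pin the grouping explicitly, at the cost of no longer exercising the default precedence.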

maropu · 2019-07-30T01:24:12Z

sql/core/src/test/resources/sql-tests/inputs/pgSQL/text.sql

+select concat_ws(NULL,10,20,null,30) is null;
+select reverse('abcde');
+-- [SPARK-28036] Built-in udf left/right has inconsistent behavior
+-- select i, left('ahoj', i), right('ahoj', i) from range(-5, 5) t(i) order by i;


Why did you comment this out? (I just want to check the current output...)

Because of ANSI mode:

spark-sql> select left('12345', 2);
12
spark-sql> set spark.sql.parser.ansi.enabled=true;
spark.sql.parser.ansi.enabled true
spark-sql> select left('12345', 2);
Error in query:
no viable alternative at input 'left'(line 1, pos 7)

== SQL ==
select left('12345', 2)
-------^^^

https://issues.apache.org/jira/browse/SPARK-28479

The output with ANSI mode disabled:

Spark SQL:

spark-sql> select i, left('ahoj', i), right('ahoj', i) from range(-5, 6) t(i) order by i;
-5
-4
-3
-2
-1
0
1   a     j
2   ah    oj
3   aho   hoj
4   ahoj  ahoj
5   ahoj  ahoj

PostgreSQL:

postgres=# select i, left('ahoj', i), right('ahoj', i) from generate_series(-5, 5) t(i) order by i;
 i  | left | right
----+------+-------
 -5 |      |
 -4 |      |
 -3 | a    | j
 -2 | ah   | oj
 -1 | aho  | hoj
  0 |      |
  1 | a    | j
  2 | ah   | oj
  3 | aho  | hoj
  4 | ahoj | ahoj
  5 | ahoj | ahoj
(11 rows)

Ah, I see. Can you temporarily turn off the mode for this query?
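The difference between the two outputs in this thread can be summarized with a small Python model. This is only a sketch that reproduces the rows shown above for 'ahoj'; the function names are hypothetical, and it is not a description of how either engine implements left/right internally.

```python
def pg_left(s: str, n: int) -> str:
    """PostgreSQL-style left(): a negative n returns all but the last |n| characters."""
    if n >= 0:
        return s[:n]
    return s[:max(len(s) + n, 0)]


def pg_right(s: str, n: int) -> str:
    """PostgreSQL-style right(): a negative n returns all but the first |n| characters."""
    if n >= 0:
        return s if n >= len(s) else s[len(s) - n:]
    return s[-n:]


def spark_left(s: str, n: int) -> str:
    """Model of the Spark SQL output in this thread: n <= 0 yields an empty string."""
    return s[:n] if n > 0 else ""


def spark_right(s: str, n: int) -> str:
    """Model of the Spark SQL output in this thread: n <= 0 yields an empty string."""
    return s[-n:] if n > 0 else ""


# Print one row per i, mirroring the queries in the thread.
for i in range(-5, 6):
    print(i, repr(pg_left("ahoj", i)), repr(pg_right("ahoj", i)),
          repr(spark_left("ahoj", i)), repr(spark_right("ahoj", i)))
```

The two models agree for every non-negative i and diverge only for i = -3, -2, and -1, which is the inconsistency tracked in SPARK-28036.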

SparkQA · 2019-07-30T04:04:10Z

Test build #108355 has finished for PR 24862 at commit c9d3d16.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2019-07-30T14:17:30Z

Test build #108382 has finished for PR 24862 at commit 10204c6.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2019-07-31T02:36:09Z

Merged to master.

wangyum added 2 commits June 13, 2019 12:44

Add text.sql

304c5c7

Add result

1068a10

wangyum commented Jun 13, 2019

dongjoon-hyun added TEST SQL TESTS and removed TEST labels Jun 13, 2019

wangyum added 2 commits July 18, 2019 11:06

Merge remote-tracking branch 'upstream/master' into SPARK-28038

4be0f9e

Move to REL_12_BETA2

c9d3d16

wangyum changed the title ~~[WIP][SPARK-28038][SQL][TEST] Port text.sql~~ [SPARK-28038][SQL][TEST] Port text.sql Jul 19, 2019

maropu reviewed Jul 30, 2019

Add [SPARK-28479] Parser error when enabling ANSI mode

10204c6

maropu approved these changes Jul 31, 2019

HyukjinKwon closed this in 261e113 Jul 31, 2019

wangyum deleted the SPARK-28038 branch July 31, 2019 02:42
