[SPARK-17732][SQL] ALTER TABLE DROP PARTITION should support comparators #15302

dongjoon-hyun · 2016-09-29T20:16:40Z

What changes were proposed in this pull request?

This PR aims to support comparators, e.g. '<', '<=', '>', '>=', again in Apache Spark 2.0 for backward compatibility.

Spark 1.6.2

scala> sql("CREATE TABLE sales(id INT) PARTITIONED BY (country STRING, quarter STRING)")
res0: org.apache.spark.sql.DataFrame = [result: string]

scala> sql("ALTER TABLE sales DROP PARTITION (country < 'KR')")
res1: org.apache.spark.sql.DataFrame = [result: string]

Spark 2.0

scala> sql("CREATE TABLE sales(id INT) PARTITIONED BY (country STRING, quarter STRING)")
res0: org.apache.spark.sql.DataFrame = []

scala> sql("ALTER TABLE sales DROP PARTITION (country < 'KR')")
org.apache.spark.sql.catalyst.parser.ParseException:
mismatched input '<' expecting {')', ','}(line 1, pos 42)

After this PR, it's supported.

How was this patch tested?

Pass the Jenkins test with a newly added testcase.

SparkQA · 2016-09-29T21:49:55Z

Test build #66123 has finished for PR 15302 at commit c6c52fe.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-09-30T00:05:32Z

Test build #66130 has finished for PR 15302 at commit b59d622.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2016-10-01T15:41:44Z

Rebased to the master.

SparkQA · 2016-10-01T17:21:52Z

Test build #66219 has finished for PR 15302 at commit eca9c86.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2016-10-01T17:53:33Z

The only failure is the following in ColumnTypeSuite. It's irrelevant and the testsuite passed locally.

[info] - MAP append/extract *** FAILED *** (2 milliseconds)
[info]   java.lang.IllegalArgumentException:
[info]   at java.nio.Buffer.position(Buffer.java:244)
[info]   at org.apache.spark.sql.catalyst.expressions.UnsafeRow.writeFieldTo(UnsafeRow.java:650)

dongjoon-hyun · 2016-10-01T17:53:39Z

Retest this please.

SparkQA · 2016-10-01T20:06:48Z

Test build #66225 has finished for PR 15302 at commit eca9c86.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2016-10-01T20:08:51Z

Hi, @hvanhovell .
Could you review this PR about 'ALTER TABLE DROP PARTITION'?

dongjoon-hyun · 2016-10-03T19:33:37Z

Hi, @hvanhovell .
Could you give some opinion about this PR when you have some time?

hvanhovell · 2016-10-03T20:19:27Z

@dongjoon-hyun I have taken a quick look. Shouldn't we just use Expressions for filtering partitions?

dongjoon-hyun · 2016-10-03T20:54:48Z

Thank you for review, @hvanhovell ! Do you mean SQL grammar or listPartition?

dongjoon-hyun · 2016-10-03T20:55:16Z

Recently, I've watched you improved those related function greatly.

hvanhovell · 2016-10-03T21:04:47Z

@dongjoon-hyun I think that AlterTableDropPartitionCommand just should take a set of catalyst Expressions instead of a PartitionRangeSpec.

I have added a few things the Catalog but I think that we shouldn't use those here. The `SessionCatalog is much more suited here.

dongjoon-hyun · 2016-10-03T21:09:13Z

I see. Then, how can evaluate the generic expression? Is it okay to use 'eval(null)'?

dongjoon-hyun · 2016-10-03T21:10:41Z

Thank you for the direction. I'll proceed to improve in that way.

SparkQA · 2016-10-03T21:23:42Z

Test build #66276 has finished for PR 15302 at commit 3c0585b.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2016-10-03T22:27:49Z

The only failure looks irrelevant. Anyway, I'm revising the PR.

[info] *** 1 SUITE ABORTED ***
[error] Error: Total 2604, Failed 0, Errors 1, Passed 2603, Ignored 48
[error] Error during tests:
[error]     org.apache.spark.sql.jdbc.JDBCWriteSuite
[error] (sql/test:test) sbt.TestsFailedException: Tests unsuccessful
[error] Total time: 718 s, completed Oct 3, 2016 2:23:41 PM

dongjoon-hyun · 2016-10-17T09:42:49Z

Hi, @hvanhovell .

When using Expression, I faced two situations.

checkAnalysis raises exceptions because the column is unresolved, e.g., country is unresolved.
As a workaround, I tried to use string literal 'country', but then optimizer ConstantFolding replaces that as false because 'country' < 'KR' is false.

ALTER TABLE sales DROP PARTITION (country < 'KR')

To avoid this situations, I can add some rule to checkAnalysis. But, it seems not a good idea. Could you give some advice for this?

dongjoon-hyun · 2016-10-17T20:00:57Z

With today's master, it's like the following. Should we use expression in AlterTableDropPartitionCommand?

org.apache.spark.sql.AnalysisException: cannot resolve '`country`' given input columns: []; line 1 pos 23;
'AlterTableDropPartitionCommand `sales`, [('country < KR)], false, false

dongjoon-hyun · 2016-10-19T20:21:58Z

Hi, @hvanhovell .
In DDL, do we have an example to use Expression like this?

dongjoon-hyun · 2016-10-31T23:04:15Z

Hi, @hvanhovell .
I made another attempt #15704 by using 'Expression' as you commented.

hvanhovell · 2016-10-31T23:28:39Z

@dongjoon-hyun I'll take a look tomorrow.

dongjoon-hyun · 2016-10-31T23:56:05Z

Thank you, @hvanhovell !

dongjoon-hyun · 2016-11-14T20:48:53Z

I'm closing this PR in favor of #15704 .

[SPARK-17732][SQL] ALTER TABLE DROP PARTITION should support comparators

3c0585b

dongjoon-hyun mentioned this pull request Oct 20, 2016

[SPARK-18028][SQL] simplify TableFileCatalog #15568

Closed

dongjoon-hyun mentioned this pull request Nov 3, 2016

[SPARK-17732][SQL] ALTER TABLE DROP PARTITION should support comparators #15704

Closed

dongjoon-hyun closed this Nov 14, 2016

dongjoon-hyun mentioned this pull request Nov 30, 2016

[SPARK-17732][SQL] ALTER TABLE DROP PARTITION should support comparators #15987

Closed

dongjoon-hyun deleted the SPARK-17732 branch January 7, 2019 07:03

[SPARK-17732][SQL] ALTER TABLE DROP PARTITION should support comparators #15302

[SPARK-17732][SQL] ALTER TABLE DROP PARTITION should support comparators #15302

Uh oh!

Conversation

dongjoon-hyun commented Sep 29, 2016

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

SparkQA commented Sep 29, 2016

Uh oh!

SparkQA commented Sep 30, 2016

Uh oh!

dongjoon-hyun commented Oct 1, 2016

Uh oh!

SparkQA commented Oct 1, 2016

Uh oh!

dongjoon-hyun commented Oct 1, 2016

Uh oh!

dongjoon-hyun commented Oct 1, 2016

Uh oh!

SparkQA commented Oct 1, 2016

Uh oh!

dongjoon-hyun commented Oct 1, 2016

Uh oh!

dongjoon-hyun commented Oct 3, 2016

Uh oh!

hvanhovell commented Oct 3, 2016

Uh oh!

dongjoon-hyun commented Oct 3, 2016

Uh oh!

dongjoon-hyun commented Oct 3, 2016

Uh oh!

hvanhovell commented Oct 3, 2016

Uh oh!

dongjoon-hyun commented Oct 3, 2016

Uh oh!

dongjoon-hyun commented Oct 3, 2016

Uh oh!

SparkQA commented Oct 3, 2016

Uh oh!

dongjoon-hyun commented Oct 3, 2016

Uh oh!

dongjoon-hyun commented Oct 17, 2016

Uh oh!

dongjoon-hyun commented Oct 17, 2016

Uh oh!

dongjoon-hyun commented Oct 19, 2016

Uh oh!

dongjoon-hyun commented Oct 31, 2016

Uh oh!

hvanhovell commented Oct 31, 2016

Uh oh!

dongjoon-hyun commented Oct 31, 2016

Uh oh!

dongjoon-hyun commented Nov 14, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants