Skip to content

Conversation

@wangyum
Copy link
Member

@wangyum wangyum commented Jan 30, 2018

What changes were proposed in this pull request?

The SPARK-27403 fixed CTAS cannot update statistics even if spark.sql.statistics.size.autoUpdate.enabled is enabled, as mentioned in SPARK-23263.

This pr adds tests for that fix.

How was this patch tested?

N/A

@SparkQA
Copy link

SparkQA commented Jan 30, 2018

Test build #86790 has finished for PR 20430 at commit 08d31c0.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Copy link
Member

@felixcheung felixcheung left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@cloud-fan
Copy link
Contributor

CC @wzhfy

@wzhfy
Copy link
Contributor

wzhfy commented Feb 1, 2018

Can we specialize this CTAS case? For data changing commands like INSERT, I think we should remove the stats if auto update is disabled, because the previous stats are inaccurate after the insertion.

@HyukjinKwon
Copy link
Member

ping @wangyum for @wzhfy's comment.

@wangyum
Copy link
Member Author

wangyum commented Jul 16, 2018

Thanks @HyukjinKwon How about close this? CTAS has other issues , as mentioned in SPARK-24766. I will try to fix it if there is a chance.

@HyukjinKwon
Copy link
Member

Sure, please close and go ahead in the best way you could perform better.

@wangyum wangyum closed this Jul 16, 2018
@wangyum wangyum changed the title [SPARK-23263][SQL] Create table stored as parquet should update table size if automatic update table size is enabled [SPARK-23263][SQL] CTAS should update stat if autoUpdate statistics is enabled Nov 8, 2018
wangyum added 2 commits May 9, 2019 10:11
# Conflicts:
#	sql/core/src/main/scala/org/apache/spark/sql/execution/command/CommandUtils.scala
#	sql/core/src/test/scala/org/apache/spark/sql/StatisticsCollectionSuite.scala
#	sql/hive/src/test/scala/org/apache/spark/sql/hive/StatisticsSuite.scala
@wangyum wangyum reopened this May 9, 2019
@wangyum wangyum changed the title [SPARK-23263][SQL] CTAS should update stat if autoUpdate statistics is enabled [SPARK-23263][TEST] CTAS should update stat if autoUpdate statistics is enabled May 9, 2019
@SparkQA
Copy link

SparkQA commented May 9, 2019

Test build #105272 has finished for PR 20430 at commit 1c9caa0.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@wangyum
Copy link
Member Author

wangyum commented May 9, 2019

cc @dongjoon-hyun

@dongjoon-hyun
Copy link
Member

Retest this please.

Copy link
Member

@dongjoon-hyun dongjoon-hyun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@wangyum . I added two comments. Could you address them?

@SparkQA
Copy link

SparkQA commented May 13, 2019

Test build #105365 has finished for PR 20430 at commit 1c9caa0.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@wangyum
Copy link
Member Author

wangyum commented May 20, 2019

retest this please

@SparkQA
Copy link

SparkQA commented May 20, 2019

Test build #105553 has finished for PR 20430 at commit 1c9caa0.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

@wangyum
Copy link
Member Author

wangyum commented May 20, 2019

retest this please

@SparkQA
Copy link

SparkQA commented May 20, 2019

Test build #105560 has finished for PR 20430 at commit 1c9caa0.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@wangyum
Copy link
Member Author

wangyum commented May 20, 2019

retest this please

@SparkQA
Copy link

SparkQA commented May 20, 2019

Test build #105563 has finished for PR 20430 at commit 1c9caa0.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented May 27, 2019

Test build #105826 has finished for PR 20430 at commit 36fd737.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon
Copy link
Member

@wangyum, can you replace and update? I think it's close to go.

@SparkQA
Copy link

SparkQA commented Jun 14, 2019

Test build #106518 has finished for PR 20430 at commit c9949eb.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Jun 16, 2019

Test build #106555 has finished for PR 20430 at commit d4f9035.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon
Copy link
Member

Merged to master.

@wangyum wangyum deleted the SPARK-23263 branch June 20, 2019 05:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants