[SPARK-18370][SQL] Add table information to InsertIntoHadoopFsRelationCommand #15832

hvanhovell · 2016-11-09T16:53:03Z

What changes were proposed in this pull request?

InsertIntoHadoopFsRelationCommand does not keep track if it inserts into a table and what table it inserts to. This can make debugging these statements problematic. This PR adds table information the InsertIntoHadoopFsRelationCommand. Explaining this SQL command insert into prq select * from range(0, 100000) now yields the following executed plan:

== Physical Plan ==
ExecutedCommand
   +- InsertIntoHadoopFsRelationCommand file:/dev/assembly/spark-warehouse/prq, ParquetFormat, <function1>, Map(serialization.format -> 1, path -> file:/dev/assembly/spark-warehouse/prq), Append, CatalogTable(
	Table: `default`.`prq`
	Owner: hvanhovell
	Created: Wed Nov 09 17:42:30 CET 2016
	Last Access: Thu Jan 01 01:00:00 CET 1970
	Type: MANAGED
	Schema: [StructField(id,LongType,true)]
	Provider: parquet
	Properties: [transient_lastDdlTime=1478709750]
	Storage(Location: file:/dev/assembly/spark-warehouse/prq, InputFormat: org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat, OutputFormat: org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat, Serde: org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe, Properties: [serialization.format=1]))
         +- Project [id#7L]
            +- Range (0, 100000, step=1, splits=None)

How was this patch tested?

Added extra checks to the ParquetMetastoreSuite

hvanhovell · 2016-11-09T16:53:12Z

cc @srinathshankar

srinathshankar

LGTM

SparkQA · 2016-11-09T17:46:33Z

Test build #68413 has finished for PR 15832 at commit dea8a57.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

hvanhovell · 2016-11-09T17:51:26Z

retest this please

SparkQA · 2016-11-09T20:22:29Z

Test build #68414 has finished for PR 15832 at commit dea8a57.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2016-11-09T20:25:25Z

Merging in master/branch-2.1.

…nCommand ## What changes were proposed in this pull request? `InsertIntoHadoopFsRelationCommand` does not keep track if it inserts into a table and what table it inserts to. This can make debugging these statements problematic. This PR adds table information the `InsertIntoHadoopFsRelationCommand`. Explaining this SQL command `insert into prq select * from range(0, 100000)` now yields the following executed plan: ``` == Physical Plan == ExecutedCommand +- InsertIntoHadoopFsRelationCommand file:/dev/assembly/spark-warehouse/prq, ParquetFormat, <function1>, Map(serialization.format -> 1, path -> file:/dev/assembly/spark-warehouse/prq), Append, CatalogTable( Table: `default`.`prq` Owner: hvanhovell Created: Wed Nov 09 17:42:30 CET 2016 Last Access: Thu Jan 01 01:00:00 CET 1970 Type: MANAGED Schema: [StructField(id,LongType,true)] Provider: parquet Properties: [transient_lastDdlTime=1478709750] Storage(Location: file:/dev/assembly/spark-warehouse/prq, InputFormat: org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat, OutputFormat: org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat, Serde: org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe, Properties: [serialization.format=1])) +- Project [id#7L] +- Range (0, 100000, step=1, splits=None) ``` ## How was this patch tested? Added extra checks to the `ParquetMetastoreSuite` Author: Herman van Hovell <hvanhovell@databricks.com> Closes #15832 from hvanhovell/SPARK-18370. (cherry picked from commit d8b81f7) Signed-off-by: Reynold Xin <rxin@databricks.com>

…nCommand ## What changes were proposed in this pull request? `InsertIntoHadoopFsRelationCommand` does not keep track if it inserts into a table and what table it inserts to. This can make debugging these statements problematic. This PR adds table information the `InsertIntoHadoopFsRelationCommand`. Explaining this SQL command `insert into prq select * from range(0, 100000)` now yields the following executed plan: ``` == Physical Plan == ExecutedCommand +- InsertIntoHadoopFsRelationCommand file:/dev/assembly/spark-warehouse/prq, ParquetFormat, <function1>, Map(serialization.format -> 1, path -> file:/dev/assembly/spark-warehouse/prq), Append, CatalogTable( Table: `default`.`prq` Owner: hvanhovell Created: Wed Nov 09 17:42:30 CET 2016 Last Access: Thu Jan 01 01:00:00 CET 1970 Type: MANAGED Schema: [StructField(id,LongType,true)] Provider: parquet Properties: [transient_lastDdlTime=1478709750] Storage(Location: file:/dev/assembly/spark-warehouse/prq, InputFormat: org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat, OutputFormat: org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat, Serde: org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe, Properties: [serialization.format=1])) +- Project [id#7L] +- Range (0, 100000, step=1, splits=None) ``` ## How was this patch tested? Added extra checks to the `ParquetMetastoreSuite` Author: Herman van Hovell <hvanhovell@databricks.com> Closes apache#15832 from hvanhovell/SPARK-18370.

Add table information to InsertIntoHadoopFsRelationCommand.

dea8a57

srinathshankar approved these changes Nov 9, 2016

View reviewed changes

asfgit closed this in d8b81f7 Nov 9, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-18370][SQL] Add table information to InsertIntoHadoopFsRelationCommand #15832

[SPARK-18370][SQL] Add table information to InsertIntoHadoopFsRelationCommand #15832

Uh oh!

hvanhovell commented Nov 9, 2016

Uh oh!

hvanhovell commented Nov 9, 2016

Uh oh!

srinathshankar left a comment

Uh oh!

SparkQA commented Nov 9, 2016

Uh oh!

hvanhovell commented Nov 9, 2016

Uh oh!

SparkQA commented Nov 9, 2016

Uh oh!

rxin commented Nov 9, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[SPARK-18370][SQL] Add table information to InsertIntoHadoopFsRelationCommand #15832

[SPARK-18370][SQL] Add table information to InsertIntoHadoopFsRelationCommand #15832

Uh oh!

Conversation

hvanhovell commented Nov 9, 2016

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

hvanhovell commented Nov 9, 2016

Uh oh!

srinathshankar left a comment

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Nov 9, 2016

Uh oh!

hvanhovell commented Nov 9, 2016

Uh oh!

SparkQA commented Nov 9, 2016

Uh oh!

rxin commented Nov 9, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants