
Conversation


@wangyum wangyum commented Aug 30, 2018

What changes were proposed in this pull request?

This PR fixes FileFormatWriter so that its dataSchema respects the schema of the Hive table. Without this fix there are two issues (an illustrative reproduction sketch follows the list):

  1. An exception is thrown (this can be reproduced by the added test case):
java.util.NoSuchElementException: None.get
	at scala.None$.get(Option.scala:347)
	at scala.None$.get(Option.scala:345)
	at org.apache.spark.sql.execution.datasources.FileFormatWriter$$anonfun$3$$anonfun$4.apply(FileFormatWriter.scala:87)
	at org.apache.spark.sql.execution.datasources.FileFormatWriter$$anonfun$3$$anonfun$4.apply(FileFormatWriter.scala:87)
  2. The schema of the Hive table does not match the schema of the written Parquet file.
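
To make the failure mode concrete, here is a hedged reproduction sketch. The table names, column names, and the assumption that the mismatch comes from column-name casing are all made up for illustration and are not taken from the test case added by this PR:

```scala
// Hypothetical sketch only: the names and the exact source of the schema
// mismatch (column-name casing) are assumptions, not this PR's actual test.
import org.apache.spark.sql.SparkSession

object HiveSchemaMismatchSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("FileFormatWriter should respect the Hive table schema")
      .enableHiveSupport()
      .getOrCreate()

    // A Hive Parquet table whose metastore schema uses upper-case column names.
    spark.sql("CREATE TABLE demo_tbl (ID bigint, NAME string) STORED AS PARQUET")

    // Source data whose query output schema uses lower-case column names.
    spark.range(10)
      .selectExpr("id", "concat('name_', id) AS name")
      .createOrReplaceTempView("demo_src")

    // If FileFormatWriter derives its dataSchema from the query output rather
    // than the Hive table schema, a by-name column lookup can find no match and
    // fail with java.util.NoSuchElementException: None.get, or the written
    // Parquet files can carry a schema that differs from the Hive table schema.
    spark.sql("INSERT OVERWRITE TABLE demo_tbl SELECT id, name FROM demo_src")

    spark.stop()
  }
}
```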

How was this patch tested?

  • Unit test verifying that FileFormatWriter respects the schema of the Hive table (an illustrative sketch of such a test follows this list).
  • Manual tests verifying that this change does not break the UI issue fixed by SPARK-22834:
    (screenshot attached in the original PR)
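
A hedged sketch of what such a regression test could look like, assuming a suite that mixes in Spark's existing QueryTest, SQLTestUtils, and TestHiveSingleton test helpers; the suite name, test name, and table layout are illustrative and may differ from the test actually added by this PR:

```scala
// Illustrative only: suite name, test name, and table are assumptions;
// the real test added by this PR may be set up differently.
import org.apache.spark.sql.{QueryTest, Row}
import org.apache.spark.sql.hive.test.TestHiveSingleton
import org.apache.spark.sql.test.SQLTestUtils

class FileFormatWriterHiveSchemaSuite
  extends QueryTest with SQLTestUtils with TestHiveSingleton {

  test("FileFormatWriter should respect the Hive table schema") {
    withTable("demo_tbl") {
      sql("CREATE TABLE demo_tbl (ID bigint) STORED AS PARQUET")
      // Before the fix, this insert could fail with None.get in FileFormatWriter
      // or write Parquet files whose schema differs from the Hive table schema.
      sql("INSERT OVERWRITE TABLE demo_tbl SELECT id FROM range(10)")
      checkAnswer(sql("SELECT count(*) FROM demo_tbl"), Row(10L))
    }
  }
}
```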


SparkQA commented Aug 30, 2018

Test build #95483 has finished for PR 22287 at commit b54953a.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.
