Skip to content

Conversation

@singhpk234
Copy link
Contributor

About the change :

Presently when doing schema conversion we were setting metadata for all the the columns, but we should add ____metadata_col in metadata column which can be used by spark to check if the column is metadata col and drop it if required like,
here :
https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/v2Commands.scala#L221-L225

Note : MetadataAttribute extractor uses the above key in attribute meta data to find if the attribute is metadata attribute or not.

This PR includes :
(i) Fix for above
(ii) Fix an existing minor typo in TestSparkSchemaUtil


Testing Done

Added an UT for the same.

cc @rdblue @aokolnychyi @jackye1995 @RussellSpitzer

@rdblue rdblue merged commit 7d6bbc4 into apache:master Jun 28, 2022
@rdblue
Copy link
Contributor

rdblue commented Jun 28, 2022

Thanks, @singhpk234!

namrathamyske pushed a commit to namrathamyske/iceberg that referenced this pull request Jul 10, 2022
namrathamyske pushed a commit to namrathamyske/iceberg that referenced this pull request Jul 10, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants