Skip to content

Conversation

@HotSushi
Copy link
Contributor

Hive queries that spawn MR jobs will work after this change because:
1) Table schema in RecordReader is now read from the inputsplit
2) HiveSerde on mappers would use input config

Also added a test class which simulates linkedin's way of storing metadata and tests testcases written upstream such as testScanEmptyTable(), testScanTable(), testJoinTables()

cc: @shardulm94

@HotSushi
Copy link
Contributor Author

With changes in #43 : IcebergRecordReader can get correct schema

With changes in #45: HiveIcebergSerde can get correct schema

@HotSushi HotSushi closed this Oct 29, 2020
@HotSushi HotSushi deleted the serialize_schema_in_inputsplit branch November 20, 2020 00:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant