[SPARK-2489][SQL] Support Parquet's optional fixed_len_byte_array #35902
This pull request is a follow-up to #20826,
which was closed because of inactivity.
What changes were proposed in this pull request?
This PR adds support for reading Parquet FIXED_LEN_BYTE_ARRAY columns
as Binary columns when no OriginalType is specified.
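
To illustrate the rule described above, here is a minimal sketch in plain Python. It is a simplified model rather than Spark's actual converter code: the function name `spark_type_for_fixed_len_byte_array` and the string type names are hypothetical, and the DECIMAL branch is included only for contrast with the unannotated case.

```python
from typing import Optional


def spark_type_for_fixed_len_byte_array(
    original_type: Optional[str], precision: int = 0, scale: int = 0
) -> str:
    """Return a Spark SQL type name for a Parquet FIXED_LEN_BYTE_ARRAY column.

    Hypothetical helper for illustration only; not part of Spark's API.
    """
    if original_type is None:
        # The case this PR addresses: no annotation, so expose the raw bytes.
        return "BinaryType"
    if original_type == "DECIMAL":
        # Annotated fixed-length arrays keep their annotated interpretation.
        return f"DecimalType({precision},{scale})"
    # Any other annotation is left unsupported in this sketch.
    raise ValueError(f"unsupported annotation: {original_type}")
```

In this model, a column written from an Avro fixed type carries no annotation, so it takes the first branch and is read as binary.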
Why are the changes needed?
Parquet-avro writes the Avro fixed type as a Parquet
FIXED_LEN_BYTE_ARRAY type. Currently, Spark SQL throws an
exception when it tries to load Parquet files that contain
a column of this type.
Does this PR introduce any user-facing change?
No.
How was this patch tested?
Unit tests were added to cover the change.
I also tested the change locally and am now able to load
the Parquet files as expected.