Improve page index metadata loading in SerializedFileReader::new_with_options
#6491
Labels
enhancement
Any new improvement worthy of a entry in the changelog
parquet
Changes to the parquet crate
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
As @etseidl pointed out in https://github.com/apache/arrow-rs/pull/6466/files#r1778966728
We can use the new ParquetMetaDataLoader API to read the page indexes in more efficiently (fewer IOs for example)
However, when I tried to implement it, we caught what appears to be a subtle bug -- specifically that the predicates would have been ignored: https://github.com/apache/arrow-rs/pull/6466/files#r1783526090 -- no tests failed.
Describe the solution you'd like
I would like to:
SerializedFileReader::new_with_options
, and clean up the code to use the new ParquetMetaDataReaderDescribe alternatives you've considered
leave as is
Additional context
The text was updated successfully, but these errors were encountered: