@cristipp

Which issue does this PR close?

N/A

Rationale for this change

Adds a cache for ParquetMetaData. Parsing Parquet metadata is relatively expensive, so caching it improves the performance of repeated queries against the same file.
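
To illustrate the idea only, here is a minimal sketch of a metadata cache keyed by file path. It is not the code in this PR: `FileMetadata`, `MetadataCache`, and `get_or_load` are hypothetical names, and `FileMetadata` is a stand-in for the real `ParquetMetaData` type so that the example compiles with the standard library alone.

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};

/// Hypothetical stand-in for parquet's `ParquetMetaData`.
#[derive(Debug)]
pub struct FileMetadata {
    pub num_rows: usize,
}

/// Hypothetical cache mapping a file path to its parsed metadata.
pub struct MetadataCache {
    entries: Mutex<HashMap<String, Arc<FileMetadata>>>,
}

impl MetadataCache {
    pub fn new() -> Self {
        Self {
            entries: Mutex::new(HashMap::new()),
        }
    }

    /// Returns the cached metadata for `path`, calling `load` only on a miss.
    /// The lock is held while `load` runs, which keeps the sketch simple but
    /// serializes concurrent loads.
    pub fn get_or_load<F, E>(&self, path: &str, load: F) -> Result<Arc<FileMetadata>, E>
    where
        F: FnOnce() -> Result<FileMetadata, E>,
    {
        let mut entries = self.entries.lock().unwrap();
        if let Some(hit) = entries.get(path) {
            // Cache hit: share the already-parsed metadata.
            return Ok(Arc::clone(hit));
        }
        // Cache miss: parse once, store, and return a shared handle.
        let loaded = Arc::new(load()?);
        entries.insert(path.to_string(), Arc::clone(&loaded));
        Ok(loaded)
    }
}
```

With this shape, a second `get_or_load` call for the same path returns the same `Arc` without running the loader again. The sketch never evicts entries and does not detect a file that changes on disk, both of which a production cache would likely need to handle.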

What changes are included in this PR?

N/A

Are there any user-facing changes?

ParquetTable::new and ParquetExec::new take an additional argument, parquet_metadata_cache.

@cristipp changed the title from "Pmdc" to "Adds a cache for ParquetMetaData." on Apr 7, 2022
@cristipp merged commit 9caae4f into cube on Apr 26, 2022