Is your feature request related to a problem or challenge?
It is common in databases and other analytic systems to have additional external "indexes" (perhaps stored in the "metadata catalog", perhaps stored alongside the data files, perhaps embedded in the files, perhaps elsewhere).
These indexes are used to speed up queries by "pruning": evaluating a predicate on the index and then reading only the portions of files that could pass the filters in the query. In #10546 we showed how to create an index for entire files.
I would also like to create an example of how to create such an index for row groups within a file (showing how to read it without re-reading the metadata each time).
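To make the idea concrete, here is a minimal sketch of row-group-level pruning in plain Rust. It is not the DataFusion API: `RowGroupStats`, `RowGroupIndex`, and `row_groups_matching_eq` are hypothetical names, and the index is assumed to hold min/max values for a single `i64` column, built once and kept in memory so that queries do not need to re-read the Parquet metadata.

```rust
use std::collections::HashMap;

/// Min/max values of one column within one row group (hypothetical layout).
#[derive(Debug, Clone, Copy)]
struct RowGroupStats {
    min: i64,
    max: i64,
}

/// Hypothetical external index: file name -> per-row-group statistics.
/// It is populated once, so later queries consult it instead of the file footers.
#[derive(Debug, Default)]
struct RowGroupIndex {
    files: HashMap<String, Vec<RowGroupStats>>,
}

impl RowGroupIndex {
    /// Record the statistics for every row group of `file`, in row group order.
    fn add_file(&mut self, file: &str, stats: Vec<RowGroupStats>) {
        self.files.insert(file.to_string(), stats);
    }

    /// For the predicate `column = value`, return the ordinals of the row groups
    /// in `file` that might contain matching rows. Returns `None` when the file
    /// is not in the index, in which case the caller has to scan all row groups.
    fn row_groups_matching_eq(&self, file: &str, value: i64) -> Option<Vec<usize>> {
        let stats = self.files.get(file)?;
        Some(
            stats
                .iter()
                .enumerate()
                // Keep a row group only if `value` falls inside its [min, max] range.
                .filter(|(_, s)| s.min <= value && value <= s.max)
                .map(|(ordinal, _)| ordinal)
                .collect(),
        )
    }
}
```

A caller would fill the index with `add_file` and then ask, per file, which row groups survive a predicate; only those row groups would need to be read. How such a list would be handed to the Parquet reader is outside this sketch.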
To complete this example, I think we need:
ParquetExec (enable custom indexes, finer grained pushdown) #9929
Describe the solution you'd like
No response
Describe alternatives you've considered
No response
Additional context
This is a follow-on to #10546.