I recently gave a talk about using DataFusion to build specialized indexes for querying Parquet: https://www.youtube.com/watch?v=74YsJT1-Rdk I think it would also make a nice blog post on the datafusion site, so I am filing a ticket to track doing so