In https://github.com/apache/datafusion/pull/16445 we discussed two approaches: - Push down min/max of the hash table. This is what DuckDB does and what we ended up implementing in https://github.com/apache/datafusion/pull/16445 - It's also possible for us to push down the entire hash table to prune rows during scans. This is what this issue tracks. Some questions to answer: - Should we always push down both the hash table and bounds? - Should we push down the entire hash table as is or should we build bloom filters, an `IN LIST` expression or other?