Push down entire hash table from HashJoinExec into scans

In https://github.com/apache/datafusion/pull/16445 we discussed two approaches:
- Push down min/max of the hash table. This is what DuckDB does and what we ended up implementing in https://github.com/apache/datafusion/pull/16445
- It's also possible for us to push down the entire hash table to prune rows during scans. This is what this issue tracks.

Some questions to answer:
- Should we always push down both the hash table and bounds?
- Should we push down the entire hash table as is or should we build bloom filters, an `IN LIST` expression or other?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Push down entire hash table from HashJoinExec into scans #17171

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Push down entire hash table from HashJoinExec into scans #17171

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions