Skip to content

Commit

Permalink
Implement merge join (#2242)
Browse files Browse the repository at this point in the history
* Implement Sort-Merge join (#141)

* Complete doc comments and pass cargo clippy

* Implement metrics for SMJ

* Support join columns with different sort options

* Update datafusion/core/src/physical_plan/sort_merge_join.rs

Add detailed comments of the ordering requirements of two input children.

Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>

* use indices instead of ArrayBuilders for constructing output record batches

* Support timestamp/decimal types in join columns

* Add fuzz test and fix edge cases

* Support float32/64 data types in comparison

* Fix lint issues

Co-authored-by: zhangli20 <zhangli20@kuaishou.com>
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
  • Loading branch information
3 people authored Apr 22, 2022
1 parent d97cd3b commit 8867353
Show file tree
Hide file tree
Showing 3 changed files with 1,974 additions and 0 deletions.
1 change: 1 addition & 0 deletions datafusion/core/src/physical_plan/mod.rs
Original file line number Diff line number Diff line change
Expand Up @@ -566,6 +566,7 @@ pub mod metrics;
pub mod planner;
pub mod projection;
pub mod repartition;
pub mod sort_merge_join;
pub mod sorts;
pub mod stream;
pub mod type_coercion;
Expand Down
Loading

0 comments on commit 8867353

Please sign in to comment.