-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Open
Description
This is my weekly plan, mostly for my own organizational need. I am making it public in the hopes that helps others to see what I am working on -- also I spend so much time in github the interface is very familiar to me and I can cross link all the issues I am working
Projects I am supporting actively (in priority order)
- Improve DataFusion ClickBench performance (via pushdown predicates): Enable parquet filter pushdown (
filter_pushdown) by default #3463 - Support "pre-image" for pruning predicate evaluation #18320 with @sdf-jkl
- Make
push_batch_with_filterup to 3x faster arrow-rs#8951 with @Dandandan - Speeding up planning with @askalt Avoid recomputing
PlanPropertiesredundently #19796 - Struct field pushdown with @adriangb : Allow struct field access projections to be pushed down into scans #19538
- Help push along ALP in Parquet: GH-48701: [C++][Parquet] Add ALPpd encoding arrow#48345 (and related tickets)
- new pruning logic with @2010YOUY01 feat: Prune complex/nested predicates via statistics propagation #19609
Tickets I am actively working on
- Release object store
0.13.1(maintenance) - Target Jan 2026 arrow-rs-object-store#598 - Release object store
0.12.5(maintenance) - Target Dec 2025 arrow-rs-object-store#582 - Release DataFusion 52.1.0 or 52.0.1 (minor/patch) Release (Jan 2026) #19784
- Blog post for the DataFusion
52.0.0release #19691 - Reduce overhead to create an Array from ArrayData (
make_array) arrow-rs#9061 - [Parquet] Remove uses of ArrayData in favor of Direct Array construction arrow-rs#9128
Notes to myself: a duplicate entry unchecked means I need to go back and re-review
PR review queue (rough order)
- TODO
Metadata
Metadata
Assignees
Labels
No labels