Skip to content

Commit 9726699

Browse files
avantgardneriojoroKr21Dandandanthinkharderdevfsdvh
authored
Upgrade to DataFusion 44 (#368)
* Get working build * Add pool_size method to MemoryPool (#218) (#230) * Add pool_size method to MemoryPool * Fix * Fmt Co-authored-by: Daniël Heres <danielheres@gmail.com> * Respect `IGNORE NULLS` flag in `ARRAY_AGG` (#260) (apache#15544) v48 * Hook for doing distributed `CollectLeft` joins (#269) * Ignore writer shutdown error (#271) * ignore writer shutdown error * cargo check * Fix bug in `swap_hash_join` (#278) * Try and fix swap_hash_join * Only swap projections when join does not have projections * just backport upstream fix * remove println * Support Duration in min/max agg functions (#283) (apache#15310) v47 * Support Duration in min/max agg functions * Attempt to fix build * Attempt to fix build - Fix chrono version * Revert "Attempt to fix build - Fix chrono version" This reverts commit fd76fe6. * Revert "Attempt to fix build" This reverts commit 9114b86. --------- Co-authored-by: svranesevic <svranesevic@users.noreply.github.com> * Fix panics in array_union (#287) (apache#15149) v48 * Drop rust-toolchain * Fix panics in array_union * Fix the chrono * Backport `GroupsAccumulator` for Duration min/max agg (#288) (apache#15322) v47 * Fix array_sort for empty record batch (#290) (apache#15149) v48 * fix: rewrite fetch, skip of the Limit node in correct order (apache#14496) v46 * fix: rewrite fetch, skip of the Limit node in correct order * style: fix clippy * Support aliases in ConstEvaluator (apache#14734) (#281) v46 * Support aliases in ConstEvaluator (apache#14734) Not sure why they are not supported. It seems that if we're not careful, some transformations can introduce aliases nested inside other expressions. * Format Cargo.toml * Preserve the name of grouping sets in SimplifyExpressions (#282) (apache#14888) v46 Whenever we use `recompute_schema` or `with_exprs_and_inputs`, this ensures that we obtain the same schema. * Support Duration in min/max agg functions (#284) (apache#15310) v47 Co-authored-by: svranesevic <svranesevic@users.noreply.github.com> * fix case_column_or_null with nullable when conditions (apache#13886) v45 * fix case_column_or_null with nullable when conditions * improve sqllogictests for case_column_or_null --------- Co-authored-by: zhangli20 <zhangli20@kuaishou.com> * Fix casewhen (apache#14156) v45 * Cherry-pick topk limit pushdown fix (apache#14192) v45 * fix: FULL OUTER JOIN and LIMIT produces wrong results (apache#14338) v45 * fix: FULL OUTER JOIN and LIMIT produces wrong results * Fix minor slt testing * fix test (cherry picked from commit ecc5694) * Cherry-pick global limit fix (apache#14245) v45 * fix: Limits are not applied correctly (apache#14418) v46 * fix: Limits are not applied correctly * Add easy fix * Add fix * Add slt testing * Address comments * Disable grouping set in CSE * Fix spm + limit (apache#14569) v46 * prost 0.13 / fix parquet dep * Delete unreliable checks * Segfault in ByteGroupValueBuilder (#294) (apache#15968) v50 * test to demonstrate segfault in ByteGroupValueBuilder * check for offset overflow * clippy (cherry picked from commit 5bdaeaf) * Update arrow dependency to include rowid (#295) * Update arrow version * Feat: Add fetch to CoalescePartitionsExec (apache#14499) (#298) v46 * add fetch info to CoalescePartitionsExec * use Statistics with_fetch API on CoalescePartitionsExec * check limit_reached only if fetch is assigned Co-authored-by: mertak-synnada <mertak67+synaada@gmail.com> * Fix `CoalescePartitionsExec` proto serialization (apache#15824) (#299) v48 * add fetch to CoalescePartitionsExecNode * gen proto code * Add test * fix * fix build * Fix test build * remove comments Co-authored-by: 张林伟 <lewiszlw520@gmail.com> * Add JoinContext with JoinLeftData to TaskContext in HashJoinExec (#300) * Add JoinContext with JoinLeftData to TaskContext in HashJoinExec * Expose random state as const * re-export ahash::RandomState * JoinContext default impl * Add debug log when setting join left data * Update arrow version for not preserving dict_id (#303) * Use partial aggregation schema for spilling to avoid column mismatch in GroupedHashAggregateStream (apache#13995) (#302) v45 * Refactor spill handling in GroupedHashAggregateStream to use partial aggregate schema * Implement aggregate functions with spill handling in tests * Add tests for aggregate functions with and without spill handling * Move test related imports into mod test * Rename spill pool test functions for clarity and consistency * Refactor aggregate function imports to use fully qualified paths * Remove outdated comments regarding input batch schema for spilling in GroupedHashAggregateStream * Update aggregate test to use AVG instead of MAX * assert spill count * Refactor partial aggregate schema creation to use create_schema function * Refactor partial aggregation schema creation and remove redundant function * Remove unused import of Schema from arrow::datatypes in row_hash.rs * move spill pool testing for aggregate functions to physical-plan/src/aggregates * Use Arc::clone for schema references in aggregate functions (cherry picked from commit 81b50c4) Co-authored-by: kosiew <kosiew@gmail.com> * Update tag * Push limits past windows (#337) (apache#17347) v50 * Restore old method for DQE * feat(optimizer): Enable filter pushdown on window functions (apache#14026) v45 * Avoid Aliased Window Expr Enter Unreachable Code (apache#14109) v45 (cherry picked from commit fda500a) * Use `Expr::qualified_name()` and `Column::new()` to extract partition keys from window and aggregate operators (#355) (apache#17757) v51 * Update PR template to be relevant to our fork * Make limit pushdown work for SortPreservingMergeExec (apache#17893) (#361) * re-publicise functions DQE relies on * Handle columns in with_new_exprs with a Join (apache#15055) (#384) apache#15055 * handle columns in with_new_exprs with Join * test doesn't return result * take join from result * clippy * make test fallible * accept any pair of expression for new_on in with_new_exprs for Join * use with_capacity Co-authored-by: delamarch3 <68732277+delamarch3@users.noreply.github.com> --------- Co-authored-by: Georgi Krastev <georgi.krastev@coralogix.com> Co-authored-by: Daniël Heres <danielheres@gmail.com> Co-authored-by: Dan Harris <1327726+thinkharderdev@users.noreply.github.com> Co-authored-by: Faiaz Sanaulla <105630300+fsdvh@users.noreply.github.com> Co-authored-by: Sava Vranešević <20240220+svranesevic@users.noreply.github.com> Co-authored-by: svranesevic <svranesevic@users.noreply.github.com> Co-authored-by: Yingwen <realevenyag@gmail.com> Co-authored-by: Zhang Li <richselian@gmail.com> Co-authored-by: zhangli20 <zhangli20@kuaishou.com> Co-authored-by: Aleksey Kirilishin <54231417+avkirilishin@users.noreply.github.com> Co-authored-by: xudong.w <wxd963996380@gmail.com> Co-authored-by: Qi Zhu <821684824@qq.com> Co-authored-by: Martins Purins <martins.purins@coralogix.com> Co-authored-by: mertak-synnada <mertak67+synaada@gmail.com> Co-authored-by: 张林伟 <lewiszlw520@gmail.com> Co-authored-by: kosiew <kosiew@gmail.com> Co-authored-by: nuno-faria <nunofpfaria@gmail.com> Co-authored-by: Berkay Şahin <124376117+berkaysynnada@users.noreply.github.com> Co-authored-by: Mason Hall <mason.hall@coralogix.com> Co-authored-by: delamarch3 <68732277+delamarch3@users.noreply.github.com>
1 parent 3cc3fca commit 9726699

File tree

89 files changed

+4204
-1808
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

89 files changed

+4204
-1808
lines changed

.github/actions/setup-builder/action.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ runs:
3737
run: |
3838
RETRY=("ci/scripts/retry" timeout 120)
3939
echo "Installing ${{ inputs.rust-version }}"
40-
"${RETRY[@]}" rustup toolchain install ${{ inputs.rust-version }}
40+
"${RETRY[@]}" rustup toolchain install ${{ inputs.rust-version }} --no-self-update
4141
"${RETRY[@]}" rustup default ${{ inputs.rust-version }}
4242
"${RETRY[@]}" rustup component add rustfmt
4343
- name: Configure rust runtime env

.github/actions/setup-macos-aarch64-builder/action.yaml

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -39,8 +39,7 @@ runs:
3939
- name: Setup Rust toolchain
4040
shell: bash
4141
run: |
42-
rustup update stable
43-
rustup toolchain install stable
42+
rustup toolchain install stable --no-self-update
4443
rustup default stable
4544
rustup component add rustfmt
4645
- name: Configure rust runtime env

.github/actions/setup-macos-builder/action.yaml

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -39,8 +39,7 @@ runs:
3939
- name: Setup Rust toolchain
4040
shell: bash
4141
run: |
42-
rustup update stable
43-
rustup toolchain install stable
42+
rustup toolchain install stable --no-self-update
4443
rustup default stable
4544
rustup component add rustfmt
4645
- name: Configure rust runtime env

.github/actions/setup-rust-runtime/action.yaml

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -20,8 +20,6 @@ description: 'Setup Rust Runtime Environment'
2020
runs:
2121
using: "composite"
2222
steps:
23-
- name: Run sccache-cache
24-
uses: mozilla-actions/sccache-action@v0.0.4
2523
- name: Configure runtime env
2624
shell: bash
2725
# do not produce debug symbols to keep memory usage down
@@ -31,8 +29,6 @@ runs:
3129
# Set debuginfo=line-tables-only as debuginfo=0 causes immensely slow build
3230
# See for more details: https://github.com/rust-lang/rust/issues/119560
3331
run: |
34-
echo "RUSTC_WRAPPER=sccache" >> $GITHUB_ENV
35-
echo "SCCACHE_GHA_ENABLED=true" >> $GITHUB_ENV
3632
echo "RUST_BACKTRACE=1" >> $GITHUB_ENV
3733
echo "RUSTFLAGS=-C debuginfo=line-tables-only -C incremental=false" >> $GITHUB_ENV
3834

.github/pull_request_template.md

Lines changed: 5 additions & 36 deletions
Original file line numberDiff line numberDiff line change
@@ -1,40 +1,9 @@
1-
## Which issue does this PR close?
1+
<!--- Replace this ticket id with a real ticket when not working on a hotfix -->
2+
[VTX-9685]
23

3-
<!--
4-
We generally require a GitHub issue to be filed for all bug fixes and enhancements and this helps us generate change logs for our releases. You can link an issue to this PR using the GitHub syntax. For example `Closes #123` indicates that this PR will close issue #123.
5-
-->
4+
Upstream PR #: <!-- please try to make an upstream PR and put the number here, even if it isn't merged yet -->
65

7-
Closes #.
6+
or
87

9-
## Rationale for this change
8+
[ ] This PR is for Coralogix only (please document why in your commit message)
109

11-
<!--
12-
Why are you proposing this change? If this is already explained clearly in the issue then this section is not needed.
13-
Explaining clearly why changes are proposed helps reviewers understand your changes and offer better suggestions for fixes.
14-
-->
15-
16-
## What changes are included in this PR?
17-
18-
<!--
19-
There is no need to duplicate the description in the issue here but it is sometimes worth providing a summary of the individual changes in this PR.
20-
-->
21-
22-
## Are these changes tested?
23-
24-
<!--
25-
We typically require tests for all PRs in order to:
26-
1. Prevent the code from being accidentally broken by subsequent changes
27-
2. Serve as another way to document the expected behavior of the code
28-
29-
If tests are not included in your PR, please explain why (for example, are they covered by existing tests)?
30-
-->
31-
32-
## Are there any user-facing changes?
33-
34-
<!--
35-
If there are user-facing changes then we may require documentation to be updated before approving the PR.
36-
-->
37-
38-
<!--
39-
If there are any breaking changes to public APIs, please add the `api change` label.
40-
-->

.github/workflows/audit.yml

Lines changed: 0 additions & 43 deletions
This file was deleted.

.github/workflows/dependencies.yml

Lines changed: 0 additions & 52 deletions
This file was deleted.

.github/workflows/dev.yml

Lines changed: 0 additions & 51 deletions
This file was deleted.

.github/workflows/large_files.yml

Lines changed: 0 additions & 55 deletions
This file was deleted.

0 commit comments

Comments
 (0)