Add max_dyn and min_dyn for max/min for dictionary array #2585

Merged Aug 25, 2022 (5 commits)

viirya
Member

@viirya viirya commented Aug 24, 2022

Which issue does this PR close?

Closes #2584.

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

github-actions (bot) added the "arrow" label (Changes to the arrow crate) on Aug 24, 2022
@viirya
Member Author

viirya commented Aug 24, 2022

cc @sunchao

arrow/src/compute/kernels/aggregate.rs (outdated)
@@ -215,6 +215,68 @@ where
}
}

/// Returns the min of values in the array.
pub fn min_dyn<T, A: ArrayAccessor<Item = T::Native>>(array: A) -> Option<T::Native>
Contributor

Typically the _dyn kernels take a trait object, but these don't appear to. Perhaps we could choose a different name?
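
To illustrate the naming concern, here is a minimal sketch of the two styles. Everything in it is a hypothetical stand-in written for this example: the `Array` trait, the `Int32Array` and `Int64Array` structs, and the functions `min_dyn_sketch` and `min_generic_sketch` are not the arrow crate's actual types or kernels. The first function takes a trait object and picks the concrete type at runtime, which is what the `_dyn` suffix usually signals; the second is generic and resolved at compile time.

```rust
use std::any::Any;

// Hypothetical type-erased array trait (an assumption for this sketch,
// not the arrow crate's `Array` trait).
trait Array {
    fn as_any(&self) -> &dyn Any;
}

// Hypothetical concrete arrays without null handling.
struct Int32Array(Vec<i32>);
struct Int64Array(Vec<i64>);

impl Array for Int32Array {
    fn as_any(&self) -> &dyn Any {
        self
    }
}

impl Array for Int64Array {
    fn as_any(&self) -> &dyn Any {
        self
    }
}

// "_dyn" style: accepts a trait object and downcasts at runtime.
// Returns None for an empty array and also for an unsupported type.
fn min_dyn_sketch(array: &dyn Array) -> Option<i64> {
    let any = array.as_any();
    if let Some(a) = any.downcast_ref::<Int32Array>() {
        a.0.iter().copied().min().map(i64::from)
    } else if let Some(a) = any.downcast_ref::<Int64Array>() {
        a.0.iter().copied().min()
    } else {
        None
    }
}

// Generic style: the element type is known at compile time,
// so no downcast is needed.
fn min_generic_sketch<T: Ord + Copy>(values: &[T]) -> Option<T> {
    values.iter().copied().min()
}
```

Under this convention, a function with the second shape would not normally carry the `_dyn` suffix, since it never sees a trait object.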

Member Author

Hmm, min_of_array?

Contributor

I wonder if we could just have a single min kernel for all types? We currently have separate implementations for strings, primitives, etc., which may no longer be needed.

Member Author

I guess we have separate min kernels for different types of arrays because there was no common accessor previously.

But the min for primitive arrays has SIMD and non-SIMD versions. I'm hesitant to replace them with this ArrayAccessor version. Do you think it is okay?

Contributor

I think let's press forward with this PR as is, but renamed to min_array, and I'll write up a ticket to look into this.

Member Author

Renamed to min_array, max_array, and sum_array.
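
For readers following the thread, the idea of a single min/max over a common accessor can be sketched roughly as below. This is a simplified illustration under stated assumptions, not the code merged in this PR: the `Accessor` trait is a hypothetical, reduced analogue of `ArrayAccessor`, and the `PrimitiveArray` and `DictionaryArray` structs here are toy types that only share names with the arrow ones. The functions reuse the names `min_array` and `max_array` from the discussion, but their signatures are assumptions.

```rust
// Hypothetical, simplified accessor trait (an assumption for this sketch;
// the real `ArrayAccessor` in arrow is defined differently).
trait Accessor {
    type Item;
    fn len(&self) -> usize;
    // Returns the value at `index`, or None if that slot is null.
    fn get(&self, index: usize) -> Option<Self::Item>;
}

// Toy primitive array: nulls are modelled as None.
struct PrimitiveArray<T> {
    values: Vec<Option<T>>,
}

impl<T: Copy> Accessor for PrimitiveArray<T> {
    type Item = T;
    fn len(&self) -> usize {
        self.values.len()
    }
    fn get(&self, index: usize) -> Option<T> {
        self.values[index]
    }
}

// Toy dictionary array: each key indexes into `values`; a None key is null.
struct DictionaryArray<T> {
    keys: Vec<Option<usize>>,
    values: Vec<T>,
}

impl<T: Copy> Accessor for DictionaryArray<T> {
    type Item = T;
    fn len(&self) -> usize {
        self.keys.len()
    }
    fn get(&self, index: usize) -> Option<T> {
        self.keys[index].map(|k| self.values[k])
    }
}

// Smallest non-null value, or None if there are no non-null values.
fn min_array<A>(array: &A) -> Option<A::Item>
where
    A: Accessor,
    A::Item: PartialOrd,
{
    let mut best: Option<A::Item> = None;
    for i in 0..array.len() {
        if let Some(v) = array.get(i) {
            if best.as_ref().map_or(true, |b| v < *b) {
                best = Some(v);
            }
        }
    }
    best
}

// Largest non-null value, or None if there are no non-null values.
fn max_array<A>(array: &A) -> Option<A::Item>
where
    A: Accessor,
    A::Item: PartialOrd,
{
    let mut best: Option<A::Item> = None;
    for i in 0..array.len() {
        if let Some(v) = array.get(i) {
            if best.as_ref().map_or(true, |b| v > *b) {
                best = Some(v);
            }
        }
    }
    best
}
```

In this sketch the dictionary case goes through the keys, so a dictionary value that no key references does not affect the result. The plain loop also makes no attempt at the SIMD path mentioned above for primitive arrays, and it does not handle floating-point NaN specially.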

)
}

fn min_max_dyn_helper<T, A: ArrayAccessor<Item = T::Native>, F, M>(
Contributor

Suggested change
- fn min_max_dyn_helper<T, A: ArrayAccessor<Item = T::Native>, F, M>(
+ fn min_max_array_helper<T, A: ArrayAccessor<Item = T::Native>, F, M>(

@viirya viirya merged commit c64ca4f into apache:master Aug 25, 2022
@ursabot

ursabot commented Aug 25, 2022

Benchmark runs are scheduled for baseline = 8eea918 and contender = c64ca4f. c64ca4f is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
Conbench compare runs links:
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on ec2-t3-xlarge-us-east-2] ec2-t3-xlarge-us-east-2
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on test-mac-arm] test-mac-arm
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on ursa-i9-9960x] ursa-i9-9960x
[Skipped ⚠️ Benchmarking of arrow-rs-commits is not supported on ursa-thinkcentre-m75q] ursa-thinkcentre-m75q
Buildkite builds:
Supported benchmarks:
ec2-t3-xlarge-us-east-2: Supported benchmark langs: Python, R. Runs only benchmarks with cloud = True
test-mac-arm: Supported benchmark langs: C++, Python, R
ursa-i9-9960x: Supported benchmark langs: Python, R, JavaScript
ursa-thinkcentre-m75q: Supported benchmark langs: C++, Java
