
[Rust] Implement micro benchmarks for each operator #94

Closed · alamb opened this issue Apr 26, 2021 · 13 comments

Labels: datafusion (Changes in the datafusion crate), good first issue (Good for newcomers), help wanted (Extra attention is needed)

alamb commented Apr 26, 2021

Note: migrated from original JIRA: https://issues.apache.org/jira/browse/ARROW-9551

We should implement criterion microbenchmarks for each operator so that we can test the impact of code changes on performance and catch regressions.
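
A minimal sketch of what one such criterion bench could look like, using an in-memory table so the measured loop exercises the operator rather than I/O. The table name, data, and query are illustrative, and the SessionContext/MemTable APIs shown are from recent DataFusion and may differ across versions:

```rust
// benches/filter.rs -- hypothetical file; the bench target needs
// `harness = false` in Cargo.toml so criterion supplies its own main().
use std::sync::Arc;

use arrow::array::Int32Array;
use arrow::datatypes::{DataType, Field, Schema};
use arrow::record_batch::RecordBatch;
use criterion::{criterion_group, criterion_main, Criterion};
use datafusion::datasource::MemTable;
use datafusion::prelude::*;
use tokio::runtime::Runtime;

fn bench_filter(c: &mut Criterion) {
    let rt = Runtime::new().unwrap();
    let ctx = SessionContext::new();

    // Register an in-memory table once, outside the measured loop, so
    // the benchmark measures the operator rather than data loading.
    let schema = Arc::new(Schema::new(vec![Field::new("a", DataType::Int32, false)]));
    let batch = RecordBatch::try_new(
        schema.clone(),
        vec![Arc::new(Int32Array::from_iter_values(0..8192))],
    )
    .unwrap();
    let table = MemTable::try_new(schema, vec![vec![batch]]).unwrap();
    ctx.register_table("t", Arc::new(table)).unwrap();

    c.bench_function("filter a > 100", |b| {
        b.iter(|| {
            rt.block_on(async {
                ctx.sql("SELECT * FROM t WHERE a > 100")
                    .await
                    .unwrap()
                    .collect()
                    .await
                    .unwrap()
            })
        })
    });
}

criterion_group!(benches, bench_filter);
criterion_main!(benches);
```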

alamb added the datafusion label on Apr 26, 2021

alamb commented Apr 26, 2021

Comment from Andrew Lamb (alamb) @ 2021-04-26T12:32:40.877+0000:

Migrated to github: https://github.com/apache/arrow-rs/issues/89

houqp added the good first issue and help wanted labels on Sep 15, 2021

OscarTHZhang commented

Hi, I'd like to explore this ticket, but I wonder how and where the benchmarks should be run, and what test workload each operator should be run against?

alamb commented Jun 26, 2022

Thanks @OscarTHZhang

I think part of this ticket would be to define a reasonable test workload

Here are some examples of benches that might serve as inspiration:

Maybe the first thing to do is to take stock of the current coverage and propose some additions?

OscarTHZhang commented Aug 8, 2022

Hi @alamb,

Here are some questions on my mind:

  • At what granularity should a benchmark operate?
  • For aggregates, for example, do we also need to implement micro benchmarks for all the aggregate functions and all the physical aggregate expressions (like correlation)?

I think we can divide the micro-benches into two types (as described above):

  • Single Operator bench
  • Targeted SQL

For the aggregations, if we are going to cover them all, we can simply write targeted SQL benchmarks.
For operators that operate at column and table granularity, with output still as columns and tables, we can set up single-operator benches, such as merge, join, and filter.

How does this sound? Anything missing?
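
A minimal sketch of the "single operator bench" type described above, driving a FilterExec directly over in-memory batches with no SQL layer. Module paths and constructor signatures (MemoryExec, FilterExec, the binary/col/lit helpers) shift between DataFusion releases, so treat this as an assumption-laden outline rather than the project's actual benchmark code:

```rust
use std::sync::Arc;

use arrow::array::Int32Array;
use arrow::datatypes::{DataType, Field, Schema};
use arrow::record_batch::RecordBatch;
use criterion::{criterion_group, criterion_main, Criterion};
use datafusion::logical_expr::Operator;
use datafusion::physical_plan::collect;
use datafusion::physical_plan::expressions::{binary, col, lit};
use datafusion::physical_plan::filter::FilterExec;
use datafusion::physical_plan::memory::MemoryExec;
use datafusion::prelude::SessionContext;
use tokio::runtime::Runtime;

fn bench_filter_exec(c: &mut Criterion) {
    let rt = Runtime::new().unwrap();

    // One 8k-row Int32 batch as input; a real bench would vary sizes.
    let schema = Arc::new(Schema::new(vec![Field::new("a", DataType::Int32, false)]));
    let batch = RecordBatch::try_new(
        schema.clone(),
        vec![Arc::new(Int32Array::from_iter_values(0..8192))],
    )
    .unwrap();

    // Physical predicate `a > 100`, built without going through SQL.
    let predicate =
        binary(col("a", &schema).unwrap(), Operator::Gt, lit(100i32), &schema).unwrap();
    let task_ctx = SessionContext::new().task_ctx();

    c.bench_function("FilterExec a > 100", |b| {
        b.iter(|| {
            let input = Arc::new(
                MemoryExec::try_new(&[vec![batch.clone()]], schema.clone(), None).unwrap(),
            );
            let plan = Arc::new(FilterExec::try_new(predicate.clone(), input).unwrap());
            // collect() drives the operator to completion and returns batches.
            rt.block_on(collect(plan, task_ctx.clone())).unwrap()
        })
    });
}

criterion_group!(benches, bench_filter_exec);
criterion_main!(benches);
```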

alamb commented Aug 9, 2022

Hi @OscarTHZhang Thanks for commenting on this ticket.

I think we can divide the micro-bench into 2 types (as described above)

I think the core goal for the ticket is to ensure the vast majority of the time is spent doing the operation rather than reading data.

It might make sense to go through existing benchmarks and try to see what coverage we already have

End to end benchmarks: https://github.com/apache/arrow-datafusion/tree/master/benchmarks

more micro level benchmarks:
https://github.com/apache/arrow-datafusion/tree/master/datafusion/core/benches

There are already some benchmarks that appear to be Targeted SQL that you describe, for example https://github.com/apache/arrow-datafusion/blob/master/datafusion/core/benches/sql_planner.rs and https://github.com/apache/arrow-datafusion/blob/master/datafusion/core/benches/aggregate_query_sql.rs

There are also some benchmarks for operators that are used as part of other operations, such as https://github.com/apache/arrow-datafusion/blob/master/datafusion/core/benches/merge.rs

spencerwilson commented

Not sure how strong the suggestion of using Criterion was, but I recently discovered Divan. It may be worth evaluating.

(I have no affiliation; am just an aspiring OSS contributor browsing the good-first-issues 🙈)
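
For comparison with criterion, a Divan bench is declared with an attribute macro rather than group/main macros; a minimal sketch (the benched function body is a stand-in workload, not DataFusion code):

```rust
// Cargo.toml still needs `harness = false` for the bench target.
fn main() {
    divan::main();
}

// Divan discovers functions tagged with this attribute.
#[divan::bench]
fn sum_1000() -> u64 {
    // Stand-in workload; a real bench would exercise a DataFusion operator.
    divan::black_box((0..1000u64).sum::<u64>())
}
```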

spencerwilson commented

https://github.com/bheisler/iai could be a good fit for benchmarking those ExecutionPlan implementations that do little or no I/O. It reports not wall-clock durations but exact counts or estimates of low-level metrics:

bench_fibonacci_short
  Instructions:                1735
  L1 Accesses:                 2364
  L2 Accesses:                    1
  RAM Accesses:                   1
  Estimated Cycles:            2404

I’m not sure if there are any caveats around using it to measure async-style Rust code, though.
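
For reference, the shape of an iai benchmark, mirroring the fibonacci example from iai's README that produced the numbers above:

```rust
use iai::black_box;

fn fibonacci(n: u64) -> u64 {
    match n {
        0 | 1 => 1,
        _ => fibonacci(n - 1) + fibonacci(n - 2),
    }
}

// Each benchmark is a plain function; iai runs it under Cachegrind and
// reports instruction and cache-access counts instead of wall time.
fn bench_fibonacci_short() -> u64 {
    fibonacci(black_box(10))
}

iai::main!(bench_fibonacci_short);
```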

edmondop commented

It might make sense to go through existing benchmarks and try to see what coverage we already have …

@alamb the way this issue title is phrased, it seems the right way to address it is to extend the benchmarks you shared here (datafusion/core/benches) as micro-benchmarks. Is that correct?

mnorfolk03 commented

@alamb Is this issue still something that would be welcome? If so, I'd like to take a shot at it as my first issue.

I think I could start by implementing some microbenchmarks for the physical plan operators, such as filter, limit, union, and the different types of joins -- I didn't see any in the repo, although I may have missed them.

Let me know your thoughts, thanks!

alamb commented Oct 15, 2024

Hi @mnorfolk03 👋 -- thanks.

I think since this ticket was filed, we have moved more into "end to end" type benchmarks like in https://github.com/apache/datafusion/tree/main/benchmarks

I think joins are an area where we don't really have any great benchmarks -- we only have the TPC-H queries.

The art of writing benchmarks is choosing what to benchmark, I think, so it is often a bit hard to choose.

Perhaps you could start by creating a benchmark for physical planning (aka the process of creating the final optimized ExecutionPlan), which is not an area where we have a lot of coverage.

You could perhaps use the report on #12738 to create a planning benchmark in https://github.com/apache/datafusion/blob/main/datafusion/core/benches/sql_planner.rs ?
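
A rough sketch of what such a planning-only benchmark could look like, assuming DataFrame::create_physical_plan is used to stop before execution; the query and table setup are placeholders:

```rust
use criterion::{criterion_group, criterion_main, Criterion};
use datafusion::prelude::*;
use tokio::runtime::Runtime;

// ctx.sql() parses, plans, and optimizes the logical plan;
// create_physical_plan() then runs physical planning/optimization.
// Nothing is executed, so only planning time is measured.
fn plan(ctx: &SessionContext, rt: &Runtime, sql: &str) {
    rt.block_on(async {
        ctx.sql(sql)
            .await
            .unwrap()
            .create_physical_plan()
            .await
            .unwrap();
    });
}

fn bench_planning(c: &mut Criterion) {
    let rt = Runtime::new().unwrap();
    let ctx = SessionContext::new();
    // Table registration omitted; the query assumes a table `t` has
    // been registered (e.g. via ctx.register_table).
    c.bench_function("plan group-by", |b| {
        b.iter(|| plan(&ctx, &rt, "SELECT a, count(*) FROM t GROUP BY a"))
    });
}

criterion_group!(benches, bench_planning);
criterion_main!(benches);
```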

mnorfolk03 commented

Perhaps you could start with creating a benchmark for physical planning (aka the process of creating the final optimized ExecutionPlan) …

Thanks I'll look into it and start working on it!

alamb commented Oct 18, 2024

@askalt may have added some in #12950 -- maybe you can review the benchmarks there and see if there are others worth adding

alamb commented Oct 24, 2024

Given the lack of specificity on this ticket (it tracks a basic idea rather than any particular project, I think), I'll claim it is done for the moment.

I think a better approach is to add microbenchmarks for operators we are planning to improve.

alamb closed this as completed on Oct 24, 2024