FIX : some benchmarks are failing #15367

getChan · 2025-03-23T16:24:08Z

Which issue does this PR close?

Closes distinct_query_sql benchmark is failing #15213 .

Rationale for this change

It is not certain, but it seems that plan creation and collect() should share the same runtime.. It is presumed that the issue occurred because RepartitionExec lazily polls within the runtime.

I will add more details if I find anything additional.

What changes are included in this PR?

Move the Runtime::new() intobench_function
Ensure that plan creation (ctx.sql()) and collect() share the same runtime.

Are these changes tested?

yes. below test are succeded

cargo bench -p datafusion --bench topk_aggregate
cargo bench -p datafusion --bench distinct_query_sql

Are there any user-facing changes?

No.

alamb

Thanks for this @getChan

alamb · 2025-03-25T20:59:54Z

datafusion/core/benches/distinct_query_sql.rs

-        |b| b.iter(|| run(distinct_trace_id_100_partitions_100_000_samples_limit_100.0.clone(),
-                                   distinct_trace_id_100_partitions_100_000_samples_limit_100.1.clone())),
+        |b| b.iter(|| {
+            let rt = Runtime::new().unwrap();


I think this means that the benchmark will include the time to create each tokio runtime (with a bunch of threads, etc)

To avoid this I think you can create the runtime once, and then use it for each tieration:

let rt = Runtime::new().unwrap(); c.bench_function( format!("distinct query with {} partitions and {} samples per partition with limit {}", partitions, samples, limit).as_str(), |b| b.iter(|| {

Got it. Changed the runtime to be shared between loops.

However, it seems that other benchmark codes still have runtime creation within the iteration.

Filed as #15507

2010YOUY01 · 2025-03-30T08:56:13Z

Thank you for the fix! I noticed there are two other tests panicked on the same line of source code, is this fix still applicable?

Running sqllogictest with sqlite check: https://github.com/apache/datafusion/actions/runs/14152508679/job/39647603410
custom_datasource example: https://github.com/apache/datafusion/actions/runs/14152369014/job/39647355093

getChan · 2025-03-30T12:00:17Z

Thank you for the fix! I noticed there are two other tests panicked on the same line of source code, is this fix still applicable?

Running sqllogictest with sqlite check: https://github.com/apache/datafusion/actions/runs/14152508679/job/39647603410

custom_datasource example: https://github.com/apache/datafusion/actions/runs/14152369014/job/39647355093

@2010YOUY01
This PR is to fix the bench test code. The partition not used yet error can occur depending on the execution environment of each test (it is not likely an error in repartitionExec). Therefore, this PR does not solve the issue you mentioned.

alamb

Thank you @getChan and @Omega359

alamb · 2025-03-31T18:28:41Z

Let's get this in and keep iterating on the benchmarks

* distinct_query_sql, topk_aggregate * cargo clippy * cargo fmt * share runtime

github-actions bot added the core Core DataFusion crate label Mar 23, 2025

alamb reviewed Mar 25, 2025

View reviewed changes

getChan added 3 commits March 27, 2025 00:33

distinct_query_sql, topk_aggregate

69b8e4a

cargo clippy

71eefa7

cargo fmt

c36e781

getChan force-pushed the fix-repartition-bench-bug branch from c7414b2 to c36e781 Compare March 26, 2025 15:37

github-actions bot removed functions Changes to functions implementation datasource Changes to the datasource crate labels Mar 26, 2025

share runtime

0365d70

This was referenced Mar 31, 2025

bench: Extract tokio Runtime creation from benchmarking functions #15507

Closed

Extract tokio runtime creation from hot loop in benchmarks #15508

Merged

alamb approved these changes Mar 31, 2025

View reviewed changes

alamb merged commit bde9803 into apache:main Mar 31, 2025
29 checks passed

nirnayroy pushed a commit to nirnayroy/datafusion that referenced this pull request May 2, 2025

FIX : some benchmarks are failing (apache#15367)

8a16813

* distinct_query_sql, topk_aggregate * cargo clippy * cargo fmt * share runtime

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

FIX : some benchmarks are failing #15367

FIX : some benchmarks are failing #15367

Uh oh!

getChan commented Mar 23, 2025 •

edited

Loading

Uh oh!

alamb left a comment

Uh oh!

alamb Mar 25, 2025

Uh oh!

getChan Mar 26, 2025

Uh oh!

Omega359 Mar 31, 2025

Uh oh!

2010YOUY01 commented Mar 30, 2025

Uh oh!

getChan commented Mar 30, 2025

Uh oh!

alamb left a comment

Uh oh!

Uh oh!

alamb commented Mar 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

FIX : some benchmarks are failing #15367

FIX : some benchmarks are failing #15367

Uh oh!

Conversation

getChan commented Mar 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

alamb left a comment

Choose a reason for hiding this comment

Uh oh!

alamb Mar 25, 2025

Choose a reason for hiding this comment

Uh oh!

getChan Mar 26, 2025

Choose a reason for hiding this comment

Uh oh!

Omega359 Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

2010YOUY01 commented Mar 30, 2025

Uh oh!

getChan commented Mar 30, 2025

Uh oh!

alamb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

alamb commented Mar 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

getChan commented Mar 23, 2025 •

edited

Loading