Use fetch limit in get_sorted_iter #3545
Conversation
@@ -273,6 +275,7 @@ impl MemoryConsumer for ExternalSorter {
     &self.expr,
     self.session_config.batch_size(),
     tracking_metrics,
+    self.fetch,
Nice thing is that it also reduces disk spilling, as sort + limit is done before writing.
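A hypothetical sketch of the effect described in this comment: because sort + limit happens before the write, at most `fetch` rows of each sorted run reach the spill file. The `SpillWriter` type and `spill_sorted_run` function are invented names for illustration, not DataFusion's actual spill API.

// Hypothetical illustration only: `SpillWriter` and `spill_sorted_run` are
// invented names, not DataFusion's spill implementation.
struct SpillWriter {
    rows_written: usize,
}

impl SpillWriter {
    fn write_rows(&mut self, n: usize) {
        self.rows_written += n;
    }
}

// Applying the limit before writing means each spilled run contains at most
// `fetch` rows instead of the full sorted input.
fn spill_sorted_run(sorted_len: usize, fetch: Option<usize>, writer: &mut SpillWriter) {
    let rows = fetch.map_or(sorted_len, |limit| sorted_len.min(limit));
    writer.write_rows(rows);
}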
Although to be honest, I would hope that if there is a LIMIT on the query we could probably avoid the spilling entirely
Yeah, maybe the spilling code could see that the remaining batch is so small it could keep the sorted data in memory again - avoiding the spill 🤔
As a future PR / optimization perhaps
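A rough sketch of the idea floated in this thread (not part of this PR): after sort + limit, the truncated run may be small enough to keep in memory rather than spill. `SpillDecision`, `decide_spill`, and `memory_budget_bytes` are invented names used only to illustrate the check.

// Hypothetical sketch of the future optimization discussed above.
enum SpillDecision {
    KeepInMemory,
    Spill,
}

fn decide_spill(limited_run_bytes: usize, memory_budget_bytes: usize) -> SpillDecision {
    // With a LIMIT (e.g. 10 rows per partition) the sorted-and-truncated run
    // is often tiny, so the spill can be skipped entirely.
    if limited_run_bytes <= memory_budget_bytes {
        SpillDecision::KeepInMemory
    } else {
        SpillDecision::Spill
    }
}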
        .map(|i| row_indices[*i as usize])
        .collect();

    Ok(SortedIterator::new(row_indices, batch_size))
Some cleanup - we can do this mapping immediately instead of keeping it inside SortedIterator.
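A rough illustration of this cleanup, with invented types: the logical sort indices are mapped to composite (batch, row) positions eagerly, rather than deferring the lookup to SortedIterator. `CompositeIndex` and `to_composite` here are placeholders, not DataFusion's internal names.

// Illustration only: eager mapping of sorted logical indices to composite
// positions, as in the `.map(...).collect()` shown in the diff above.
#[derive(Clone, Copy, Debug)]
struct CompositeIndex {
    batch_idx: u32,
    row_idx: u32,
}

fn to_composite(sorted_indices: &[u32], row_indices: &[CompositeIndex]) -> Vec<CompositeIndex> {
    sorted_indices
        .iter()
        .map(|i| row_indices[*i as usize]) // same mapping as in the diff above
        .collect()
}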
// distributed with this work for additional information
// regarding copyright ownership. The ASF licenses this file
// to you under the Apache License, Version 2.0 (the
// "License"); you may not use this file except in compliance
Moved this to the planner - it seems a bit simpler. We don't need access to the parent anymore now.
This looks like a great improvement -- nice work @Dandandan
@@ -374,44 +379,38 @@ fn get_sorted_iter(
        })
    })
    .collect::<Result<Vec<_>>>()?;
-   let indices = lexsort_to_indices(&sort_columns, None)?;
+   let indices = lexsort_to_indices(&sort_columns, fetch)?;
✨ 👍
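A standalone sketch of the kernel call in the diff above (not the PR's code itself): arrow's `lexsort_to_indices` accepts an optional limit, so passing the fetch value truncates the sorted indices up front. `sort_with_fetch` is an invented helper name for this example.

// Minimal sketch using arrow's lexsort_to_indices and take kernels.
use std::sync::Arc;

use arrow::array::{ArrayRef, Int64Array};
use arrow::compute::{lexsort_to_indices, take, SortColumn};
use arrow::error::ArrowError;

fn sort_with_fetch(column: ArrayRef, fetch: Option<usize>) -> Result<ArrayRef, ArrowError> {
    let sort_columns = vec![SortColumn {
        values: column.clone(),
        options: None,
    }];
    // With `fetch = Some(k)` only the first k sorted positions are returned.
    let indices = lexsort_to_indices(&sort_columns, fetch)?;
    take(column.as_ref(), &indices, None)
}

fn main() -> Result<(), ArrowError> {
    let col: ArrayRef = Arc::new(Int64Array::from(vec![5, 1, 4, 2, 3]));
    let top2 = sort_with_fetch(col, Some(2))?;
    println!("{:?}", top2); // only the two smallest values: [1, 2]
    Ok(())
}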
/// Indexes into the input representing the correctly sorted total output
indices: UInt32Array,
/// Map each logical input index to where it can be found in the sorted input batches
/// Sorted composite index of where to find the rows in buffered batches
👍
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
Benchmark runs are scheduled for baseline = 0a2b0a7 and contender = ff718d0. ff718d0 is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
This PR appears to have broken the build -- PR to fix: #3576
Which issue does this PR close?
Closes #3544
Rationale for this change
Provides a small speedup over the earlier results.
We can see this from the output of:
explain analyze select l_orderkey from t order by l_orderkey limit 10;
Before:
After (note: only 160 rows = 10 rows * 16 partitions):
What changes are included in this PR?
Are there any user-facing changes?