adding RowsReader and writer #14149
base: main
Conversation
I believe this feature is important to external sorting's performance, thank you. Left some suggestions.
> I have two follow-up PRs that implement SortPreservingMergeStream in Row format and change the logic in SortExec

Perhaps first make SPM's input and output both support the Rows format? This seems easier because only one operator needs to change. And since a larger sort query includes two levels of SPM, we can get some performance improvement from it.
let mut current_offset = 0u32;
let mut row_data = Vec::new();

for i in 0..rows.num_rows() {
I think we can directly copy all rows' data at once, instead of one by one.
But perhaps we can leave those performance-related improvements to a follow-on PR, and set up a benchmark first. I suspect there are also some other unnecessary memory copies we could eliminate 🤔
That's what I wanted to do in the beginning, but the implementation does not allow us to get the data of Rows directly.
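For context, here is a minimal sketch of the per-row copy being discussed, assuming only the public arrow_row API (Rows::row() and Row's AsRef<[u8]> impl); the function and variable names are illustrative, not the PR's actual code:

```rust
use arrow::row::Rows;

/// Copy each row's encoded bytes into one contiguous buffer, recording
/// the starting offset of every row so a reader can slice them back out.
fn serialize_rows(rows: &Rows) -> (Vec<u8>, Vec<u32>) {
    let mut row_data = Vec::new();
    let mut row_offsets = Vec::with_capacity(rows.num_rows());
    let mut current_offset = 0u32;
    for i in 0..rows.num_rows() {
        let row = rows.row(i); // Row<'_> exposes its bytes via AsRef<[u8]>
        row_offsets.push(current_offset);
        current_offset += row.as_ref().len() as u32;
        row_data.extend_from_slice(row.as_ref());
    }
    // A single bulk copy would avoid this loop, but Rows does not expose
    // its underlying buffer publicly, hence the row-by-row approach.
    (row_data, row_offsets)
}
```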
use std::sync::Arc;
use tempfile::NamedTempFile;

use crate::sorts::row_serde::{RowReader, RowWriter};
Maybe we can enumerate more tests for edge cases:
- Call write_rows() multiple times
- Write only one row
- Write one row multiple times
- Include a variable-length field like a string
I'll add more test coverage
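As a concrete starting point for the variable-length case, here is a hedged sketch that builds Rows containing a Utf8 column via the public arrow_row API; the round-trip itself would go through the PR's RowWriter/RowReader, whose exact signatures aren't assumed here:

```rust
use std::sync::Arc;

use arrow::array::{ArrayRef, StringArray};
use arrow::datatypes::DataType;
use arrow::row::{RowConverter, Rows, SortField};

/// Build Rows from a string column, covering empty and longer values.
fn string_rows() -> Rows {
    let converter =
        RowConverter::new(vec![SortField::new(DataType::Utf8)]).unwrap();
    let col: ArrayRef =
        Arc::new(StringArray::from(vec!["", "a", "a much longer value"]));
    converter.convert_columns(&[col]).unwrap()
}
```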
row_offsets.len() * 4 // row offsets
}

pub fn finish(mut self) -> Result<(), DataFusionError> {
I think we need some mechanism to prevent writing again into a finished RowsWriter:

writer.write_rows(batch1);
writer.write_rows(batch2);
writer.finish();
writer.write_rows(batch3); // should fail
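A hedged sketch of one way to implement that guard, holding the inner writer in an Option that finish() takes out. This is a simplified stand-in: the PR's finish(mut self) consumes the writer, while a &mut self variant is used here so the runtime check is observable:

```rust
use std::io::Write;

use datafusion_common::DataFusionError;

struct RowWriter<W: Write> {
    /// `None` once finish() has been called.
    writer: Option<W>,
}

impl<W: Write> RowWriter<W> {
    fn write_rows(&mut self, data: &[u8]) -> Result<(), DataFusionError> {
        // Rejects writes once finish() has run.
        let writer = self.writer.as_mut().ok_or_else(|| {
            DataFusionError::Internal("Cannot write to finished RowWriter".to_string())
        })?;
        writer.write_all(data)?;
        Ok(())
    }

    fn finish(&mut self) -> Result<(), DataFusionError> {
        // take() removes the inner writer, so later write_rows() calls fail.
        let mut writer = self.writer.take().ok_or_else(|| {
            DataFusionError::Internal("RowWriter already finished".to_string())
        })?;
        writer.flush()?;
        Ok(())
    }
}
```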
I think we have to change both GroupHashExec and SortExec as well, since these two operators are using the column format right now.
Also, since we keep the column format for single-column sorts, I'm not sure whether changing SortPreservingMergeStream is a better choice than adding a RowformatMergeStream. Kind of hard to choose here.
@2010YOUY01 when you think this is ready for me to review, please let me know. @Lordworms 👋 thank you 🙏
@alamb I think it's ready for another look
When we reach agreement on the design after a second opinion, I'd recommend adding more documentation, including:
- File format encoding
- Doc comment (a simple example that writes Rows and then reads them back)
@@ -33,6 +33,7 @@ workspace = true

[features]
force_hash_collisions = []
compress = ["flate2"]
The datafusion core crate also has this dependency:

datafusion/core/Cargo.toml (line 46 at e9a77e0):
compression = ["xz2", "bzip2", "flate2", "zstd", "async-compression", "tokio-util"]

Is it possible to move the dependency to the workspace level, to keep the versions the same?
I think even the whole compression feature could be moved to the workspace level, if we want to support different compression for spilling 🤔 (we can do this in another PR)
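A hedged sketch of what the workspace-level move could look like; the version number and the choice of member crate are illustrative, not the PR's actual change:

```toml
# Workspace root Cargo.toml: declare the dependency once.
[workspace.dependencies]
flate2 = "1.0"

# In a member crate's Cargo.toml (e.g. datafusion/physical-plan):
# inherit the shared version and keep it optional behind the feature.
[dependencies]
flate2 = { workspace = true, optional = true }

[features]
compress = ["flate2"]
```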
I think that would be better
let metadata_size = self.metadata_size(&row_offsets);

let writer = self.writer.as_mut().ok_or_else(|| {
    DataFusionError::Internal("Cannot write to finished RowWriter".to_string())
Great! Can we add a failing test case for this scenario?
added
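For reference, a minimal sketch of that failing-case test, written against the simplified guard sketch above (Vec<u8> stands in for the spill file since it implements io::Write):

```rust
#[test]
fn write_after_finish_fails() {
    let mut writer = RowWriter { writer: Some(Vec::new()) };
    writer.write_rows(b"encoded row bytes").unwrap();
    writer.finish().unwrap();
    // Any write after finish() must surface an Internal error.
    assert!(writer.write_rows(b"more bytes").is_err());
}
```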
Which issue does this PR close?
Part of #7053: adding a Row format reader and writer for spill.
Closes #.
Rationale for this change
What changes are included in this PR?
Are these changes tested?
Are there any user-facing changes?