Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Minor: Add documentation + diagrams for ExternalSorter #7179

Merged
merged 4 commits into from
Aug 3, 2023

Conversation

alamb
Copy link
Contributor

@alamb alamb commented Aug 2, 2023

Which issue does this PR close?

Part of #5885

Rationale for this change

While working on #5885 I have studied this code more carefully and wanted to document how it works (which is part of how I convince myself I understand the code -- if I can document it)

What changes are included in this PR?

Doc strings and ASCII diagrams

Are these changes tested?

N/A

Are there any user-facing changes?

No

@alamb
Copy link
Contributor Author

alamb commented Aug 2, 2023

@yjshen or @Dandandan I don't know if you might have time to review this PR, but if so I would appreciate it

Copy link
Member

@yjshen yjshen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks @alamb!

/// [`Self::insert_batch`] as a stream of [`RecordBatch`]es.
///
/// This may be an in memory sort/merge if all input fit into memory, or
/// a streaming merge from spill files on disk.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This process could either be an in-memory sort/merge if all the input fits into memory or a combined streaming merge incorporating both in-memory batches and data from spill files on disk.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @yjshen -- I updated the wording in 8d7d08f

@yjshen yjshen merged commit fcddabc into apache:main Aug 3, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
core Core DataFusion crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants