-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sort file names in a directory #2730 #2735
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fix: #2730
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Partitions are processed in parallel on multiple threads so DataFusion cannot provide any guarantee of ordering of results unless the query contains an ORDER BY clause.
However, sorting files by filename seems reasonable to me from a UX point of view so I am fine with this change.
Codecov Report
@@ Coverage Diff @@
## master #2735 +/- ##
==========================================
+ Coverage 84.89% 84.92% +0.03%
==========================================
Files 270 270
Lines 47817 47915 +98
==========================================
+ Hits 40593 40693 +100
+ Misses 7224 7222 -2
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A test might be good to ensure this behavior doesn't revert in the future
@yourenawo there appears to be a CI failure https://github.com/apache/arrow-datafusion/runs/6902662999?check_suite_focus=true due to not running cargo fmt
Added a test case for sorting directories.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @yourenawo ! Looks great
* Update local.rs * Update local.rs * Update local.rs Added a test case for sorting directories.
Sort file names in a directory
fixed: #2730