Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
ENH: add arrow engine to read_csv #31817
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ENH: add arrow engine to read_csv #31817
Changes from all commits
f22ff46
8ae43e4
09074df
6be276d
df4fa7e
9cd9a6f
ecaf3fd
b3c3287
474baf4
2cd9937
48ff255
3d15a56
c969373
98aa134
b9c6d2c
67c5db6
7f891a6
11fc737
23425f7
d9b7a1f
b8adf3c
01c0394
ba5620f
2570c82
b3a1f66
d46ceed
d67925c
6378459
9d64882
852ecf9
93382b4
f1bb4e2
14c13ab
7876b4e
4426642
008acab
2dddae7
261ef6a
88e200a
bf063ab
ede2799
e8eff08
87cfcf5
55139ee
c1aeecf
62fc9d6
b53a620
f13113d
f9ce2e4
4158d6a
d34e75f
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add a versionchanged tag here 1.2
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add a versionchanged 1.2 tag here
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we might want to refactor this entire section to provide a more table like comparision of all of the parsers, if you'd create an issue for this
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why is all this needed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This has been answered before: #31817 (comment) (and the above comment has been added based on your comment)
It's to 1) import a submodule (
pyarrow.csv
in this case) and 2) to support passing a different version as in our global minimum versions dictionary.Now I suppose that the submodule importing is not necessarily needed. Right now this PR does:
but I suppose this could also be:
And then this additional code to directly import a submodule with
import_optional_dependency
is not needed (although where it is used, I think it is a bit cleaner to be able to directly import the submodule)There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@jorisvandenbossche importing as a submodule is required, you can't access the csv module by doing
pyarrow.csv
as far as I remember, and if you doimport pyarrow.csv
, the it won't validate the version and will not error for pyarrow<0.15