-
Notifications
You must be signed in to change notification settings - Fork 512
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
idea: Add parquet arrow async write/read support #4868
Comments
Interesting, I'm going to have a try🤩 |
Perfect! |
Perhaps we could have an |
I'm considering introducing multiple versions of parquet via
Due to we are still using parquet 51.0.0 🥲. Would you like to have any ideas? cc @waynexia |
Since arrow's release going to be stable, maybe we can keep track with upstream instead? |
Make sense🥹 |
For |
Depends on multiple versions of parquet may bring lots of burdens on maintenance, especially when the versions increase. This is usually achieved via maintaining multiple branches and are released separately to migrate the API change of parquet, which doesn't seems to match our situation at present neither. So I prefer not to have this at least at beginning. We can reconsider this when there's a real requirement like a specific version of parquet is LTS and used widely. |
Seems we can use |
Cool! |
We can implement native parquet arrow support to make our users happy:
By implementing this integration, users can avoid using the low-performance
AsyncRead
/AsyncWrite
trait or creating their own shims.Steps:
AsyncWriter
for parquet integrations #4958AsyncReader
for parquet integrations #4972The text was updated successfully, but these errors were encountered: