Optimized Writing of Arrow Byte Array to Parquet #1764
Labels
enhancement
Any new improvement worthy of an entry in the changelog
parquet
Changes to the parquet crate
performance
Comments
tustvold added the enhancement label on May 30, 2022
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Jul 11, 2022
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Jul 11, 2022
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Jul 16, 2022
tustvold added a commit that referenced this issue on Jul 17, 2022
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Jul 17, 2022
tustvold added a commit that referenced this issue on Jul 21, 2022
This was referenced Jul 21, 2022
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Jul 29, 2022
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Jul 29, 2022
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Jul 29, 2022
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Jul 29, 2022
tustvold added a commit that referenced this issue on Aug 1, 2022
* Optimized writing of byte array to parquet (#1764)
* Review feedback
* Fix logical conflict
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Aug 4, 2022
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Aug 5, 2022
tustvold added a commit to tustvold/arrow-rs that referenced this issue on Aug 5, 2022
tustvold added a commit that referenced this issue on Aug 5, 2022
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
A significant amount of effort has been put into making the reading of byte arrays from parquet fast:
We should invest some effort in making the writer performance comparable.
Describe the solution you'd like
Currently in order to write byte array types from arrow:
Vec<ByteArray>
It would be a significant performance win to be able to elide these first two steps. This would likely involve much the same process as was followed for the reader:
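As a rough illustration of why skipping the intermediate collection could help, here is a minimal sketch using only the Rust standard library. It is not the parquet crate's implementation: `ByteColumn`, `encode_via_intermediate` and `encode_direct` are hypothetical names, `ByteColumn` merely mimics the offsets-plus-values layout of an Arrow variable-length array, and a `Vec<Vec<u8>>` stands in for `Vec<ByteArray>` on the assumption that the intermediate step copies each value into its own buffer. Both paths produce Parquet PLAIN-encoded BYTE_ARRAY data (a 4-byte little-endian length followed by the bytes).

```rust
/// Hypothetical stand-in for an Arrow variable-length byte array:
/// one contiguous values buffer plus `len + 1` offsets into it.
pub struct ByteColumn {
    pub offsets: Vec<i32>,
    pub values: Vec<u8>,
}

impl ByteColumn {
    /// Builds the column by appending each string to the shared values buffer.
    pub fn from_strs(items: &[&str]) -> Self {
        let mut offsets = vec![0i32];
        let mut values = Vec::new();
        for s in items {
            values.extend_from_slice(s.as_bytes());
            offsets.push(values.len() as i32);
        }
        Self { offsets, values }
    }

    /// Number of values in the column.
    pub fn len(&self) -> usize {
        self.offsets.len() - 1
    }

    /// Borrows the i-th value as a slice of the shared buffer (no copy).
    pub fn value(&self, i: usize) -> &[u8] {
        let start = self.offsets[i] as usize;
        let end = self.offsets[i + 1] as usize;
        &self.values[start..end]
    }
}

/// Appends one value in PLAIN encoding: 4-byte little-endian length, then the bytes.
fn put_plain(out: &mut Vec<u8>, value: &[u8]) {
    out.extend_from_slice(&(value.len() as u32).to_le_bytes());
    out.extend_from_slice(value);
}

/// Assumed shape of the slow path: copy every value into its own
/// allocation (standing in for `Vec<ByteArray>`), then encode.
pub fn encode_via_intermediate(col: &ByteColumn) -> Vec<u8> {
    let owned: Vec<Vec<u8>> = (0..col.len()).map(|i| col.value(i).to_vec()).collect();
    let mut out = Vec::new();
    for v in &owned {
        put_plain(&mut out, v);
    }
    out
}

/// Sketch of the proposed path: encode straight from the offsets and
/// values buffers, with a single output allocation sized up front.
pub fn encode_direct(col: &ByteColumn) -> Vec<u8> {
    let mut out = Vec::with_capacity(col.values.len() + 4 * col.len());
    for i in 0..col.len() {
        put_plain(&mut out, col.value(i));
    }
    out
}
```

Calling both functions on `ByteColumn::from_strs(&["ab", "", "cde"])` yields identical bytes; the direct path simply avoids one allocation and one copy per value.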
Describe alternatives you've considered
We could choose not to do this.