-
Notifications
You must be signed in to change notification settings - Fork 147
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compressed Shuffle (Arrow-IPC compression) #4
Comments
IPC block-based compression is supported now. we can still switch to column-based compression if it achieves better compression and performance. |
Great work! Is that possible to report new benchmark results for the latest master? @richox |
We could always explore buffer based compression when it gets direct support from arrow-rs later. |
we got some performance issue when running on STS with small memory and broadcast join enabled. i guest we have to implement native BHJ before we get a better benchmark result. |
we implemented a custom designed format for serializing record batches in the latest version. arrow-ipc format is no longer used because we found some performance issues while compressing with low level zstd. |
Upstream issues:
[TODO] Rust side: apache/arrow-rs#1709
[Partly Finished?] Java side: https://issues.apache.org/jira/browse/ARROW-8672
The text was updated successfully, but these errors were encountered: