Remove dependency on hadoop-common #806
Comments
I spent a few hours yesterday trying to make this work and got everything except LZ4 working. This is the branch: The remaining failure:
I just tried loading an LZ4 Parquet file in the IDEA "Avro and parquet viewer plugin", and it fails. The plugin can load GZIP-compressed files but not LZ4-compressed ones. I tried eth_v1_p1_cLZ4.parquet from Amanda's deephaven-core-parquet-examples repository, which community core can load.
It doesn't look like we can remove the dependency on The above mentioned approach which uses Note that
Agreed, I think this issue should be closed, to be reevaluated if we ever rewrite our compression codec handling again.
Thanks Colin, I can close it once we merge #4457
Related to #294
Related to #901
During review of #798, I dug more into why we need some Hadoop dependencies.
Essentially, parquet uses org.apache.hadoop.conf.Configuration from hadoop-common. Unfortunately, hadoop-common has sprawling dependencies that make it undesirable for inclusion as part of a library.
Potentially relevant links:
https://issues.apache.org/jira/browse/PARQUET-1822
http://mail-archives.apache.org/mod_mbox/parquet-dev/202001.mbox/%3cCAO4re1m-Y9X3yQABX1_XaSaof4NZWBb8Tg_TBXgepK8rCJfU-g@mail.gmail.com%3e
https://stackoverflow.com/questions/59939309/read-local-parquet-file-without-hadoop-path-api
https://github.com/benwatson528/intellij-avro-parquet-plugin
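For context on the Stack Overflow link above: the file-access side of reading a local Parquet file needs nothing from Hadoop. The sketch below is illustrative only and is not code from this project or from parquet itself. It uses plain java.nio to check the "PAR1" magic bytes and read the footer length, assuming an ordinary (unencrypted-footer) Parquet file. The class name LocalParquetFooter and its methods are hypothetical. It does not decode the footer or touch compression codecs, which is where the Hadoop dependency discussed in this issue remains.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.channels.FileChannel;
import java.nio.charset.StandardCharsets;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.Arrays;

// Hypothetical helper: locates the Parquet footer in a local file using only the JDK.
final class LocalParquetFooter {
    // Parquet files with a plaintext footer begin and end with these 4 bytes.
    private static final byte[] MAGIC = "PAR1".getBytes(StandardCharsets.US_ASCII);

    /** Returns the length in bytes of the serialized footer metadata. */
    static int footerLength(Path file) throws IOException {
        try (FileChannel ch = FileChannel.open(file, StandardOpenOption.READ)) {
            long size = ch.size();
            // Smallest possible layout: leading magic (4) + footer length (4) + trailing magic (4).
            if (size < 12) {
                throw new IOException("Too small to be a Parquet file: " + file);
            }
            byte[] leading = new byte[4];
            readFully(ch, 0, 4).get(leading);

            // The last 8 bytes are a little-endian int32 footer length followed by the magic.
            ByteBuffer tail = readFully(ch, size - 8, 8).order(ByteOrder.LITTLE_ENDIAN);
            int length = tail.getInt();
            byte[] trailing = new byte[4];
            tail.get(trailing);

            if (!Arrays.equals(leading, MAGIC) || !Arrays.equals(trailing, MAGIC)) {
                throw new IOException("Missing PAR1 magic bytes: " + file);
            }
            if (length < 0 || length > size - 12) {
                throw new IOException("Implausible footer length " + length + ": " + file);
            }
            return length;
        }
    }

    // Reads exactly `count` bytes starting at `position`, or fails on a short file.
    private static ByteBuffer readFully(FileChannel ch, long position, int count) throws IOException {
        ByteBuffer buf = ByteBuffer.allocate(count);
        while (buf.hasRemaining()) {
            int n = ch.read(buf, position + buf.position());
            if (n < 0) {
                throw new IOException("Unexpected end of file");
            }
        }
        buf.flip();
        return buf;
    }
}
```

Calling LocalParquetFooter.footerLength(Path.of("some-file.parquet")) on a valid file returns the number of footer bytes that sit immediately before the final 8 bytes of the file.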