BUG: libhdfs3 not getting picked up correctly when using conda
#15019
Comments
It seems that I think that we need to implement |
The Arrow HDFS implementation is based on the JNI libhdfs, so it is expected that it doesn't work with libhdfs3. And since libhdfs doesn't seem to be packaged by conda-forge, I don't think there is a way to run the hdfs tests in the conda-forge build purely based on conda packages (our own tests install the JNI library manually on top of the conda env, see e.g. ci/docker/conda-python-hdfs.dockerfile). We have had integration with libhdfs3 as well in the past (and you could switch between both drivers), but this was removed almost 3 years ago (#6432), because the libhdfs3 project is unmaintained. The dask filesystem wrapper using libhdfs3 is also archived (https://github.com/dask/hdfs3). Unless the libhdfs3 project is revived, I don't think we should currently consider adding support for it again. |
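To see which of the two libraries an environment actually provides, something like the following sketch could help. It only inspects file names in a few directories and does not reproduce Arrow's own lookup logic. `ARROW_LIBHDFS_DIR` and `HADOOP_HOME` are the environment variables documented for pyarrow's HDFS support; checking `$CONDA_PREFIX/lib` is an assumption made here for the conda case, and the helper names `find_hdfs_libs` and `candidate_dirs` are hypothetical.

```python
import os
from pathlib import Path


def find_hdfs_libs(search_dirs):
    """Group shared libraries found in search_dirs by flavour.

    "libhdfs" is the JNI-based library, "libhdfs3" the native C++ client.
    """
    found = {"libhdfs": [], "libhdfs3": []}
    for directory in search_dirs:
        directory = Path(directory)
        if not directory.is_dir():
            continue
        for path in sorted(directory.iterdir()):
            name = path.name
            # Check the "3" variant first so it is not counted as plain libhdfs.
            if name.startswith("libhdfs3."):
                found["libhdfs3"].append(str(path))
            elif name.startswith("libhdfs."):
                found["libhdfs"].append(str(path))
    return found


def candidate_dirs(env=None):
    """Build a list of directories to inspect (an assumption, not Arrow's logic)."""
    env = os.environ if env is None else env
    dirs = []
    if env.get("ARROW_LIBHDFS_DIR"):
        dirs.append(env["ARROW_LIBHDFS_DIR"])
    if env.get("HADOOP_HOME"):
        dirs.append(os.path.join(env["HADOOP_HOME"], "lib", "native"))
    if env.get("CONDA_PREFIX"):
        dirs.append(os.path.join(env["CONDA_PREFIX"], "lib"))
    return dirs


if __name__ == "__main__":
    for flavour, paths in find_hdfs_libs(candidate_dirs()).items():
        print(flavour, paths if paths else "not found")
```

If only the `libhdfs3` list is populated, the skipped tests are consistent with the explanation above: the library that conda provides is not the one Arrow loads.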
Cool, thanks for the input. I wasn't aware of the split between |
Oh, the author of the pull request is me... I didn't remember it... I close this because we don't support |
Describe the bug, including details regarding any error messages, version, and platform.
#14832 switched on the test suite within the conda-builds, which itself synced with conda-forge/arrow-cpp-feedstock#875
In the process of doing so, I removed
arrow/dev/tasks/conda-recipes/arrow-cpp/meta.yaml
Lines 351 to 352 in 2c768a1
because the tests always showed:
SKIPPED [24] test_hdfs.py:48: No libhdfs available on system
Even with a patch as follows:
this remained the case. I thought this might be necessary because in the conda-forge world, the name of the libhdfs binary contains the "3", but there seem to be other issues at play here as well.
To fix this issue, the line quoted above should be uncommented, and the conda tests should not show
SKIPPED [24] test_hdfs.py:48: No libhdfs available on system
anymore (at least on unix).
Component(s)
C++, Continuous Integration