-
Notifications
You must be signed in to change notification settings - Fork 176
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ImportError: libarrow.so.14: cannot open shared object file: No such file or directory when running on Dataflow #83
Comments
This also happens when trying to run TF Transform (since that pulls in TFDV). I experienced this again this morning. The first time I ran it, the DF job failed with this error, and without touching any code and re-running again, it was able to completely finish. |
@andrewsmartin , Make sure you also include the exact command if possible to produce the output included in your test case. If you are unclear what to include see the issue template displayed in the Github new issue template. We ask for this in the issue submission template, because it is really difficult to help without that information. Thanks! |
Hi @rmothukuru, thanks for the reply. Here are the library versions I am using: Tensorflow: 1.14.0 This might be more of an issue for https://github.com/tensorflow/tfx, since I am submitting the job to Dataflow through their orchestration layer (https://github.com/tensorflow/tfx/blob/master/tfx/components/base/base_executor.py#L80). The custom setup file I'm using for beam looks like this:
I am doing this in order to work around tensorflow/tfx#649. I will try to follow up shortly with a way to reproduce using the example taxi dataset, I understand it's hard to diagnose without that. |
If it makes more sense to discuss this over at https://github.com/tensorflow/tfx, I'm happy to close this! |
@andrewsmartin , |
Hi,
We are trying to upgrade to TFX 0.14.0 for our pipelines, but when running Statistics Gen on Dataflow, we frequently run into this error:
We are using Beam 2.14 (as per TFX compatibility matrix) and Python 3.6. It's very strange because this error doesn't happen all the time, which makes me think it could an environment issue on some of the dataflow workers, but I'm not entirely sure.
We install
tfx==0.14.0
on the beam workers by providing a custom setup file.The text was updated successfully, but these errors were encountered: