-
Notifications
You must be signed in to change notification settings - Fork 715
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Slim versions of TFX Docker images #6921
Comments
If that TFX image is based on the latest (1.16dev) image, then that is quite a saving, almost half. Interesting. Did you find it hard to build, @pritamdodeja ? And: |
One reason (of many) that these large images are problematic is that GCP DataFlow jobs take forever to spin up new workers - anywhere from 15 to 30 minutes, in my experience! I initially thought this might be due to lengthy dependency installation on worker startup (as described here), but I've confirmed that my dependencies are pre-installed in my custom docker image (based on DataFlow system logs confirm the long duration of the image pull:
^ This happens for every worker that DataFlow spins up, which makes these jobs very slow to scale. Anything that can be done to reduce the size of this |
Dear Users, |
This issue has been marked stale because it has no recent activity since 7 days. It will be closed if no further activity occurs. Thank you. |
Hello! 👋 We've created a Docker image that significantly reduces the size compared to standard TFX Docker images.
It has been tested successfully on Vertex-ai pipeline. Here is the Dockerfile:
|
Excellent work, @KholofeloPhahlamohlaka-TAL ! (Full disclosure: we work together, but I think he deserves praise on the world wide web as well!) |
Hi @KholofeloPhahlamohlaka-TAL |
We would be very interested in anyone's experience building TFX images with Nvidia GPU support. In our experience, this can easily double the size, and it's not easy to get TF/TFX/CUDA etc versions to align and be 'found' by TF |
Could we have Docker images that are slimmer? Some examples of TFX Docker image sizes (compressed, even):
TFX 1.0: 5.67GB
TFX 1.5: 6.65GB
TFX 1.10: 8.53GB
TFX 1.15: 11.4GB
At least an explanation why the image sizes keep on growing would be great.
Or is the recommended way to build a Docker image yourself off a slim Python or Ubuntu image?
The text was updated successfully, but these errors were encountered: