-
Notifications
You must be signed in to change notification settings - Fork 214
Initializing GPU device takes very long #149
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Running into this on Linux as well, for |
@rnett what device do you have? |
TF compiles the kernels for compute levels that it doesn't ship with, and we only compile for 3.5 and 7.0 because the build times out on Github Actions otherwise. You can change this line https://github.com/tensorflow/java/blob/master/tensorflow-core/tensorflow-core-api/build.sh#L27 to whatever compute levels you want and then do a full rebuild to get binaries for your specific use case. |
Yeah, I just figured that out (#200). If it is build times I guess there's not much we can do. I'm using a 1070. |
I am just wondering if compute levels 3.5 and 7.0 are the most sensible choices then. I'd expect by now 6.1 is much more common than 3.5. I also doubt building tensorflow/java yourself is far from the nicest user experience one can offer. Is it possible to break-up the build into multiple actions? Github actions have a 6 hour limit, a workflow has a 72 hour limit. |
We've been trying to get more build resources for over a year, when we do it'll be simple to build for more GPU targets. At the moment this gives us coverage over the things tf python supports at the cost of slow initialization for many users. |
Please make sure that this is a bug. As per our GitHub Policy, we only address code/doc bugs, performance issues, feature requests and build/installation issues on GitHub. tag:bug_template
System information
You can collect some of this information using our environment capture script
You can also obtain the TensorFlow version with
python -c "import tensorflow as tf; print(tf.GIT_VERSION, tf.VERSION)"
Describe the current behavior
Opening a GPU device takes very long (close to 10 minutes), I am guessing it is compiling CUDA kernels.
Describe the expected behavior
Shorter waiting time when opening the GPU device.
Code to reproduce the issue
Provide a reproducible test case that is the bare minimum necessary to generate the problem.
Other info / logs
Include any logs or source code that would be helpful to diagnose the problem. If including tracebacks, please include the full traceback. Large logs and files should be attached.
The text was updated successfully, but these errors were encountered: