Submitted tfjobs cease to start running under unknown conditions #203
Comments
Please provide the logs of the TfJob operator pod. Also, if I have access to this cluster, please leave it up and running when it happens so I can inspect it.
Sure, will do.
I think I'm encountering this myself.
I opened #218 for the specific issue I encountered. Chris, we can continue to use this issue to track your particular problem.
Sounds good.
/lifecycle stale
See logs. Sometimes a previously working deployment no longer launches tfjobs, and after re-deploying the cluster the same tfjob deployment then works. The next time it happens I can collect log or config data; let me know what you'd need.