-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Skaffold Deploy Fails on Readiness Probe Failures of Terminating Pods #6758
Comments
After writing this issue I realized that our pod is actually exiting with a code 137 (meaning the process was killed by a signal SIGKILL). This indicates the pod may not be terminating gracefully. We will keep investigating on our end. |
After more investigation, part of the issue was related to the Python I am still curious whether we expect Skaffold status checks to fail because of issues related to the terminating pods? It seems that even though there was an underlying problem in our application, it should not matter whether pods are exiting gracefully or not. Only the health of new pods should matter. In our case, the new pods were fine, and it would have seemed reasonable for Skaffold to deem the deployment stable. |
I was not able to reproduce this issue with what I believe was a similar setup. Can you try using the latest version of skaffold - Also, are you explicitly setting a |
Yes, you are right, we are! We are using the following environment variable during
We recently started doing this as a workaround to #2745. Without this flag, pods were always restarted on deploy (because of a changing pod template hash), even though no changes happened in the original deployment definition. |
Ah I see, the issue you are encountering is related to both deploys having the same value for
|
We just encountered the same issue once again with a different deployment (not the Python deployment which we worked around by removing the watchtower library preventing quick shutdown of the pod). This time we tried @briandealwis's suggestion to update to the latest version and it fixed our Thank you for your help! |
Expected behavior
When running
skaffold deploy
, status checks / stabilization shouldn't fail because the readiness probe of a terminating pod is failing. As far as I understand, the following discussion establishes that readiness probe failures for terminating pods are expected.kubernetes/kubernetes#98571 (comment)
kubernetes/kubernetes#52817
Actual behavior
Skaffold fails to stabilize when a terminating pod has a readiness probe failure.
Information
Steps to reproduce the behavior
skaffold deploy
Logs
In the logs above:
pod/my-service-8c5d4f4f4-t8c7j
is terminating. Pod logs indicate that the pod is sent a TERM signal and is able to gracefully terminate with some expected readiness probe failurespod/my-service-bbcc9d978-ksh2q
is startingThe text was updated successfully, but these errors were encountered: