pull-kubernetes-e2e-gce instantly failing all runs with python error #30759
Comments
/priority critical-urgent
There are no sig labels on this issue. Please add an appropriate label by using one of the following commands:
Please see the group list for a listing of the SIGs, working groups, and committees available. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
@aojea I don't think it's related. #30695 triggered the build of new images but the failing tests still use
It impacts a lot more jobs than just e2e-gce. I saw it this AM when I was trying to test new presubmits I added; /test pull-crio-cgroupv1-node-e2e-eviction was among those that failed. I also see node-e2e-containerd failing now. It seems to be any job that uses the bootstrap.py image.
the testgrid is not reflecting all the failures; this appears to be failing on all PRs
last passing run I see is at 9:52AM (ET): https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/pull/120755/pull-kubernetes-e2e-gce/1704131427878047744
first failing run I see with this failure is at 11:07AM (ET): https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/pull/120755/pull-kubernetes-e2e-gce/1704150135245639680
@liggitt attached a PR that might fix this issue, but there might be other locations that break.
bootstrap.py jobs clone test-infra, so changes to the scripts are picked up immediately (yes, it's terrible, and deprecated). Changes to the image should be controlled by the tags.
That run appears to have
The bootstrap script itself hasn't had a commit for 6 months. Did we change one of the job parameters, maybe in a preset or similar? Or actually, maybe something changed with the compute metadata service?
here's how i could recreate the problem
|
Kicked off a test here. Seems to be running.
(#30763 should be live in bootstrap.py jobs now and, per the above comment, seems to be working)
confused about why but glad #30763 seems to have worked
#30763 is the right thing to land long term, since in python3 that's how the code should work. We were probably lucky that it worked this long.
resolving this as the tests are progressing now, thanks for the fix... $1 for tapping with a hammer, $999 for knowing where to tap
I suspect either:
|
What happened:
looks like something just changed in test-infra and broke the pull-kubernetes-e2e-gce job:
What you expected to happen:
How to reproduce it (as minimally and precisely as possible):
seen in all runs of kubernetes/kubernetes#120755
Please provide links to example occurrences, if any:
Anything else we need to know?: