-
Notifications
You must be signed in to change notification settings - Fork 298
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[4.6, 4.7] installer bootstrap never completes although cluster install completes #502
Comments
Similar issue at |
This is also affecting newer 4.6 nightlies |
Seems to be a regression from https://bugzilla.redhat.com/show_bug.cgi?id=1918281 |
We've been experiencing a similar issue since 4.40 (that's the last known working version that installs properly with IPI on VMWare. We see this in the output of the installer for 4.60 that was released today: INFO API v1.19.2-1049+f173eb4a83e557-dirty up This repeats until we see: FATAL Bootstrap failed to complete: failed to wait for bootstrapping to complete: timed out waiting for the condition. I belive our logs are similar to those already attached to this issue, but if our logs would be useful, we are happy to share them. |
For OKD, this is actually an issue with /etc/resolv.conf and how it interacts with systemd-resolved. We fixed prepender code for the masters and workers and now the same fix is needed for the bootstrap nodes... for OKD, /etc/resolv.conf should be a link... OCP doesn't use systemd-resolved so it should be a static file Best way to check is to ssh into the nodes and check /etc/resolv.conf |
Fix is in upstream installer. openshift/installer#4654 |
Describe the bug
When installing newer 4.6 and 4.7 clusters (vSphere IPI) bootstrap never completes. However, cluster install continues and if I do a wait-for install complete it will finish.
There is another issue which may be related... When the api-int IP fails over from the boot strap, the API becomes unavailable and does not comeback. If I reboot the master node with the api-int IP, the cluster API comes back online and installation continues. This has started happening within the last week or so.... I can open another bug if needed for this
Version
vSphere IPI
How reproducible
100%
Log bundle
openshift_install.log
Must-gather
qa-c1v4-kqlzb-master-1 is the master node which had to be rebooted manually...
The text was updated successfully, but these errors were encountered: