Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

4.16.0-okd-scos.1 Sinlge node installation not working properly #27

Open
LennertMertens opened this issue Nov 25, 2024 · 2 comments
Open

Comments

@LennertMertens
Copy link

When installing a single node OKD cluster, the installation is not properly proceeding because the oc command seems to be missing in the image. I manually installed the oc binary but when the installation continues, it reports an unhealthy cluster state.

Nov 25 10:02:57 control-plane podman[3607]: 2024-11-25 10:02:57.672576541 +0000 UTC m=+0.030907715 image pull ac3f3dae3c06d7338ff9481caab6fa7f958b52094f1825c8d4da80d47d16876f quay.io/okd/scos-release@sha256:06ffff6c6951046d03df0784bc18132c368a84fe72bcfb529484a58872c3a2e1
Nov 25 10:02:57 control-plane podman[3634]: 2024-11-25 10:02:57.818192064 +0000 UTC m=+0.049742304 container remove 5789219a171912cd0e2d116fff3f8dc10cf38205284509f2672877dbc9a8d7b4 (image=quay.io/okd/scos-release@sha256:06ffff6c6951046d03df0784bc18132c368a84fe72bcfb529484a58872c3a2e1, name=competent_leakey, io.openshift.release=4.16.0-okd-scos.1, io.openshift.release.base-image-digest=sha256:7d8d6875c9e8c9aa0eab546f354b92555a6c7621393a1ea98da4ecbf29e263e3)
Nov 25 10:02:57 control-plane bootkube.sh[2944]: Moving OpenShift manifests in with the rest of them
Nov 25 10:02:57 control-plane bootkube.sh[3666]: /usr/local/bin/bootkube.sh: line 81: oc: command not found
Nov 25 10:02:57 control-plane systemd[1]: bootkube.service: Main process exited, code=exited, status=127/n/a
Nov 25 10:02:57 control-plane systemd[1]: bootkube.service: Failed with result 'exit-code'.
Nov 25 10:02:57 control-plane systemd[1]: bootkube.service: Consumed 2.572s CPU time.
Nov 25 10:03:03 control-plane systemd[1]: bootkube.service: Scheduled restart job, restart counter is at 1.
Nov 25 10:03:05 control-plane systemd[1]: Started bootkube.service - Bootstrap a Kubernetes cluster.
@bshephar
Copy link

This usually only happens until the node pulls down the rpm-ostree image for the version you're installing. Since we start the bootstrap with Fedora CoreOS and then use rpm-ostree to rebase on CentOS Stream CoreOS. Then the installation completes. So typically, this error is only transient until the node is rebooted from the SCOS image and then at that time, it will have the tools available.

I think we would need the must-gather from that node to definitively say whether or not this is the case in your environment.

@BeardOverflow
Copy link

@bshephar It is not a pulling error.

rpm-ostree does not work in certain scenarios like coreos/rpm-ostree#4547

This is a blocking error because it does never pivot to SCOS.

Long discussed here: okd-project/okd#2041

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants