lws controller cannot reconcile the pod to the right status #391
Comments
|
I tried it; kubectl get lws/sglang -oyaml returned nothing useful. Here are some logs from the lws-controller pods:
|
Hi @wangyuan249 |
The full StatefulSet list output is too long, more than 2000 columns. cluster-dump.json is also too long, more than 10000 columns.
|
Great, thanks. It seems I've identified the problem. The ordinals field is not present in the StatefulSet; this feature is enabled by default only from Kubernetes 1.27 onward, as detailed here: https://kubernetes.io/blog/2023/04/28/statefulset-start-ordinal/. So the feature may be disabled in your cluster. Could you print the output of kubectl version? |
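One way to confirm this diagnosis is to check whether the API server kept the ordinals field on a StatefulSet. The following is a minimal Python sketch, assuming the object was exported as JSON (for example with kubectl get statefulset -o json) and loaded into a dict. The helper name start_ordinal and the sample objects are hypothetical and are not part of lws or kubectl.

```python
import json
from typing import Optional


def start_ordinal(statefulset: dict) -> Optional[int]:
    """Return .spec.ordinals.start, or None when the field is absent.

    A missing .spec.ordinals on a StatefulSet that was created with a
    start ordinal is a hint (not proof) that the cluster does not have
    the start-ordinal feature enabled.
    """
    ordinals = statefulset.get("spec", {}).get("ordinals")
    if not isinstance(ordinals, dict):
        return None
    return ordinals.get("start")


# Hypothetical, trimmed-down objects for illustration only.
with_ordinals = json.loads('{"spec": {"replicas": 3, "ordinals": {"start": 1}}}')
without_ordinals = json.loads('{"spec": {"replicas": 3}}')

print(start_ordinal(with_ordinals))
print(start_ordinal(without_ordinals))
```

If the helper returns None for a StatefulSet that should carry a start ordinal, the cluster version and feature gates are the next things to check.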
Oh, I got it, our cluster is 1.26... |
Per https://github.com/kubernetes-sigs/lws/blob/main/docs/setup/install.md#before-you-begin, lws needs Kubernetes >= 1.27. |
Please close this if that fixed your issue. Thanks! |
You can still use this feature in v1.26 if you manually enable the feature gate (StatefulSetStartOrdinal).
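The version requirement and the feature-gate workaround from this thread can be summarised as a small decision rule. This is a sketch, not lws code. It assumes the gate in question is StatefulSetStartOrdinal, which is alpha and off by default in 1.26 and beta and on by default from 1.27. The function names are made up for illustration, and the rule ignores later releases where the gate may no longer be configurable.

```python
import re
from typing import Dict, Optional, Tuple

# Assumed gate name; see the Kubernetes blog post linked in the thread.
GATE = "StatefulSetStartOrdinal"


def parse_minor(git_version: str) -> Tuple[int, int]:
    """Extract (major, minor) from a version string such as 'v1.26.3'."""
    match = re.match(r"v?(\d+)\.(\d+)", git_version)
    if match is None:
        raise ValueError(f"unrecognised version: {git_version!r}")
    return int(match.group(1)), int(match.group(2))


def start_ordinal_enabled(
    git_version: str,
    feature_gates: Optional[Dict[str, bool]] = None,
) -> bool:
    """Guess whether StatefulSet start ordinals work on this cluster.

    feature_gates holds explicit overrides passed to the control plane,
    e.g. {"StatefulSetStartOrdinal": True}.
    """
    version = parse_minor(git_version)
    if version < (1, 26):
        # The gate does not exist before 1.26, so it cannot be enabled.
        return False
    explicit = (feature_gates or {}).get(GATE)
    if explicit is not None:
        return explicit
    # No override: off by default in 1.26, on by default from 1.27.
    return version >= (1, 27)
```

For a 1.26 cluster like the one in this issue, start_ordinal_enabled("v1.26.0") returns False unless the gate is explicitly set to True in feature_gates.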
|
What happened:
lws controller cannot reconcile the pod to the right status
What you expected to happen:
How to reproduce it (as minimally and precisely as possible):
Anything else we need to know?:
Environment:
Kubernetes version (use kubectl version):
LWS version (use git describe --tags --dirty --always):
OS (e.g. cat /etc/os-release):
Kernel (e.g. uname -a):