You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
OS system and architecture in which you are running QHub
Ubuntu 20
Expected behavior
Expect qhub to deploy
Actual behavior
Qhub deployment fails with this error:
[terraform]: │ Error: unable to build kubernetes objects from release manifest: [unable to recognize "": no matches for kind "ClusterRole" in version "rbac.authorization.k8s.io/v1beta1", unable to recognize "": no matches for kind "ClusterRoleBinding" in version "rbac.authorization.k8s.io/v1beta1", unable to recognize "": no matches for kind "Role" in version "rbac.authorization.k8s.io/v1beta1", unable to recognize "": no matches for kind "RoleBinding" in version "rbac.authorization.k8s.io/v1beta1"]
[terraform]: │
[terraform]: │ with module.kubernetes-autoscaling[0].helm_release.autoscaler,
[terraform]: │ on modules/cluster-autoscaler/main.tf line 1, in resource "helm_release" "autoscaler":
[terraform]: │ 1: resource "helm_release" "autoscaler" {
[terraform]: │
[terraform]: ╵
Updating qhub/qhub/template/stages/03-kubernetes-initialize/modules/cluster-autoscaler/main.tf with the following worked:
repository = "https://kubernetes.github.io/autoscaler"
chart = "cluster-autoscaler"
version = "9.18.1"
QHub is currently using the stable version 7.1.0 of the autoscaler which has a max version of 8.0.0. This version is trying to use rbac.authorization.k8s.io/v1beta1. This beta API in kubernetes has been disables by default. The latest stable release does fix the problem, but is not currently being updated. I think it's a better idea to use the actively update chart here: https://artifacthub.io/packages/helm/cluster-autoscaler/cluster-autoscaler
By looking at the two charts, we can see that the current autoscaler helm chart is quite far behind the default kubernetes version of 1.22 that QHub uses.
$ helm search repo cluster-autoscaler --versions
NAME CHART VERSION APP VERSION DESCRIPTION
cluster-autoscaler/cluster-autoscaler 9.18.1 1.23.0 Scales Kubernetes worker nodes within autoscali...
...
stable/cluster-autoscaler 8.0.0 1.17.1 Scales worker nodes within autoscaling groups.
stable/cluster-autoscaler 7.3.4 1.17.1 Scales worker nodes within autoscaling groups.
stable/cluster-autoscaler 7.3.3 1.17.1 Scales worker nodes within autoscaling groups.
I propose shifting to this alternate autoscaler chart moving forward.
The text was updated successfully, but these errors were encountered:
Wonderful, thanks for opening this issue @tylerpotts. Indeed I came to realize that last Friday when some weird behavior started with AWS. Thanks for the detailed information.
@viniciusdc Happy to help! I did notice when I finished my deployment that the dask status link is broken. I suspect it's a traefik routing issue, because when I spin up a cluster and click on the task graph link it gives me the "service unavailable" message which is a default traefik error.
Not sure if this new error is related to the new autoscaler or not
@viniciusdc Happy to help! I did notice when I finished my deployment that the dask status link is broken. I suspect it's a traefik routing issue, because when I spin up a cluster and click on the task graph link it gives me the "service unavailable" message which is a default traefik error.
Not sure if this new error is related to the new autoscaler or not
Uhm, dunno. I think that's a different one, could you check what is the current cert showing up for the dashboard page? is it a autogenerated one from Traefik/Lets-encrypt?
OS system and architecture in which you are running QHub
Ubuntu 20
Expected behavior
Expect qhub to deploy
Actual behavior
Qhub deployment fails with this error:
How to Reproduce the problem?
Deploy qhub version
0.4.1
Command output
No response
Versions and dependencies used.
Anything else?
Updating
qhub/qhub/template/stages/03-kubernetes-initialize/modules/cluster-autoscaler/main.tf
with the following worked:QHub is currently using the stable version
7.1.0
of the autoscaler which has a max version of8.0.0
. This version is trying to userbac.authorization.k8s.io/v1beta1
. This beta API in kubernetes has been disables by default. The latest stable release does fix the problem, but is not currently being updated. I think it's a better idea to use the actively update chart here: https://artifacthub.io/packages/helm/cluster-autoscaler/cluster-autoscalerBy looking at the two charts, we can see that the current autoscaler helm chart is quite far behind the default kubernetes version of
1.22
that QHub uses.I propose shifting to this alternate autoscaler chart moving forward.
The text was updated successfully, but these errors were encountered: