I've been noticing lately that my models no longer autoscale (they stay at the min number of configured pods even when CPU is maxed out). The HPA config seems to be properly generated. Could it have something to do with the resource limits on the sidecar?
Here's an example deployment YAML that isn't working. Seldon version is 0.4.0.
Can you try this example with 0.4.1-SNAPSHOT (i.e., from a clone of seldon-core)? I retested on a GKE cluster and it works. If that also works for you, we would need to look more closely at your SeldonDeployment and why it's different.
The svcOrchSpec resource limits should not stop autoscaling from happening. The custom limits you set will depend on the expected load you have for the model you will deploy.
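For anyone debugging a similar case, here is a rough sketch of how the standard Kubernetes HPA turns CPU usage into a replica count, as I understand the documented algorithm. It is not Seldon code. The container names and millicore numbers are made up for illustration, and the helper functions are hypothetical. The point is that utilization is measured against CPU *requests* (not limits), summed over every container in the pod, so the request on the orchestrator sidecar is part of the denominator.

```python
import math

def pod_cpu_utilization(usage_m, requests_m):
    """Percent utilization for one pod: summed container usage over summed requests.

    Both arguments map container name -> millicores (hypothetical values).
    Returns None when a container has no CPU request, since the HPA cannot
    define utilization in that case and takes no action for the metric.
    """
    if any(requests_m.get(name) in (None, 0) for name in usage_m):
        return None
    return 100.0 * sum(usage_m.values()) / sum(requests_m[n] for n in usage_m)

def desired_replicas(current_replicas, utilization, target, tolerance=0.1):
    """ceil(currentReplicas * current / target), skipped inside the tolerance band."""
    ratio = utilization / target
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    return math.ceil(current_replicas * ratio)

# Assumed scenario: the model container is at its full 500m request.
usage = {"classifier": 500, "svc-orch": 20}

# Large sidecar request: 520m used out of 1500m requested, well under a 70% target.
big_sidecar = pod_cpu_utilization(usage, {"classifier": 500, "svc-orch": 1000})
replicas_big = desired_replicas(1, big_sidecar, target=70)

# Small sidecar request: 520m used out of 600m requested, above the 70% target.
small_sidecar = pod_cpu_utilization(usage, {"classifier": 500, "svc-orch": 100})
replicas_small = desired_replicas(1, small_sidecar, target=70)
```

Under these assumed numbers the first case stays at one replica and the second scales to two, even though the model container is equally busy in both. When the HPA object looks correct but nothing scales, it is worth comparing the CPU requests of every container in the pod against the target utilization, and checking that none of them is missing a request.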