AWS up DescribeTags retry quota exceeded
when creating multiple clusters in parallel after upgrading kops v1.29.x -> v1.30.x
#16886
Labels
kind/bug
Categorizes issue or PR as related to a bug.
/kind bug
1. What
kops
version are you running? The commandkops version
, will displaythis information.
Any version after kops 1.29.6 (e.g. v1.30.1 and/or building from latest master branch commit)
2. What Kubernetes version are you running?
kubectl version
will print theversion if a cluster is running or provide the Kubernetes version specified as
a
kops
flag.v1.30
3. What cloud provider are you using?
aws
4. What commands did you run? What is the simplest way to reproduce this issue?
A standard create cluster. Reproducible by creating multiple clusters in parallel on busy AWS account.
5. What happened after the commands executed?
If I upgrade past kops v1.29.6 to kops v1.30.x, I start running into
retry attempt exceeded errors
when provisioning multiple kops clusters on a busy AWS account. We get the following errors during cluster creation:Perhaps this is due to a few unused
retry_max_attempts
configuration constants after kops v1.30's migration to AWS SDK V2?6. What did you expect to happen?
I expect that kops would keep retrying these specific tasks a few more times until the rate-limit buckets refill over the next few seconds. And that the cluster would finish being created.
7. Please provide your cluster manifest. Execute
kops get --name my.example.com -o yaml
to display your cluster manifest.You may want to remove your cluster name and other sensitive information.
N/A
8. Please run the commands with most verbose logging by adding the
-v 10
flag.Paste the logs into this report, or in a gist and provide the gist link here.
TODO (need to wait for one more CI run, apologies)
9. Anything else do we need to know?
Looks like a couple of max retry constants are no longer used after migrating from AWS Go SDK V1 -> V2.
I have raised a draft PR #16887
A workaround would be to raise AWS account EC2 limits, but we'd have to go through the limit request process each time our CI account changes (every 6 months).
Thank you and have a great week!
The text was updated successfully, but these errors were encountered: