
r/resource_aws_security_group: increase deletion timeout #1052

Closed
wants to merge 1 commit

Conversation

s-urbaniak

This bumps the timeout for deleting security groups, as we experienced a high flake count during cluster destruction. Increasing the timeout reduces flakes significantly.

This is a stop-gap solution until resource providers get timeout override support.

Fixes coreos/tectonic-installer#1242

/cc @radeksimko @jasminSPC
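
For readers skimming this thread, here is a minimal sketch of the kind of change being proposed: retrying `DeleteSecurityGroup` on `DependencyViolation` for a longer window. The function and surrounding wiring are illustrative assumptions based on the provider SDK of that era, not the PR's actual diff:

```go
package aws

import (
	"time"

	"github.com/aws/aws-sdk-go/aws"
	"github.com/aws/aws-sdk-go/aws/awserr"
	"github.com/aws/aws-sdk-go/service/ec2"
	"github.com/hashicorp/terraform/helper/resource"
)

// deleteSecurityGroupWithRetry retries the delete call on DependencyViolation
// until the (bumped) window elapses. Illustrative sketch only, not the PR's diff.
func deleteSecurityGroupWithRetry(conn *ec2.EC2, groupID string) error {
	return resource.Retry(30*time.Minute, func() *resource.RetryError { // previously a much shorter window
		_, err := conn.DeleteSecurityGroup(&ec2.DeleteSecurityGroupInput{
			GroupId: aws.String(groupID),
		})
		if err != nil {
			if awsErr, ok := err.(awserr.Error); ok && awsErr.Code() == "DependencyViolation" {
				// Dependent ENIs/instances may still be draining; keep retrying.
				return resource.RetryableError(err)
			}
			return resource.NonRetryableError(err)
		}
		return nil
	})
}
```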

@radeksimko radeksimko added the bug Addresses a defect in current functionality. label Jul 4, 2017
@radeksimko
Member

Hi @s-urbaniak
I'd love to solve this problem, but I think 30 minutes is a bit too high for a resource like this, given the consequences. Keep in mind this timeout also applies to users who hit a genuine DependencyViolation - i.e. users who have a dependency lying around will have to wait 30 minutes before they see the error. That's not a great user experience.

The good news is that if you can consistently reproduce this problem, we can dig in and find out what's causing it, like we did in #1021 - would you mind following the same process, i.e. sending me the debug logs + relevant tf configs?

Thanks.

@radeksimko radeksimko added the waiting-response Maintainers are waiting on response from community or contributor. label Jul 4, 2017
@s-urbaniak
Author

@radeksimko Sure, I'll try to provoke the timeout as I did in the previous case.

@jasmingacic

Maybe it wouldn't hurt to parameterise the Delete timeout?
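
For context on this suggestion: Terraform's helper/schema supports per-resource customizable timeouts, so parameterising the Delete timeout could look roughly like the sketch below. Wiring it into aws_security_group here is hypothetical, not something this PR does:

```go
package aws

import (
	"time"

	"github.com/hashicorp/terraform/helper/schema"
)

// Hypothetical sketch: declaring a configurable Delete timeout on the resource.
// Users could then override it in config with a `timeouts { delete = "30m" }` block,
// and the delete function would read it via d.Timeout(schema.TimeoutDelete).
func resourceAwsSecurityGroupWithTimeouts() *schema.Resource {
	return &schema.Resource{
		// Schema and CRUD functions elided for brevity.
		Timeouts: &schema.ResourceTimeout{
			Delete: schema.DefaultTimeout(10 * time.Minute), // default; overridable per resource instance
		},
	}
}
```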

@radeksimko
Member

radeksimko commented Jul 4, 2017

@jasminSPC I'd prefer not to do it in this context for this resource. See my longer explanation in #945 (comment)

There's no good reason for an SG removal to take that long. Yes, the APIs are eventually consistent and sometimes laggy, but that's a matter of minutes, not half an hour. If the user has specified references between resources correctly (i.e. we aren't deleting everything at the same time), then Terraform will go resource by resource, and the only reason for being stuck so long in this situation is a resource not cleaning up after itself on Amazon's side.

That is likely what's happening here, so once we understand what's holding the SG back from removal, we can do the cleanup ourselves, without having to crank up timeouts, and make things work out of the box for everyone. 🙂
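
To make the "clean up ourselves" idea concrete, here is a rough sketch of what that could look like: before deleting the group, find and remove detached network interfaces that still reference it. This is purely illustrative and not code from the provider:

```go
package aws

import (
	"github.com/aws/aws-sdk-go/aws"
	"github.com/aws/aws-sdk-go/service/ec2"
)

// cleanupLingeringENIs deletes unattached network interfaces that still
// reference the given security group, so the group itself can be removed
// without waiting out a long DependencyViolation retry window.
// Illustrative sketch only.
func cleanupLingeringENIs(conn *ec2.EC2, groupID string) error {
	out, err := conn.DescribeNetworkInterfaces(&ec2.DescribeNetworkInterfacesInput{
		Filters: []*ec2.Filter{
			{Name: aws.String("group-id"), Values: []*string{aws.String(groupID)}},
		},
	})
	if err != nil {
		return err
	}
	for _, eni := range out.NetworkInterfaces {
		// Skip interfaces that are still attached; those belong to resources
		// Terraform should destroy first via its dependency graph.
		if eni.Attachment != nil {
			continue
		}
		if _, err := conn.DeleteNetworkInterface(&ec2.DeleteNetworkInterfaceInput{
			NetworkInterfaceId: eni.NetworkInterfaceId,
		}); err != nil {
			return err
		}
	}
	return nil
}
```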

@Ninir
Contributor

Ninir commented Aug 17, 2017

Hey @s-urbaniak ,

As @radeksimko pointed out, this issue seems quite specific, since not many people are complaining about it.

I don't want to rush anyone on this one (😅), but we should either find a solution if there is a real issue, or close it if it's not reproducible.

Do you think you would have time to investigate?

Thanks!

@s-urbaniak
Author

Closing, as we will try to gather more data points with trace logging (TF_LOG=TRACE) enabled and come up with a more sensible solution.

@s-urbaniak s-urbaniak closed this Aug 28, 2017
@Ninir
Contributor

Ninir commented Aug 28, 2017

Thank you for that @s-urbaniak :)

@ghost

ghost commented Apr 11, 2020

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. Thanks!

@ghost ghost locked and limited conversation to collaborators Apr 11, 2020
@breathingdust breathingdust removed the waiting-response Maintainers are waiting on response from community or contributor. label Sep 17, 2021
Labels
bug Addresses a defect in current functionality.
Successfully merging this pull request may close these issues:
aws flake aws_security_group.*: DependencyViolation
5 participants