
Existing A record error #1562

Closed
bertinatto opened this issue Apr 9, 2019 · 7 comments

Comments

@bertinatto
Member

Version

$ bin/openshift-install version
bin/openshift-install unreleased-master-729-g83ca64d34e7e16935cbaa39fb42a6c8ba9d58c33
built from commit 83ca64d34e7e16935cbaa39fb42a6c8ba9d58c33
release image registry.svc.ci.openshift.org/openshift/origin-release:v4.0

What happened?

I've seen this error occasionally:

ERROR                                              
ERROR Error: Error applying plan:                  
ERROR                                              
ERROR 1 error occurred:                            
ERROR 	* module.dns.aws_route53_record.api_external: 1 error occurred: 
ERROR 	* aws_route53_record.api_external: [ERR]: Error building changeset: InvalidChangeBatch: [Tried to create resource record set [name='api.myname.mydevcluster.com.', type='A'] but it already exists] 
ERROR 	status code: 400, request id: 2315fe75-5ac2-11e9-9982-7b4f4ee60637 
ERROR                                              
ERROR                                              
ERROR                                              
ERROR                                              
ERROR                                              
ERROR Terraform does not automatically rollback in the face of errors. 
ERROR Instead, your Terraform state file has been partially updated with 
ERROR any resources that successfully completed. Please address the error 
ERROR above and apply again to incrementally change your infrastructure. 
ERROR                                              
ERROR                                              
FATAL failed to fetch Cluster: failed to generate asset "Cluster": failed to create cluster: failed to apply using Terraform 

Then I head to the AWS Console and delete the records manually.

I'm not sure how often this happens out there, but it might be worthwhile to parse this AWS error and print a more helpful message to the user.

Ideally, the message would instruct the user to delete the records manually and then proceed with the installation.
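For illustration, here is a minimal Go sketch of the kind of error wrapping suggested here. diagnoseRoute53Error is a hypothetical helper, not part of the installer; it simply matches the text of the AWS error above and appends a hint:

package main

import (
	"fmt"
	"strings"
)

// diagnoseRoute53Error is a hypothetical helper (not the installer's actual code)
// that recognizes the AWS "already exists" failure and appends a suggestion.
func diagnoseRoute53Error(err error) error {
	msg := err.Error()
	if strings.Contains(msg, "InvalidChangeBatch") && strings.Contains(msg, "already exists") {
		return fmt.Errorf("%s\n\nhint: a Route53 record for this cluster name already exists, "+
			"probably left behind by an earlier cluster that was not fully destroyed; "+
			"run 'openshift-install destroy cluster' for the old cluster or delete the stale "+
			"record in the Route53 console, then retry the installation", msg)
	}
	return err
}

func main() {
	// The raw error text is taken from the log above.
	raw := fmt.Errorf("InvalidChangeBatch: [Tried to create resource record set " +
		"[name='api.myname.mydevcluster.com.', type='A'] but it already exists]")
	fmt.Println(diagnoseRoute53Error(raw))
}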

@wking
Member

wking commented Apr 9, 2019

Ideally, the message would instruct the user to delete the records manually and then proceed with the installation.

That's not always the appropriate response. See rhbz#1659970 (fixed by #1442) for why we error out in this case. Your issue is almost certainly a failed/forgotten uninstall of an earlier cluster; have you been running destroy cluster when you're done with old clusters? But I'm ok with elaborating on this and other Terraform errors (see my stub in #1452) if we wanted to pursue that direction.

@bertinatto
Member Author

Your issue is almost certainly a failed/forgotten uninstall of an earlier cluster

Yep, that's my case. No doubt it was my fault, but since we want to make the installer very accessible to our users, it might be worthwhile to point them in the right direction for solving common errors (if that's possible). #1452 looks very useful; perhaps we could have a pointer to it in the error message?

@chmouel
Member

chmouel commented May 4, 2019

I have this issue happening to me every time (openshift-dev us-east-2 account), even though I run a "destroy cluster"

@wking
Member

wking commented May 4, 2019

I have the issue happening to me every time...

Have you removed the leaked A record? You need to recover this manually after the buggy openshift-dev reaper partially removes the cluster. Running destroy cluster before the reaper gets to your cluster will keep this from happening, but it won't help after the reaper removes your private zone.
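For anyone cleaning this up by hand, here is a rough sketch with the aws-sdk-go v1 client of how one might check whether the leaked A record is still sitting in the public zone. The zone ID and record name below are placeholders, and this is not installer code:

package main

import (
	"fmt"
	"log"

	"github.com/aws/aws-sdk-go/aws"
	"github.com/aws/aws-sdk-go/aws/session"
	"github.com/aws/aws-sdk-go/service/route53"
)

func main() {
	// Placeholder values: the public hosted zone ID for the base domain and the
	// cluster's API record name; substitute your own.
	zoneID := "Z3EXAMPLE"
	apiName := "api.myname.mydevcluster.com."

	sess := session.Must(session.NewSession())
	svc := route53.New(sess)

	// List record sets starting at the API name; Route53 returns them in order,
	// so the first entry tells us whether the leaked A record is still there.
	out, err := svc.ListResourceRecordSets(&route53.ListResourceRecordSetsInput{
		HostedZoneId:    aws.String(zoneID),
		StartRecordName: aws.String(apiName),
		StartRecordType: aws.String("A"),
		MaxItems:        aws.String("1"),
	})
	if err != nil {
		log.Fatal(err)
	}
	for _, rr := range out.ResourceRecordSets {
		if aws.StringValue(rr.Name) == apiName && aws.StringValue(rr.Type) == "A" {
			fmt.Printf("leaked record still present: %s (%s)\n",
				aws.StringValue(rr.Name), aws.StringValue(rr.Type))
		}
	}
}

Once located, the stale record can be removed in the Route53 console, as described earlier in the thread.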

@abhinavdahiya
Contributor

The openshift-dev account should be cleaning these up correctly. If the installer doesn't find the private zone, the public records cannot be deleted, for safety.

/close
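A rough, self-contained illustration of the safety rule described above (not the installer's actual destroy code; names and behavior are simplified):

package main

import "fmt"

// cleanupDNS illustrates the rule: public records are only deleted when the
// cluster's private hosted zone is still present, since that zone is what ties
// the public records to this particular cluster.
func cleanupDNS(privateZoneID string, publicRecords []string) {
	if privateZoneID == "" {
		fmt.Println("private zone not found; leaving public records alone for safety")
		return
	}
	for _, r := range publicRecords {
		fmt.Printf("deleting public record %s (private zone %s confirmed)\n", r, privateZoneID)
	}
}

func main() {
	// After a partial teardown the private zone is gone but the A record remains,
	// so nothing is deleted automatically and the record must be removed by hand.
	cleanupDNS("", []string{"api.myname.mydevcluster.com."})
	// A normal destroy still sees the private zone and removes the records.
	cleanupDNS("Z1PRIVATEEXAMPLE", []string{"api.myname.mydevcluster.com."})
}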

@openshift-ci-robot
Contributor

@abhinavdahiya: Closing this issue.

In response to this:

The openshift-dev account should be cleaning these up correctly. If the installer doesn't find the private zone, the public records cannot be deleted, for safety.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@lukas-vlcek

lukas-vlcek commented Aug 16, 2019

I am running into a similar issue, but I am unable to find my domain in the Hosted zones list, so I cannot delete my Route53 record in the AWS console. Am I looking at the wrong dashboard?

Never mind, I found it. I needed to click the parent domain first, and then it showed up...
