ResourceInUse: Service contains registered instances; delete the instances before deleting the service #4853

ghost · 2018-06-16T02:20:04Z

This issue was originally opened by @tuaris as hashicorp/terraform#18264. It was migrated here as a result of the provider split. The original body of the issue is below.

Terraform Version

0.11.5

Terraform Configuration Files

resource "aws_service_discovery_private_dns_namespace" "app_service" {
	name = "app.service"
	vpc = "vcp-12345678"
}

resource "aws_service_discovery_service" "worker" {
	name = "worker"
	dns_config {
		namespace_id = "${aws_service_discovery_private_dns_namespace.app_service.id}"
		dns_records {
			ttl = 10
			type = "A"
		}
		routing_policy = "MULTIVALUE"
	}

	health_check_custom_config {
		failure_threshold = 1
	}
}

resource "aws_ecs_service" "worker" {
        ...
	service_registries {
		registry_arn = "${aws_service_discovery_service.worker.arn}"
                container_name = "worker"
	}
}

Debug Output

Error: Error applying plan:

1 error(s) occurred:

* aws_service_discovery_service.worker(destroy): 1 error(s) occurred:

* aws_service_discovery_service.worker: ResourceInUse: Service contains registered instances; delete the instances before deleting the service

Expected Behavior

Removing the resource aws_service_discovery_service.worker should first stop the service aws_ecs_service.worker, then proceed to delete the resource.

Actual Behavior

Process fails with ResourceInUse: Service contains registered instances; delete the instances before deleting the service

Steps to Reproduce

To reproduce the issue, for example:

Use the configuration above
terraform apply
Remove the resource aws_service_discovery_service.worker
terraform apply

The text was updated successfully, but these errors were encountered:

jackbritchford · 2018-08-24T15:35:54Z

Still an issue, in my case it's a bit of a pain to add depends_on as this is abstracted away in a module... so

ms14981 · 2018-08-29T16:56:15Z

This does seem to be a nasty bug, because terraform should be able to handle deleting resources in the correct order, but it doesn't seem to be in this case. I'm also having this issue with terraform destroy and aws_service_discovery. Currently manual deletion of AWS resources is required when this error happens.

choeflake · 2018-10-03T15:04:46Z

+1

pradeepbhadani · 2018-10-24T08:05:52Z

any workaround to this issue?

jasonfissure · 2018-11-16T05:52:47Z

I'm having the same issue pradeepbhadani cited. After a "terraform destroy" on an ECS fargate environment, I end up with orphaned DNS records in the service discovery namespace that cannot be deleted manually as they are managed by the service discovery service. Then, because those records are still there, the service discovery namespace cannot be deleted.

jasonfissure · 2018-11-19T08:28:32Z

This issue is happening to me while running:

Terraform v0.11.9
provider.aws v1.32.0
provider.template v1.0.0

This is requiring AWS Support personnel to go in and delete the orphaned DNS records manually before the Service Discovery namespace can be deleted using AWS CLI.

jasonfissure · 2018-11-30T09:36:53Z

I've run into this again in another scenario where the namespace wasn't being deleted. A service was being destroyed as part of updating it. I have ended up with an orphaned service discovery operation. An instance was attempted to be registered, but, the underlying ECS service was already destroyed. I'm again left with requiring AWS Support to go in and fix things behind the scenes.

alexrudd · 2018-12-11T14:43:45Z

I'm experiencing this, though with custom service instances (not ECS).

Some kind of force_delete attribute on the service might help so that terraform can cycle through and deregister any instances left in the service before attempting to delete the service.

abhimanyugupta07 · 2019-01-18T13:32:56Z

I was able to resolve this by running

aws servicediscovery list-services --region us-west-2

then selecting my service's ID from the list and running.

aws servicediscovery delete-service --id srv-oy************x

adeelahmadch · 2019-01-21T14:36:30Z

In my case there was a modification in service discovery resource and terraform was unable to destroy the old resource. So I have to do it manually.

module.fargate_staging.aws_service_discovery_service.services[75]: Destroying... (ID: srv-jXXXXXXXXXX)

Solution:

Based on the service id, first i have to find the attached instance-id,

- aws servicediscovery list-instances --service-id=srv-jXXXXXXXXXX --region=eu-central-1 --profile=staging

{
    "Instances": [
        {
            "Attributes": {
                "AWS_INSTANCE_IPV4": "172.XX.XXX.XXX",
                "AWS_INIT_HEALTH_STATUS": "HEALTHY",
                "AVAILABILITY_ZONE": "eu-central-1c",
                "REGION": "eu-central-1",
                "ECS_SERVICE_NAME": "abcxyz",
                "ECS_CLUSTER_NAME": "staging",
                "ECS_TASK_DEFINITION_FAMILY": "staging-abcxyz"
            },
            "Id": "337cfbfd-bc9d-4b42-8a10-ABCXYZ913"
        }
    ]
}

Once i have attached instance-id, i have to deregister it before i delete the service.

- aws servicediscovery deregister-instance --service-id=srv-jXXXXXXXXXX --instance-id=337cfbfd-bc9d-4b42-8a10-ABCXYZ913 --region=eu-central-1 --profile=staging

- aws servicediscovery delete-service --id srv-jXXXXXXXXXX --region=eu-central-1 --profile=staging

It looks like terraform needs to fix this bug :)

dotjim · 2019-01-24T11:14:44Z

Removing the resource aws_service_discovery_service.worker should first stop the service aws_ecs_service.worker, then proceed to delete the resource.

This is true if the associated aws_ecs_service resource itself is being removed or replaced.

The issue also occurs more fundamentally when Terraform needs to remove or replace just the aws_service_discovery_service resource itself in isolation - for example if the dns_records.type is subsequently changed. There are no other resource dependencies, however Terraform fails with same error as it does not first remove the existing service discovery instance records:

aws_service_discovery_service.{name}: ResourceInUse: Service contains registered instances; delete the instances before deleting the service

Until the Terraform AWS provider removes existing service discovery instance records, our options seem limited to manual removal or a destroy time provisioner.

The latter really doesn't sit well with me as it introduces risk, dependencies on the host machine running Terraform having AWS CLI and appropriate privileges - however in a heavily automated CI/CD environment it's perhaps a better interim workaround than random failures and manual intervention.

resource "aws_service_discovery_service" "core" {
  [..]

  /**
   * Workaround to https://github.com/terraform-providers/terraform-provider-aws/issues/4853
   * Terraform does not deregister existing service discovery instance records prior to removing
   * the `aws_service_discovery_service` resource, causing AWS to error with:
   *    ResourceInUse: Service contains registered instances; delete the instances before deleting the service
   */
  provisioner "local-exec" {
    when    = "destroy"
    command = <<EOF_COMMAND
      SERVICE_ID=$(aws servicediscovery list-services --filters '[{"Name":"NAMESPACE_ID","Values":["${var.service_discovery_namespace_id}"]}]' --region ${var.aws_region} \
        --query 'Services[?Name == `${var.service_discovery_name}`].Id' --output text) && \
      aws servicediscovery discover-instances --namespace-name ${var.service_discovery_domain} --service-name ${var.service_discovery_name} \
        --query 'Instances[*].InstanceId | join(`"\n"`, @)' --output text \
      | xargs -I {INSTANCE_ID} aws servicediscovery deregister-instance --service-id $SERVICE_ID --instance-id {INSTANCE_ID} && sleep 5
EOF_COMMAND
  }
}

richardj-bsquare · 2019-02-26T12:43:12Z

In my scenario, I have a direct dependency between the ECS service and service-discovery-service (the service references the service discovery service ARN).

In my case, a seemingly simple change to the DNS TTL value in the service discovery caused me to encounter this problem.

iTaybb · 2019-03-12T20:51:58Z

Same happens here.

milanvdm · 2019-06-21T09:01:10Z

We are hitting the same scenario on services which are already running in production.
The proposed solutions on this issue are restarting your service-discovery instance but I assume this means downtime?

sarjuymd · 2019-07-10T14:29:23Z

+1

eedwards-sk · 2019-09-11T23:03:15Z

This is a bad bug as it basically makes the provider feature incomplete and broken. IMO the original feature should have never been released if it doesn't handle this scenario.

ihakimi · 2019-09-26T08:15:43Z

+1

xiang-chen-0 · 2019-10-15T14:32:11Z

+1

dggmsa · 2019-10-28T12:52:45Z

+1

araddas · 2019-11-01T18:30:11Z

+1

hvar90 · 2019-12-11T15:35:08Z

just stop the task first before delete the service

subtubes-io · 2019-12-12T02:48:31Z

I was able to resolve this by running

aws servicediscovery list-services --region us-west-2

then selecting my service's ID from the list and running.

aws servicediscovery delete-service --id srv-oy************x

that fixed it for me

MooreDerek · 2020-02-17T00:45:07Z

+1

binarymist · 2020-06-03T10:36:59Z

Is there a workaround that actually works around?

japgolly · 2020-06-10T08:55:14Z

This has been working great for me:

Add to aws_service_discovery_service resources:

  # Remove after https://github.com/terraform-providers/terraform-provider-aws/issues/4853 is resolved
  provisioner "local-exec" {
    when    = destroy
    command = "${path.module}/servicediscovery-drain.sh ${self.id}"
  }

servicediscovery-drain.sh:

#!/bin/bash

[ $# -ne 1 ] && echo "Usage: $0 <service-id>" && exit 1

serviceId="--service-id=$1"

echo "Draining servicediscovery instances from $1 ..."
ids="$(aws servicediscovery list-instances $serviceId --query 'Instances[].Id' --output text | tr '\t' ' ')"

found=
for id in $ids; do
  if [ -n "$id" ]; then
    echo "Deregistering $1 / $id ..."
    aws servicediscovery deregister-instance $serviceId --instance-id "$id"
    found=1
  fi
done

# Yes, I'm being lazy here...
[ -n "$found" ] && sleep 5 || true

KevinGimbel · 2020-11-19T14:14:14Z

Having the same issue right now.

MTB90 · 2020-11-24T12:41:46Z

The same problem :(

MooreDerek · 2021-01-22T00:00:00Z

Any progress on this one. Hitting it with TF 0.14.4 and AWS Provider 3.23.0

kyle-thedelta · 2021-02-10T08:46:20Z

Experiencing the same issue on latest TF and AWS:

Terraform v0.14.6
+ provider registry.terraform.io/hashicorp/archive v2.0.0
+ provider registry.terraform.io/hashicorp/aws v3.26.0
+ provider registry.terraform.io/hashicorp/null v3.0.0

Guy-Rawsthorn · 2021-05-05T14:59:27Z

I'm struggling with this while attempting @japgolly solution using a local-exec provisioner and calling a shell script to deregister the service.

provisioner "local-exec" {
    when    = destroy
    command = "../../servicediscovery-drain.sh ${self.id} $PROFILE $REGION"

    environment = {
      REGION = var.region
      PROFILE = var.profile 
    }
  }

I want to pass in TF vars of profile and region to the local-exec provisioner - however tf has restricted this whilst when=destroy. Any way around this?

hashicorp/terraform#23679

github-actions · 2021-09-02T22:28:27Z

This functionality has been released in v3.57.0 of the Terraform AWS Provider. Please see the Terraform documentation on provider versioning or reach out if you need any assistance upgrading.

For further feature requests or bug reports with this functionality, please create a new GitHub issue following the template. Thank you!

141984 · 2022-03-08T10:13:50Z

Experienced this issue during terraform destroy of ECS services running on EC2 instances.

Terraform v1.0.11
on linux_amd64
+ provider registry.terraform.io/hashicorp/aws v3.65.0

.....

Error: error deleting Service Discovery Service (srv-1234545656gdfghltx): ResourceInUse: Service contains registered instances; delete the instances before deleting the service


Error: error deleting Service Discovery Service (srv-1234545656gdfghlbv): ResourceInUse: Service contains registered instances; delete the instances before deleting the service

ekuongm · 2022-03-22T13:16:29Z

Experienced the same issue :

Terraform v0.15.3
on linux_amd64

provider registry.terraform.io/hashicorp/aws v4.6.0

ProofOfPizza · 2022-04-04T13:40:08Z

Ran into it today using TF v1.1.7

github-actions · 2022-05-05T02:29:29Z

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

ghost mentioned this issue Jun 16, 2018

ResourceInUse: Service contains registered instances; delete the instances before deleting the service hashicorp/terraform#18264

Closed

bflad added enhancement Requests to existing resources that expand the functionality or scope. service/servicediscovery Issues and PRs that pertain to the servicediscovery service. labels Jun 26, 2018

pradeepbhadani mentioned this issue Nov 2, 2018

KNOWN ISSUE: terraform destroy does not destroy service discovery namespace ExpediaGroup/apiary-data-lake#82

Closed

ewbankkit mentioned this issue Aug 19, 2020

Force Delete Service Discovery Service #3538

Merged

bilbof mentioned this issue Sep 17, 2020

Use a service mesh for internal networking in ECS alphagov/govuk-infrastructure#17

Merged

matthew-a-carr mentioned this issue Mar 30, 2021

Risk: Service discovery between webapp/service in AWS mcagov/beacons#29

Open

ewbankkit closed this as completed in #3538 Sep 1, 2021

github-actions bot added this to the v3.57.0 milestone Sep 1, 2021

github-actions bot locked as resolved and limited conversation to collaborators May 5, 2022

ResourceInUse: Service contains registered instances; delete the instances before deleting the service #4853

ResourceInUse: Service contains registered instances; delete the instances before deleting the service #4853

Comments

ghost commented Jun 16, 2018

Terraform Version

Terraform Configuration Files

Debug Output

Expected Behavior

Actual Behavior

Steps to Reproduce

jackbritchford commented Aug 24, 2018

ms14981 commented Aug 29, 2018

choeflake commented Oct 3, 2018

pradeepbhadani commented Oct 24, 2018

jasonfissure commented Nov 16, 2018

jasonfissure commented Nov 19, 2018 • edited Loading

jasonfissure commented Nov 30, 2018

alexrudd commented Dec 11, 2018

abhimanyugupta07 commented Jan 18, 2019

adeelahmadch commented Jan 21, 2019

dotjim commented Jan 24, 2019

richardj-bsquare commented Feb 26, 2019 • edited Loading

iTaybb commented Mar 12, 2019

milanvdm commented Jun 21, 2019

sarjuymd commented Jul 10, 2019

eedwards-sk commented Sep 11, 2019

ihakimi commented Sep 26, 2019

xiang-chen-0 commented Oct 15, 2019

dggmsa commented Oct 28, 2019

araddas commented Nov 1, 2019

hvar90 commented Dec 11, 2019

subtubes-io commented Dec 12, 2019

MooreDerek commented Feb 17, 2020

binarymist commented Jun 3, 2020

japgolly commented Jun 10, 2020

KevinGimbel commented Nov 19, 2020

MTB90 commented Nov 24, 2020

MooreDerek commented Jan 22, 2021

kyle-thedelta commented Feb 10, 2021

Guy-Rawsthorn commented May 5, 2021 • edited Loading

github-actions bot commented Sep 2, 2021

141984 commented Mar 8, 2022

ekuongm commented Mar 22, 2022

ProofOfPizza commented Apr 4, 2022 • edited Loading

github-actions bot commented May 5, 2022

jasonfissure commented Nov 19, 2018 •

edited

Loading

richardj-bsquare commented Feb 26, 2019 •

edited

Loading

Guy-Rawsthorn commented May 5, 2021 •

edited

Loading

ProofOfPizza commented Apr 4, 2022 •

edited

Loading