Error Messaging Updates for Diagnostics #17314

bflad · 2021-01-27T14:54:28Z

Community Note

Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
Please do not leave "+1" or other comments that do not add relevant new information or questions, they generate extra noise for issue followers and do not help prioritize the request
If you are interested in working on this issue or have submitted a pull request, please leave a comment

Description

Given the age of the Terraform AWS Provider and some of its resources, there has been a long history of varied styles for returning resource error messages. Some places return no context (return err) while others return some additional context (return fmt.Errorf("...", err)) with varied message styles.

Through much of the history of these error messages (prior to Terraform CLI version 0.12), the user interface experience when showing these error messages was in a "go-multierror" style (destroy operation error shown below):

Error: Error applying plan:

1 error(s) occurred:

* aws_db_instance.rds_sc_node (destroy): 1 error(s) occurred:

* aws_db_instance.rds_sc_node: DB Instance FinalSnapshotIdentifier is required when a final snapshot is required

Where Terraform CLI injected all this user interface output:

Error: Error applying plan:

1 error(s) occurred:

* aws_db_instance.rds_sc_node (destroy): 1 error(s) occurred:

* aws_db_instance.rds_sc_node:

And the Terraform resource error was the remaining part (as sourced from return ...):

DB Instance FinalSnapshotIdentifier is required when a final snapshot is required

When returning AWS Go SDK (e.g. API) error messages, the lack of returning error message context meant that practitioners could be faced with very confusing errors, such as this refresh operation example:

Error refreshing state: 1 error(s) occurred:

* 1 error(s) occurred:

* AccessDenied: Access Denied
    status code: 403, request id: []

To rectify this, the maintainers moved towards requiring a consistent error messaging context including a noun and verb in addition to the actual error, so it was more clear about what was happening and where. e.g. return fmt.Errorf("error creating/reading/updating/deleting Service Thing (%s): %w", d.Id(), err). This preferred style is documented in the Contributing Guide.

Terraform CLI version 0.12 and later added support for resources to return both warning and error diagnostics in the user interface output. Diagnostics support both a summary and details fields. Given these changes, the output is now much more terse:

Error: Summary

Details

Warning: Summary

Details

To compensate for existing resources that return a raw error type, the Terraform Plugin SDK converts these errors into an error diagnostic with the error message string in the summary field. Given the prior consistency standard for error messages, the error messaging now has an awkward output:

Error: error creating/reading/updating/deleting Service Thing (abc123): ErrCode: ErrMessage

Rather than introduce yet more error message stylings with raw error types (since they should be updated anyways), it will likely be ideal to address this as or after resources are migrated to Terraform Plugin SDK version 2 and fully support diagnostics. This will allow resources to properly inject a summary and separate the details.

Some initial proposals include introducing helpers to wrap error messages before they are returned in resource logic. For example, having error diagnostic wrapper(s) such as:

// The below represents illustrative examples where the details of the
// naming, messaging, and function signatures are only to show the
// conceptual functionality and potential.

// "Simple" wrapper

func ResourceReadErrorDiagnostic(service string, component string, identifier string, err error) diag.Diagnostic {
  return diag.Diagnostic{
    Severity: diag.Error,
    Summary: fmt.Sprintf("Unable to Read %s %s (%s)", service, component, identifier),
    Detail: err.Error(),
  }
}

// "Error Type" wrapper

func ResourceReadErrorDiagnostic(service string, component string, identifier string, err error) diag.Diagnostic {
  summary := fmt.Sprintf("Unable to Read %s %s (%s)", service, component, identifier)
  var awsError awserr.Error
  var resourceTimeoutError resource.TimeoutError

  if errors.As(err, &awsError) {
    return diag.Diagnostic{
      Severity: diag.Error,
      Summary: summary,
      Detail: fmt.Sprintf("AWS API operation error: %s", awsError.Error()),
    }
  }

  if errors.As(err, &resourceTimeoutError) {
    return diag.Diagnostic{
      Severity: diag.Error,
      Summary: summary,
      Detail: fmt.Sprintf("Terraform resource timeout error: %s", resourceTimeoutError.Error()),
    }
  }

  return diag.Diagnostic{
    Severity: diag.Error,
    Summary: summary,
    Detail: err.Error(),
  }
}

// "Advanced Error Type" wrapper
// return AwsError{Operation: "DescribeVpcs", Error: err}

type AwsError struct {
  Operation string
  Error err
  // to get really detailed/fancy
  // Endpoint string
  // Region string
}

func ResourceReadErrorDiagnostic(service string, component string, identifier string, err error) diag.Diagnostic {
  summary := fmt.Sprintf("Unable to Read %s %s (%s)", service, component, identifier)
  var awsError AwsError
  var resourceTimeoutError resource.TimeoutError

  if errors.As(err, &awsError) {
    return diag.Diagnostic{
      Severity: diag.Error,
      Summary: summary,
      Detail: fmt.Sprintf("AWS API operation (%s) error: %s", awsError.Operation, awsError.Error()),
    }
  }

  if errors.As(err, &resourceTimeoutError) {
    return diag.Diagnostic{
      Severity: diag.Error,
      Summary: summary,
      Detail: fmt.Sprintf("Terraform resource timeout error: %s", resourceTimeoutError.Error()),
    }
  }

  return diag.Diagnostic{
    Severity: diag.Error,
    Summary: summary,
    Detail: err.Error(),
  }
}

Which the above could render as:

Error: Unable to Read EC2 VPC (vpc-123)

AWS API operation (DescribeVpcs) error: AccessDenied: AccessDenied

References

The text was updated successfully, but these errors were encountered:

bflad · 2021-01-27T15:44:21Z

Until this handling is decided, we could consider dropping any leading error prefix in fmt.Error() messages to remove the Error: error ... problem now. I would just be worried about creating yet more styles that will make things harder to find/replace when we get to this point. (Unless we want to tackle that now, but I think we should let practitioners weigh in on the urgency of that type of work given everything else going on. Those updates might make the error messages confusing without further sentence/context updates, making more work.)

YakDriver · 2021-02-26T20:12:55Z

Excellent write up. I concur on sticking with one standard to facilitate future replacing of fmt.Errorf("error....

github-actions · 2023-12-26T17:42:33Z

Marking this issue as stale due to inactivity. This helps our maintainers find and focus on the active issues. If this issue receives no comments in the next 30 days it will automatically be closed. Maintainers can also remove the stale label.

If this issue was automatically closed and you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. Thank you!

github-actions · 2024-02-29T02:02:03Z

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

bflad added thinking technical-debt Addresses areas of the codebase that need refactoring or redesign. provider Pertains to the provider itself, rather than any interaction with AWS. labels Jan 27, 2021

ghost added the service/rds Issues and PRs that pertain to the rds service. label Jan 27, 2021

bflad mentioned this issue Jan 27, 2021

New Resource: aws_imagebuilder_image #16710

Merged

This was referenced Jan 27, 2021

data-source/aws_route53_zone: Perform NS record lookup for private Hosted Zones #17002

Merged

err="rpc error: code = Unavailable desc = transport is closing" #16073

Closed

Better diagnostics for “multiple VPC Endpoint Services matched” #17415

Closed

hc-github-team-terraform-aws removed the thinking label Oct 6, 2021

ewbankkit mentioned this issue Jan 26, 2022

Remove "error " prefix from fmt.Errorf returned error strings hashicorp/aws-cloudformation-resource-schema-sdk-go#35

Closed

ewbankkit mentioned this issue Jan 17, 2023

[Enhancement]: Standardize and enhance error handling #28891

Closed

github-actions bot added the stale Old or inactive issues managed by automation, if no further action taken these will get closed. label Dec 26, 2023

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Jan 29, 2024

github-actions bot locked as resolved and limited conversation to collaborators Feb 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error Messaging Updates for Diagnostics #17314

Error Messaging Updates for Diagnostics #17314

bflad commented Jan 27, 2021 •

edited

Loading

bflad commented Jan 27, 2021

YakDriver commented Feb 26, 2021

github-actions bot commented Dec 26, 2023

github-actions bot commented Feb 29, 2024

Error Messaging Updates for Diagnostics #17314

Error Messaging Updates for Diagnostics #17314

Comments

bflad commented Jan 27, 2021 • edited Loading

Community Note

Description

References

bflad commented Jan 27, 2021

YakDriver commented Feb 26, 2021

github-actions bot commented Dec 26, 2023

github-actions bot commented Feb 29, 2024

bflad commented Jan 27, 2021 •

edited

Loading