Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Overhaul Module for Consistency #17

Merged
merged 20 commits into from
Aug 29, 2021
Merged

Overhaul Module for Consistency #17

merged 20 commits into from
Aug 29, 2021

Conversation

Nuru
Copy link
Contributor

@Nuru Nuru commented Jun 27, 2021

what

  • Input use_name_prefix replaced with create_before_destroy. Previously, create_before_destroy was always set to true but of course that fails if you are not using a name prefix, because the names must be unique. Now the name is automatically a prefix if create_before_destroy is true and not if it is not.
  • Input security_group_enabled renamed to create_security_group. Whether the security group is created or not, it will be enabled, and setting security_group_enabled to false does not disable the entire module, even though the module is named "security-group", which makes the old name terribly confusing. The new name is more descriptive.
  • Input id renamed to target_security_group_id. Again id by itself is too vague. Converted to list to conform to new standard pattern that optional inputs which are used in conditionals are passed as list elements. See Hashicorp recommendation
  • Added a security_group_name input, which, if set, will set the security group name. If not set, name will be derived from null-label. Because the security group name must be unique within an account, we should provide some way for people to set/override it other than forcing them to create a customized null-label.
  • As a convenience, added rule_matrix . Many of our modules allow users to simply give a list of security groups to allow access to the new resource, typically called allowed_security_groups. This variable allows for easy migration by closely paralleling the existing resource creation code. It allows any number of rules to be applied to any combined list of security groups and CIDRs. See example.
  • As a convenience, added allow_all_egress. AWS by default allows full egress to newly created security groups. Terraform removes this when taking over a security group, but our modules frequently want to restore it. Historically, though, the modules have implemented this slightly differently, and few or none have allowed IPv6 egress. Adding this boolean gives us a way to enable it simply and consistently (as opposed to every module writing its own egress rule).
  • Abandoned the attempt to create stable keys for rules to use in for_each. Existing generated keys were not guaranteed unique, and keys that were generated and guaranteed to be unique would not be known at plan time and thus could not be used. Instead, provide option for user to provide stable keys and, if not provided, generate keys knowing they might not be stable.

why

  • This module is the foundation for how we'll handle security groups across all of our modules and we need to ensure greater consistency from the onset

naming conventions

We want to migrate to a consistent set of name across modules. However, it is also quite painful to be forced to rename, so where possible I would like to maintain existing names but mark them deprecated and feed them into the new names in main.tf locals{}. We have also already seen issues with the most recent set of name changes. Therefore I propose these names with these meanings:

associated_security_group_ids

associated_security_group_ids is a list of IDs of Security Group that are "associated" (AWS' term) with the resource being created. In other words, the new resource is placed in or becomes a member of the Security Groups identified by the ID.

Most often our modules got this information as existing_security_groups and a boolean use_existing_security_groups, and the recent change was to call this input simply security_groups. However, there is no consistency in naming in the AWS provider (redshift_cluster calls it cluster_security_groups, elasticache_replication_group calls it security_group_ids, ec2_instance calls it just security_groups but accepts legacy security group names as well as IDs) and in our modules we typically have several lists of security groups (see below), so just calling this security_groups is very confusing.

Using "associated" makes it clear the purpose, and suffixing with "ids" makes it clear the type. Since AWS in inconsistent and variously uses ARNs, IDs, and Names to identify resources, I think including the type is very helpful.

allowed_security_group_ids

allowed_security_group_ids are security groups that are allowed ingress to the resource being created. Typically rules allowing this are added to the single created security group, as it should be unnecessary for an existing security group, but where desired, these rules can be added to the first in the list of "associated" security groups

allow_all_egress

AWS by default creates security groups that allow no ingress but allow all egress. When Terraform starts managing rules for the security group, it removes this default egress rule. Modules should include an allow_all_egress boolean to restore that rule when true.

security_group_description

Our modules have evolved over time (at community request) to provide more useful descriptions of Security Groups. Unfortunately, Terraform cannot change the description of an existing security group; instead it must replace the SG with a new one with the new description. For this reason, changes to the description field, while beneficial for new users, can be too disruptive on existing infrastructure to be worth it. In order to provide users with control over the description and thus mitigate the impact of changes, all modules that create security groups should include a security_group_description input which, if set, overrides any other kind of generated or default description.

Instance Metadata Services

Although not actually part of the Security Group module, since we are covering consistent naming of inputs, we document here that we are using the following inputs and defaults to configure the AWS Instance Metadata Service. Note that our inputs do not exactly match the Terraform resource inputs because we have chosen to use boolean inputs rather than string inputs to toggle features. Our standard metadata_options block looks like this

  metadata_options {
    http_endpoint               = (var.metadata_http_endpoint_enabled) ? "enabled" : "disabled"
    http_put_response_hop_limit = var.metadata_http_put_response_hop_limit
    http_tokens                 = (var.metadata_http_tokens_required) ? "required" : "optional"
  }

and defaults are

metadata_http_endpoint_enabled = true
metadata_http_put_response_hop_limit = 2
metadata_http_tokens_required = true

We picked these defaults so that we default to best security practices with a concession (hop limit 2 instead of 1) to running containerized services. However, metadata_http_tokens_required = true may break some existing applications and is a breaking change, so when implemented, it should be noted in the release notes, along with how to preserve the previous settings.

Optional Inputs

This module is among the first to implement the new Cloud Posse standard for optional inputs in Terraform. Because of issues like this (just one of many, many examples) we are going to follow Hashicorp's advice and prohibit the conditional creation of resources based on values of inputs. If you want to condition the creation of a resource (e.g. count = xxx ? 1 : 0) based on whether the input is supplied or not, the way we are going to do it is to make the optional input a list. A supplied value will be in a list with 1 element. An omitted value will use the default list of 0 elements. It will remain standard practice to depend on the value of enabled, but otherwise we should avoid conditional creation of resources based on input values.

Unfortunately, this also means we cannot use for_each when the values might be generated during apply. This appears to be a consequence of the fact that for_each requires a set and the cardinality of the set depends on the values generated (adding 2 of the same value to a set only increases the size of the set by 1). So we can only use for_each when we can guarantee the user is hard coding the values so they are all known an plan time. Otherwise use count.

references

Issues with Terraform management of Security Group Rules: drift detection, cyclical dependencies, and competing for control: a post from a Hashicorp engineer

Bug in Terraform AWS provider requires multiple apply cycles to update aws_elasticache_replication_group security groups:

Some problems with the previous version:

@Nuru Nuru requested review from a team as code owners June 27, 2021 04:00
@Nuru
Copy link
Contributor Author

Nuru commented Jun 27, 2021

/test all

@aknysh
Copy link
Member

aknysh commented Jun 27, 2021

I like associated_security_group_ids and allowed_security_group_ids names, we needed this consistency for a long time

variables.tf Outdated Show resolved Hide resolved
variables.tf Outdated Show resolved Hide resolved
variables.tf Outdated Show resolved Hide resolved
main.tf Outdated Show resolved Hide resolved
Copy link
Member

@aknysh aknysh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good, please see a few comments

examples/complete/main.tf Outdated Show resolved Hide resolved
@osterman osterman changed the title Overhaul Overhaul Module for Consistency Jun 28, 2021
osterman
osterman previously approved these changes Jun 28, 2021
@Nuru
Copy link
Contributor Author

Nuru commented Jun 28, 2021

/test all

@nitrocode
Copy link
Member

/test all


That is why the `rules` object has the structure it has.

</summary>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
</summary>
</details>

README.yaml Outdated
Comment on lines 283 to 284
vpc_id = module.vpc.vpc_id

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

vpc_id already set below. Looks like the other examples keep it below the rules so perhaps remove this one to keep consistency across the examples.

Suggested change
vpc_id = module.vpc.vpc_id

README.yaml Outdated Show resolved Hide resolved
main.tf Show resolved Hide resolved
@Nuru
Copy link
Contributor Author

Nuru commented Aug 28, 2021

/test all

@Nuru Nuru removed do not merge Do not merge this PR, doing so would cause problems wip Work in Progress: Not ready for final review or merge labels Aug 28, 2021
Copy link
Member

@aknysh aknysh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please see comments

Co-authored-by: Andriy Knysh <aknysh@users.noreply.github.com>
@Nuru
Copy link
Contributor Author

Nuru commented Aug 29, 2021

/test all

@Nuru Nuru requested a review from aknysh August 29, 2021 22:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants