Skip to content

Latest commit

 

History

History
302 lines (231 loc) · 18.3 KB

README.md

File metadata and controls

302 lines (231 loc) · 18.3 KB

acm-demo-journey

Basic walkthrough of Anthos Config Management capabilities (Config Sync & Policy Controller) as well as integration with Config Connector's KRM model for GCP resources.

This walkthrough assumes you have Anthos Config Management already installed (the operator, specifically) on your GKE cluster(s). If you haven't already done this, follow these instructions. If, on the other hand, you have no idea what Anthos Config Management is and you landed here by mistake, take a moment to familiarize yourself with ACM via an overview here.

This isn't meant to be a deep dive in to all the various bells and whistles that ACM provides - we're not going to touch things like Binary Authorization nor the Heirarchy Controller - but this should be enough to start building some muscle memory on how to simplify multi-cluster management and defining guardrails for configuration within those clusters.

Prerequisites

As mentioned in the prior section, we're going to assume you have the config-management-operator deployed to your GKE cluster as decscribed here.

For the configuration of ACM beyond that, the following section includes a sample configManagement spec YAML that will get you up and running with minimal modification.

You'll also need both envsubst and Google Cloud Buildpacks installed, as they will be used in part 3 to generate some YAML and a container image, respectively.

01-config-sync

Let's actually deploy the config sync and policy controller bits first. There is a sample config-management.yaml in 01-config-sync/sample-config-management. Tweak the cluster name if you wish, and apply it:

$ kubectl apply -f 01-config-sync/sample-config-management/config-management.yaml
configmanagement.configmanagement.gke.io/config-management created

A few things will happen after you apply that YAML. The pods for Config Sync and Policy Controller will get deployed, and with Config Sync deployed, your cluster will now start watching the policyDir defined in 01-config-sync/sample-config-management/config-management.yaml. If Config Sync is working properly, you'll have 3 additional namespaces (foo, inherited-demo-01, and inherited-demo-01):

$ kubectl get ns
configmanagement.configmanagement.gke.io/config-management created
NAME                       STATUS   AGE
foo                        Active   7m46s
********  OTHER NAMESPACE STUFF  ********
inherited-demo-01          Active   7m46s
inherited-demo-02          Active   7m46s

Verify that Config Sync is constantly reconciling the configuration of your cluster against the desired state in the policyDir by deleting the foo namespace, and verifying that it gets re-created automatically in near-realtime:

$ kubectl delete ns foo
namespace "foo" deleted

$ kubectl get ns foo
NAME    STATUS   AGE
foo     Active   28s

Last thing we're going to verify is that namespace inheritence is working. Namespace inheritence is described here. For our example, 01-config-sync/namespaces/inherited-example is an "abstract namespace directory" that contains inherited-configmap.yaml, a sample configMap. Namespace inheritence allows us to define that namespace spec once, and the two namespaces defined (inherited-demo-01 and inherited-demo-02) will both "inherit" the configMap.

$ kubectl get cm/inherited-configmap -oyaml -n inherited-demo-01
apiVersion: v1
data:
  database: bar-db
  database_uri: bar-db://1.1.1.1:1234
kind: ConfigMap

$ kubectl get cm/inherited-configmap -oyaml -n inherited-demo-02
apiVersion: v1
data:
  database: bar-db
  database_uri: bar-db://1.1.1.1:1234
kind: ConfigMap

02-policy-controller-basic

Let's start with a simple example of leveraging Policy Controller to establish some guardrails for Kubernetes resources. In this section, you'll leverage the existing constraint template library to get a simple policy deployed.

If you want to get a print-out of what constraint templates you have at your disposal, run the following command:

$ kubectl get constrainttemplates
NAME                                      AGE
allowedserviceportname                    17h
destinationruletlsenabled                 17h
disallowedauthzprefix                     17h
*** trucated for brevity                  ***
k8spspselinuxv2                           17h
k8spspvolumetypes                         17h
k8srequiredlabels                         17h
k8srestrictnamespaces                     17h
k8srestrictrolebindings                   17h
policystrictonly                          17h
sourcenotallauthz                         17h

Let's use the k8srequiredlabels constraint to enforce a labeling / tagging taxonomy. The following command will print out the details of the constraint template, including the policy language (written in Rego):

$ kubectl describe constrainttemplates k8srequiredlabels

Now we can create a constraint the leverages the k8srequiredlabels template. Take a look at the constraint spec:

$ cat 02-policy-controller-basic/pod-buzz-label-constraint.yaml 
apiVersion: constraints.gatekeeper.sh/v1beta1
kind: K8sRequiredLabels
metadata:
  name: pod-must-have-buzz-label
spec:
  match:
    kinds:
      - apiGroups: [""]
        kinds: ["Pod"]
    namespaces:
      - foo
  parameters:
    labels:
      - key: "buzz"
  #enforcementAction: dryrun # dryrun == audit mode

Notice the kind field references the constraint template. Also, within the spec, the constraint is scoped specifically to pods in the foo namespace, and is requiring that the pod has a label with a key of "buzz" applied to it. Lastly, notice that the enforcementAction field is commented out; this means that the policy will be enforced by the admission controller and creation of a pod without the "buzz" label in the foo namespace will be blocked. Uncommenting this field will switch to audit mode instead.

Apply the constraint:

$ kubectl apply -f 02-policy-controller-basic/pod-buzz-label-constraint.yaml 
k8srequiredlabels.constraints.gatekeeper.sh/pod-must-have-buzz-label created

Now try to deploy a pod that has the buzz label commented out in its spec (see line 8):

$ kubectl apply -f 02-policy-controller-basic/fizz-pod.yaml 
Error from server ([denied by pod-must-have-buzz-label] you must provide labels: {"buzz"}): error when creating "02-policy-controller-basic/fizz-pod.yaml": admission webhook "validation.gatekeeper.sh" denied the request: [denied by pod-must-have-buzz-label] you must provide labels: {"buzz"}

You have a few options for remediation. You can modify the pod spec YAML and uncomment the "buzz" label (line 8 of 02-policy-controller-basic/fizz-pod.yaml) or you can switch the constraint's enforcementAction to audit mode (by uncommenting line 15 of 02-policy-controller-basic/pod-buzz-label-constraint.yaml), so a non-compliant pod can be created, but will be logged for non-compliance. If you choose to do the latter, run the following, and you should see the offending pod has been logged:

$ kubectl describe K8sRequiredLabels/pod-must-have-buzz-label

03-config-connector

In this section, you're going to use the Kubernetes Resource Model to deploy both the application elements common to Kubernetes (pods, service, etc) and some GCP resources outside of the GKE cluster that the application will interact with. This walkthrough has been tested using the Config Connector GKE Add-on installation method, although there are alternative approaches.

Make sure you've completed the Config Connector setup (service account setup, policy binding, and applying the ConfigConector YAML), and that you've enabled the Resource Manager API:

$ gcloud services enable cloudresourcemanager.googleapis.com

The demo app that you're going to deploy is very simple. It's composed of:

  • Cloud Pub/Sub topic
  • Express.js app that runs in K8s pods that serves up two paths. The root (/) path serves up an index.html doc you'll access in a browser, and a second path /gcp_check/ that, when accessed, will attempt to publish a message to the Pub/Sub topic. Embedded in the index.html document is a script running on an interval to GET the /gcp_check/ path, and will report back on whether a message was published successfully or not
  • Different identity resources (Google sevice account, IAM policies, etc) and Kubernetes (Kubernetes service account, workload identity) resources to grant proper permissions to the demo app

For this walkthrough, the K8s resources for both the app elements and the GCP infrastructure will live in the same K8s namespace called config-connector-demo-app. The YAML used to create that namespace, along with several other YAML manifests require references to your project ID. To simplify things, we've included some YAML "templates" in 03-config-connector/k8s-templates that we'll use the envsubst command (by way of make) to generate valid K8s YAML from:

$ export project_id=$(gcloud config get-value project) # or whatever project you want to use
$ make
Generating valid YAML for 03-Config-Connector referencing the project_id environment variable
envsubst < 03-config-connector/k8s-templates/service-account.yaml.template > 03-config-connector/k8s/app/service-account.yaml
envsubst < 03-config-connector/k8s-templates/iam-wi-policy.yaml.template > 03-config-connector/k8s/gcp/iam-wi-policy.yaml
envsubst < 03-config-connector/k8s-templates/iam-pubsub-policy.yaml.template > 03-config-connector/k8s/gcp/iam-pubsub-policy.yaml
envsubst < 03-config-connector/k8s-templates/configmap.yaml.template > 03-config-connector/k8s/app/configmap.yaml
envsubst < 03-config-connector/k8s-templates/namespace.yaml.template > 03-config-connector/k8s/app/namespace.yaml

To get a print-out of supported GCP resources on whatever version of Config Connector you're using, run:

$ kubectl get crds --selector cnrm.cloud.google.com/managed-by-kcc=true

The YAML for the demo app and supporting infrastructure are almost ready to go, but first you need to build a container image in your own GCR repo and reference it in the pod spec (03-config-connector/k8s/app/deployment.yaml). The following commands assume you've already installed Google Cloud Buildpacks:

$ export project_id=$(gcloud config get-value project) # or whatever project you want to use

$ pack build -p 03-config-connector/src/ \
--builder gcr.io/buildpacks/builder:v1 \
--publish gcr.io/${project_id}/config-connector-demo-app:v1.0.0
.
.
.
Reusing cache layer 'google.nodejs.runtime:node'
Reusing cache layer 'google.nodejs.npm:npm'
Successfully built image gcr.io/am01-services/config-connector-demo-app:v1.0.0

Once you've built and published your image, update the image reference in 03-config-connector/k8s/app/deployment.yaml to point to the newly built & published image in GCR. If you plan on modifying the source and building your own versions, just tweak the tag suffix (:v1.0.0 in the example above).

Now, let's deploy the app components first (minus the GCP stuff, for now):

$ kubectl apply -f 03-config-connector/k8s/app/namespace.yaml
namespace/config-connector-demo-app created

$ kubectl apply -f 03-config-connector/k8s/app/
configmap/config-connector-demo-app-configmap created
deployment.apps/config-connector-demo-app created
namespace/config-connector-demo-app unchanged
serviceaccount/demo-app-ksa created
service/config-connector-demo-app-svc created

After a few moments, lets capture the IP address of the K8s service that the demo app is exposed on. You'll need a minute or two for the LoadBalancer resource to complete provisioning:

$ echo $(kubectl get svc config-connector-demo-app-svc -n config-connector-demo-app | grep -v EXTERNAL-IP | awk '{ print $4}')
1.2.3.4 # sample

Load up that IP address in a browser (via HTTP, not HTTPS). Give it a few seconds, and you should see something like: ❌ GCP PubSub inaccessible ❌ in your browser window. Take a look at the browser console, and you'll see something like:

Calling /gcp_check path
(index):29 GET http://1.2.3.4/gcp_check 500 (Internal Server Error)
checkBackend @ (index):29
(anonymous) @ (index):48
(index):41 Received error while publishing: 404 undefined: Getting metadata from plugin failed with error: Could not refresh access token: A Not Found error was returned while attempting to retrieve an accesstoken for the Compute Engine built-in service account. This may be because the Compute Engine instance does not have any permission scopes specified: Could not refresh access token: Unsuccessful response status code. Request failed with status code 404

We expected that, because the GCP stuff hasn't been spun up yet. Let's do that now:

$ kubectl apply -f 03-config-connector/k8s/gcp/
iampolicy.iam.cnrm.cloud.google.com/demo-app-iam-pubsub-policy created
iamserviceaccount.iam.cnrm.cloud.google.com/demo-app-gsa created
iampolicy.iam.cnrm.cloud.google.com/demo-app-iam-wi-policy created
pubsubtopic.pubsub.cnrm.cloud.google.com/demo-app-topic created

Give it a few minutes, but momentarily you should start seeing ✅ GCP PubSub accessible ✅ in your browser window. Additionally, you should see something like this in your browser console:

(index):25 Calling /gcp_check path
(index):37 Message 1924293278679133 published.
(index):25 Calling /gcp_check path
(index):37 Message 1924291607076576 published.
(index):25 Calling /gcp_check path
(index):37 Message 1924298495207169 published.

So your app deployment is eventually consistent - you can deploy this stuff in any order, and it eventually converges into a functional (albeit super basic) app. Pretty neat, huh?

One more thing - the demo app is basically firing off Pub/Sub messages to nobody. If you want to see things end-to-end, you can create a subscription to your topic via the gcloud CLI, although I'll leave that as an excercise for the reader.

04-policy-controller-config-connector

Let's wrap up with a combined exercise where you'll write a Policy Controller constraint that can be applied to a Config Connector-managed GCP resource (specifically, a Pub/Sub subscription resource that references the Pub/Sub topic from the prior section).

Start by provisioning the subscription resource:

$ kubectl apply -f 04-policy-controller-config-connector/k8s/pubsub-subscription.yaml 
pubsubsubscription.pubsub.cnrm.cloud.google.com/demo-app-subscription created

04-policy-controller-config-connector/k8s/pubsub-sub-message-retention-constraint-template.yaml defines a constraint template that checks to see if a Pub/Sub subscription resource's spec defines a messageRetentionDuration that matches some value defined in a constraint. The Rego is pretty basic:

package gcppubsubmessageretention

violation[{"msg": msg}] {
    input.review.kind.kind == "PubSubSubscription"
    re_match("^(extensions|pubsub.cnrm.cloud.google.com)$", input.review.kind.group)
    input.review.object.spec.messageRetentionDuration != input.parameters.permittedMessageRetention
    msg := sprintf("The messageRetentionDuration must equal %v; it currently equals %v", [input.parameters.permittedMessageRetention, input.review.object.spec.messageRetentionDuration])
}

Apply the constraint template:

$ kubectl apply -f 04-policy-controller-config-connector/k8s/pubsub-sub-message-retention-constraint-template.yaml 
constrainttemplate.templates.gatekeeper.sh/gcppubsubmessageretention created

Now that the constraint template has been applied, create a constraint, which specifies the permitted messageRetentionDuration (in this example, 1 hour or 3600 seconds) and runs in dryrun mode:

$ kubectl apply -f 04-policy-controller-config-connector/k8s/pubsub-sub-message-retention-constraint.yaml 
gcppubsubmessageretention.constraints.gatekeeper.sh/pubsub-msg-retention-1-hour created

After a few moments, check the constraint to see if it has flagged our subscription (which it should, as the message retention duration of the subscription is not 1 hour):

$ kubectl describe gcppubsubmessageretention.constraints.gatekeeper.sh/pubsub-msg-retention-1-hour
.
.
.
Total Violations:  1
Violations:
Enforcement Action:  dryrun
Kind:                PubSubSubscription
Message:             The messageRetentionDuration must equal 3600s; it current equals 86400s
Name:                demo-app-subscription
Namespace:           config-connector-demo-app

That's it! You've completed this walkthrough, and hopefully learned a few things about ACM & Config Connector along the way. If you have feedback or questions, feel free to contact me via an issue. You have The Power, and if you've been sitting down for too long, you should Move Your Body and get some exercise :)

TODO

  • add screen shots
  • add architecture schematic for demo app
  • break up monolith README.md into per-section README.mds