Cherry-pick #26056 to 7.x: Add k8s cluster identifier #26346

ChrsMark · 2021-06-16T14:09:17Z

Cherry-pick of PR #26056 to 7.x branch. Original message:

What does this PR do?

This PR adds cluster identifier fields (defined in ECS) to the k8s metadata used in:

event enrichment with autodiscover
event enrichment in the kubernetes module (where it already happens)
event enrichment in the add_kubernetes_metadata processor

Note [MetaGenerators' refactoring]: The identifiers are stored under orchestrator.cluster.url/name. Because of this, the metadata generators are refactored slightly to cover fields that live outside the kubernetes.* namespace. The change is transparent: kubernetes.* metadata is still reported in the same way. The refactoring makes it easier to handle ECS fields populated by the k8s metadata generators in the future. The logic is covered in the interfaces' docs.
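To make the namespace split concrete, here is a minimal Python sketch of the idea only. It is not the Go code from this PR; the function name `build_metadata` and its parameters are hypothetical and chosen for illustration. It keeps kubernetes.* fields untouched and adds orchestrator.cluster.* next to them only when a value is known.

```python
from typing import Any, Dict, Optional


def build_metadata(k8s_fields: Dict[str, Any],
                   cluster_name: Optional[str] = None,
                   cluster_url: Optional[str] = None) -> Dict[str, Any]:
    """Assemble event metadata, keeping kubernetes.* and orchestrator.* apart.

    Hypothetical helper: the names and signature are assumptions for this sketch.
    """
    # kubernetes.* metadata is copied as-is, so existing consumers see no change.
    meta: Dict[str, Any] = {"kubernetes": dict(k8s_fields)}

    # The cluster identifiers live outside the kubernetes.* namespace.
    cluster: Dict[str, str] = {}
    if cluster_name:
        cluster["name"] = cluster_name
    if cluster_url:
        cluster["url"] = cluster_url
    if cluster:
        meta["orchestrator"] = {"cluster": cluster}
    return meta


# Illustrative values only.
with_cluster = build_metadata({"namespace": "default"}, "kind", "https://127.0.0.1:6443")
without_cluster = build_metadata({"namespace": "default"})
print(with_cluster)
print(without_cluster)
```

When no identifier is available, the result contains only the kubernetes key, which matches the statement that the change is transparent for existing kubernetes.* metadata.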

The transparency of the refactoring is ensured by event-level checks in the tests.

The fields are populated following the flow below:

Try to get this info from kube_config, if provided.
Else (in inCluster mode) try to get the info from the kubeadm-config ConfigMap, if available. This works only for clusters set up with kubeadm.
Else try to get the info from the cloud provider's metadata API (only on GKE).
Else these fields will not be populated (see [Metricbeat] Add cluster identifier to Kubernetes metadata #17467 (comment)).
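The fallback order above can be sketched as a chain of lookups that stops at the first source returning a value. This is a Python illustration under stated assumptions, not the PR's Go implementation: `ClusterInfo`, `resolve_cluster_info`, and the three stand-in source functions are hypothetical, and the returned values are made up.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass(frozen=True)
class ClusterInfo:
    """Hypothetical holder for the values behind orchestrator.cluster.name/url."""
    name: str
    url: str


# A source returns ClusterInfo when it can identify the cluster, else None.
Source = Callable[[], Optional[ClusterInfo]]


def resolve_cluster_info(sources: Iterable[Source]) -> Optional[ClusterInfo]:
    """Return the result of the first source that yields a value.

    None means no source matched, so the fields stay unpopulated.
    """
    for source in sources:
        info = source()
        if info is not None:
            return info
    return None


# Stand-in sources in the order described above (all hypothetical).
def from_kube_config() -> Optional[ClusterInfo]:
    return None  # pretend no kube_config was provided


def from_kubeadm_config() -> Optional[ClusterInfo]:
    return ClusterInfo(name="kind", url="https://kind-control-plane:6443")


def from_gke_metadata() -> Optional[ClusterInfo]:
    return None  # pretend we are not running on GKE


info = resolve_cluster_info([from_kube_config, from_kubeadm_config, from_gke_metadata])
print(info)
```

Keeping each source as an independent callable means the order is expressed in one place (the list passed in), and a source that does not apply simply returns None.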

Why is it important?

To add cluster identifier ECS fields as part of k8s metadata.

Checklist

My code follows the style guidelines of this project
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have made corresponding changes to the default configuration files
I have added tests that prove my fix is effective or that my feature works
I have added an entry in CHANGELOG.next.asciidoc or CHANGELOG-developer.next.asciidoc.

Author's Checklist

update docs in metadata generators
update cluster roles
manual testing

How to test this PR locally

A. Verify that events from state_* metricsets are enriched properly

1. Enable the kubernetes module with the following metricsets:

- module: kubernetes
  metricsets:
    - state_node
    - state_deployment
    - state_pod
    - state_container
    - state_service
  period: 10s
  hosts: ["0.0.0.0:8081"]
  add_metadata: true
  kube_config: /Users/chrismark/.kube/config

Note: In the example above I run kube-state-metrics on a local cluster using kind and expose it to my host machine using kubectl -n kube-system port-forward svc/kube-state-metrics 8081:8080. In this case I need to set add_metadata to true and also provide the proper kube_config in order to reach the k8s API. You can run kubectl config view -o jsonpath='{"Cluster name\tServer\n"}{range .clusters[*]}{.name}{"\t"}{.cluster.server}{"\n"}{end}' to verify the values.
2. Ensure that orchestrator.cluster.name, orchestrator.cluster.url, kubernetes.namespace and kubernetes.node.name are being populated properly.
3. Perform the same test in inCluster mode, running Metricbeat as a Pod in the cluster. Note that the k8s cluster should be created with kubeadm, since the cluster info values are retrieved from the kubeadm-config ConfigMap; you can run kubectl -n kube-system get configmap kubeadm-config -o yaml to verify it.
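The kubectl config view check from the note above lists the name and server of every cluster entry in the kubeconfig. A rough Python equivalent is sketched below, assuming the kubeconfig has already been parsed into a dict (for example from the output of kubectl config view -o json). The helper name `cluster_entries` and the sample values are hypothetical.

```python
import json
from typing import Any, Dict, List, Tuple


def cluster_entries(kubeconfig: Dict[str, Any]) -> List[Tuple[str, str]]:
    """Return (name, server) pairs, mirroring the jsonpath query in the note."""
    return [
        (entry.get("name", ""), entry.get("cluster", {}).get("server", ""))
        for entry in kubeconfig.get("clusters") or []
    ]


# Made-up sample in the shape of a kubeconfig rendered as JSON.
sample = json.loads("""
{
  "clusters": [
    {"name": "kind-kind", "cluster": {"server": "https://127.0.0.1:6443"}}
  ]
}
""")

print("Cluster name\tServer")
for name, server in cluster_entries(sample):
    print(f"{name}\t{server}")
```

These are the values to compare against orchestrator.cluster.name and orchestrator.cluster.url in the resulting events.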

B. Verify that events from add_kubernetes_metadata are enriched properly

Use the updated manifests from https://github.com/elastic/beats/tree/master/deploy/kubernetes
Deploy Filebeat on Kubernetes (the cluster should be created with kubeadm, e.g. a kind cluster) and configure log collection like this:

    filebeat.inputs:
    - type: container
      paths:
        - /var/log/containers/*.log
      processors:
        - add_kubernetes_metadata:
            host: ${NODE_NAME}
            matchers:
            - logs_path:
                logs_path: "/var/log/containers/"

Ensure that orchestrator.cluster.name, orchestrator.cluster.url, kubernetes.namespace and kubernetes.node.name are being populated properly.

C. Verify that events from autodiscover provider are enriched properly

Use updated manifests from https://github.com/elastic/beats/tree/master/deploy/kubernetes
Deploy Filebeat on Kubernetes (the cluster should be created with kubeadm, e.g. a kind cluster) and configure log collection like this:

    filebeat.autodiscover:
      providers:
        - type: kubernetes
          node: ${NODE_NAME}
          hints.enabled: true
          hints.default_config:
            type: container
            paths:
              - /var/log/containers/*${data.kubernetes.container.id}.log

Ensure that orchestrator.cluster.name, orchestrator.cluster.url, kubernetes.namespace and kubernetes.node.name are being populated properly.
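The "ensure the fields are populated" step in scenarios A to C can be automated with a small check over published events. The sketch below is a hypothetical helper, not part of the PR: it assumes events are available as JSON objects (for instance one per line from a console or file output), and the sample event values are made up. The field list follows the orchestrator.cluster.url/name location described in the PR.

```python
import json
from typing import Any, Dict, List, Sequence

REQUIRED_FIELDS = (
    "orchestrator.cluster.name",
    "orchestrator.cluster.url",
    "kubernetes.namespace",
    "kubernetes.node.name",
)


def missing_fields(event: Dict[str, Any],
                   required: Sequence[str] = REQUIRED_FIELDS) -> List[str]:
    """Return the dotted field paths that are absent or empty in the event."""
    missing = []
    for path in required:
        node: Any = event
        for key in path.split("."):
            if not isinstance(node, dict) or key not in node:
                missing.append(path)
                break
            node = node[key]
        else:
            # The path exists; treat None or an empty string as unpopulated.
            if node is None or node == "":
                missing.append(path)
    return missing


# Made-up sample event; real events carry many more fields.
line = json.dumps({
    "orchestrator": {"cluster": {"name": "kind", "url": "https://127.0.0.1:6443"}},
    "kubernetes": {"namespace": "kube-system", "node": {"name": "kind-control-plane"}},
})
event = json.loads(line)
print(missing_fields(event))
```

An empty list means all four fields are present; any returned path points at the field to investigate.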

D. Perform one of the above scenarios with Metricbeat running as a Pod on GKE.

Related issues

Closes [Metricbeat] Add cluster identifier to Kubernetes metadata #17467

(cherry picked from commit 0829211)

elasticmachine · 2021-06-16T14:09:21Z

Pinging @elastic/integrations (Team:Integrations)

elasticmachine · 2021-06-16T15:36:24Z

💚 Build Succeeded

Build stats

Build Cause: Started by user Chris Mark
Start Time: 2021-06-17T07:38:24.996+0000
Duration: 143 min 51 sec
Commit: 93afa6f

Test stats 🧪

Test	Results
Failed	0
Passed	46932
Skipped	5090
Total	52022

💚 Flaky test report

Tests succeeded.

JackJudge01 · 2023-09-29T14:25:50Z

Has this been forgotten?

Add k8s cluster identifiers (elastic#26056)

93afa6f

(cherry picked from commit 0829211)

ChrsMark added the [zube]: In Review, backport, and Team:Integrations labels Jun 16, 2021

botelastic bot added the needs_team label Jun 16, 2021

botelastic bot removed the needs_team label Jun 16, 2021

ChrsMark merged commit d8d0551 into elastic:7.x Jun 17, 2021

zube bot added [zube]: Done and removed [zube]: In Review labels Jun 17, 2021

zube bot removed the [zube]: Done label Sep 15, 2021
