Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[EKS] [request]: allow to configure metricsBindAddress for kube-proxy #657

Closed
chancez opened this issue Dec 17, 2019 · 59 comments
Closed

[EKS] [request]: allow to configure metricsBindAddress for kube-proxy #657

chancez opened this issue Dec 17, 2019 · 59 comments
Labels
EKS Amazon Elastic Kubernetes Service Proposed Community submitted issue

Comments

@chancez
Copy link

chancez commented Dec 17, 2019

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment

Tell us about your request
I want to be able to monitor kube-proxy with Prometheus, but cannot because, by default the metricsBindAddress is set to 127.0.0.1:10249 meaning it isn't accessible outside of the pod.

Which service(s) is this request for?
EKS

Tell us about the problem you're trying to solve. What are you trying to do, and why is it hard?
I want to monitor kube-proxy. I cannot do this unless I can reconfigure kube-proxy in some way.

Are you currently working around this issue?
I am not.

Additional context
You can see the kube-proxy config created by EKS in the kube-system namespace via k get cm -n kube-system kube-proxy-config -o yaml.

@chancez chancez added the Proposed Community submitted issue label Dec 17, 2019
@TBeijen
Copy link

TBeijen commented Dec 20, 2019

Additional context illustrating the problem: https://github.com/helm/charts/tree/master/stable/prometheus-operator#kubeproxy

@dawidmalina
Copy link

Quick hotfix:

kubectl --kubeconfig kubeconfig* -n kube-system get cm kube-proxy-config -o yaml |sed 's/metricsBindAddress: 127.0.0.1:10249/metricsBindAddress: 0.0.0.0:10249/' | kubectl --kubeconfig kubeconfig* apply -f -
kubectl --kubeconfig kubeconfig* -n kube-system patch ds kube-proxy -p "{\"spec\":{\"template\":{\"metadata\":{\"labels\":{\"updateTime\":\"`date +'%s'`\"}}}}}"

@alfredkrohmer
Copy link

I cannot do this unless I can reconfigure kube-proxy in some way.

You can see the kube-proxy config created by EKS in the kube-system namespace via k get cm -n kube-system kube-proxy-config -o yaml.

So what exactly prevents you from changing this configmap?

@chancez
Copy link
Author

chancez commented Dec 23, 2019

Nothing, except that it's entirely possible that'll get overwritten at some point in the future. EKS has no documentation on what's supported and unsupported when it comes to this kind of modification, so I'd like to know what kind of guarantees there would be that it won't be stomped on in the future.

@alfredkrohmer
Copy link

alfredkrohmer commented Dec 23, 2019

The EKS cluster upgrade documentation explicitly instructs to manually update kube-proxy, coredns and aws-node after a control plane upgrade. I highly doubt that AWS will touch these components in an automated fashion. We have been using custom configuration for all three (kube-proxy, coredns and aws-node) since we started using EKS and never had a problem with this.

@chancez
Copy link
Author

chancez commented Dec 23, 2019

That's the daemonset, not the configuration. While I agree it's unlikely to be an issue, having previously worked on commercial Kubernetes distributions, making assumptions on what's going to be safe to modify isn't always the best choice. I suppose I can always open a ticket.

@kogeler
Copy link

kogeler commented Feb 19, 2020

I really need this feature!

@Cre-eD
Copy link

Cre-eD commented Feb 19, 2020

@adamrbennett
Copy link

Manually editing configurations is a work around, not a solution. All configs should be coded and recorded in source control. Unless I'm missing something, it appears EKS users have no control over kube-proxy configuration, except for manual edits.

@mikestef9 mikestef9 added the EKS Amazon Elastic Kubernetes Service label Apr 9, 2020
@damscott
Copy link

damscott commented Apr 28, 2020

I think @devkid is correct, but we could use confirmation from the EKS team that automated changes will not be made to the kube-system namespace in a running cluster.

Here is the problem I encountered when setting metricsBindAddress:

When I tried to edit metricsBindAddress I found that I did not have a kube-proxy-config configmap. I have two EKS clusters: one is 565 day old, the other is 325 days old.
The newer cluster has the kube-proxy-config configmap, the older cluster does not. I have never touched the configuration of those resources beyond patching the DaemonSet image tag as documented in Updating an Amazon EKS cluster Kubernetes version.

Here is my proposed solution:

Have a git repo with all of the baseline EKS manifests (including kube-proxy) and document which EKS Platform Version corresponds with each revision

  • This already exists to some degree for the aws-node DaemonSet as part of the CNI patching process
  • It would provide guidance during upgrades while leaving full control in the hands of the user
  • It would help recover from accidental misconfigurations made to core EKS components
  • It would have helped me dealing with the undocumented drift in the kube-proxy manifests

Update:

Copying the DaemonSet and ConfigMaps from the newer cluster to the old one works. It was necessary to change one line in the kube-proxy configmap:

        server: https://<foo>.<bar>.<region>.eks.amazonaws.com

@abangser
Copy link

Just to add, I have a cluster that was last upgraded in late Jan to version 1.14 but initially created in October at 1.10. It does not seem to have the kube-proxy-config configmap. We are also running prometheus-operator and unable to use the kubeProxy exporter without an edit to the metricsBindAddress referenced in the docs here: #657 (comment).

@karlskewes
Copy link

100% agree with @damscott. Good write up.
kube-proxy-config ConfigMap and kube-proxy DaemonSet should be in a versioned manifest like other components (cni, coredns, calico).

There's a second ConfigMap called kube-proxy that is a cluster specific k8s kubeconfig from the looks of it that I guess we wouldn't touch.

@mbarrien
Copy link

Note that in my investigation of the clusters in my org (originally made in a wide range of EKS versions from 1.11 to 1.15), any EKS clusters created from 1.12 onwards use the kube-proxy-config configmap. There is still the issue of migrating any EKS clusters created before 1.12 to using a configmap; this might be possible to do manually.

Any EKS clusters created up to 1.12 also have a slightly different kube-proxy configmap; the configmap 1.12+ contains the cluster URL, while 1.11 and below have the cluster URL (along with oom-scor-0adj and proxy-mode) passed as a command line argument to kubeproxy (within the podspec inside the DaemonSet spec).

If you created your cluster pre-EKS 1.12, to be able pass metricsBindAddress you either need to "upgrade" your daemonset to take in the kube-proxy-config configmap (which you'll need top create manually), or pass it in as a command line argument. Unfortunately this "upgrade" of kube-proxy to the new way is not documented.

@mbarrien
Copy link

@damscott https://github.com/aws/eks-charts/tree/master/stable/aws-vpc-cni has some helm charts for keeping aws-vpc-cni up to date. Would be great if they added similar charts of kube-proxy and coredns.

@TBBle
Copy link

TBBle commented Jun 15, 2020

I haven't verified this, but CoreDNS already has a Helm chart, so it should just be a matter of getting the right parameters for that to match the default-installed version, then any changes you need.

@OffCode
Copy link

OffCode commented Jun 25, 2020

I also have the same issue with most recent EKS version (Kubernetes 1.16) and prometheus-operator stable helm chart. The root cause of the problem is that EKS likely uses an out-dated config value metricsBindAddress.
The Kubernetes documentation describes:

--metrics-bind-address ipport     Default: 127.0.0.1:10249

  | The IP address with port for the metrics server to serve on (set to '0.0.0.0:10249' for all IPv4 interfaces and '[::]:10249' for all IPv6 interfaces). Set empty to disable.

See also
https://kubernetes.io/docs/reference/command-line-tools-reference/kube-proxy/

@boatmisser
Copy link

boatmisser commented Aug 19, 2020

This is still an issue with EKS 1.17.

The kube-proxy-config has 0.0.0.0 for bindAddress and healthzBindAddress.
However it still uses 127.0.0.1 for metricsBindAddress!!!

Would it be much of an issue to move metricsBindAddress to 0.0.0.0 to bring it in line with the other addresses, and allow us to monitor kube-proxy???

@chancez
Copy link
Author

chancez commented Aug 28, 2020

Original creator of the issue here. I thought I'd chime in on how I handle this these days.

At this point I just maintain a kustomize base for kube-proxy that I apply to all clusters immediately after deploying and it's managed via ArgoCD. You could do the same with a helm chart or similar.

At this point I think AWS has made it quite clear, it's unlikely they'll manage components inside the cluster, at least in the short to medium term. If anything, I'd bet that if they were to begin managing "core" components, like kube-proxy, aws-node/cni, etc, that you would opt-in to EKS managing these components for you, and even then, it's likely to only be in new EKS clusters. This is just a guess.

Their documentation provides instructions on managing the versions of various components in the cluster, so it makes me think that it's unlikely there will be any automated management of this stuff for quite some time, so there's little risk to managing the versions, and configuration of these resources yourself.

As others suggested, I think a more likely avenue for supporting this feature is aws/eks-charts#156, #923, and the other issues where you would fully manage kube-proxy yourself.

@aviau
Copy link

aviau commented Jul 2, 2021

At this point I think AWS has made it quite clear, it's unlikely they'll manage components inside the cluster,

Good news! EKS now manages components inside the cluster via EKS cluster addons.

However, they have zero config.

Can we please have addon config that would allow for things like this?

@Oliniusz
Copy link

Oliniusz commented Jul 6, 2021

I've made a dumb move and enabled the kube-proxy addon. It failed with installing due to the modified metricsBindAddress in the configmap. Eh, whatever - I thought - and I've disabled it straight away. That actually completely deleted kube-proxy resources from the cluster...

So to have the cluster back, I've re-enabled the addon back with the config overwriting option checked. That installed kube-proxy back but restored metricsBindAddress back to original 127.0.0.1. After that I've modified the configmap by hand cursing at myself how stupid I was and how unpredictable it was.

I hope AWS will add anytime soon some option to keep the configmap untouched or just will change the default setting of metricsBindAddress to 0.0.0.0 - that's the only thing I would need here.

Otherwise I'm putting kube-proxy manifests into the gitops repository like I should have done already and forget about all those addons.

@aviau
Copy link

aviau commented Jul 6, 2021

That actually completely deleted kube-proxy resources from the cluster...

Same experience here. Deploying an addon without overwrite enable results in breaking everything.

@Pluies
Copy link

Pluies commented Jul 15, 2021

Same boat here - switched to the add-on, lost my monitoring :(

The add-ons are a great step forward to keep these sort of resources up-to-date, so I don't want to revert back to maintaining them manually.

It would be really good to have metricsBindAddress: 0.0.0.0:10249 be the default on the kube-proxy add-on! Or at the very least have this configurable at the add-on level, similar to the way the vpc-cni add-on allows you to pick an IAM role for IRSA.

@pramodpawade
Copy link

pramodpawade commented Jul 27, 2021

I'll second @Pluies here. Recently I tried this on my project and lost my monitoring.

I see a couple of comments saying that AWS EKS Addons doesn't update kube-proxy config on your cluster if you modify them but unfortunately it does.
I tried this by enabling EKS Addons and then modifying the kube-proxy config on my cluster manually to include metricsBindAddress: 0.0.0.0:10249 but after a couple of mins without doing any update to my cluster, I can see the metricsBindAddress: 127.0.0.1:10249.
So it does prove that EKS Addons are updating your cluster config and keeps checking & modifying after periodic intervals.

It would be really good if AWS can make metricsBindAddress: 0.0.0.0:10249 be the default on the kube-proxy add-on or provide some placeholder to modify this on the add-ons level.

Update as of 03/08/2021: I was checking this with the AWS support team and it seems this issue is now in the proposed stage so I think, we might get some solution on this from AWS soon.

@andrew-pickin-epi
Copy link

I have to echo the above. Especially as there is no was to roll back this change in production clusters without downtime and significant remedial action.

I the meantime if AWS are reading this - please remove this addon as the default way to upgrade a cluster, or at least include a warning.

It's possible, as happened with us, that upgrade appears fine on test cluster only to break later on production, and rolling back isn't possible/practical.

@mikestef9
Copy link
Contributor

Correct, kube-proxy on EKS v1.22 clusters (either unmanaged kube-proxy installed with new 1.22 cluster, or managed add-on) now expose metrics by default. If you are upgrading an older cluster that still used the unmanaged add-on, the manifest will not change.

@julianxhokaxhiu
Copy link

What about kube-proxy on older K8s versions on EKS though? Why a one-line change patch is not applied to all versions 1.19 and above?

@ddmunhoz
Copy link

ddmunhoz commented Apr 5, 2022

Thanks @stevehipwell and mikestef9 for the update!

@stevehipwell
Copy link

@mikestef9 I was going to say that the documentation for this was confusing as it called out "Amazon Managed Service for Prometheus" but you've already fixed that so thank you.

@TBBle
Copy link

TBBle commented Apr 5, 2022

If you are upgrading an older cluster that still used the unmanaged add-on, the manifest will not change.

@mikestef9 (and relevant to @julianxhokaxhiu 's question): If you manually update the configmap on an older cluster (and/or an older-but-upgraded-to-1.22 cluster) will it still revert the configmap change back to the localhost bind address?

@stevehipwell
Copy link

@TBBle for unmanaged add-ons unless you know any differently the manifests aren't modified once the cluster has been provisioned. Or are you asking about managed add-ons?

@TBBle
Copy link

TBBle commented Apr 5, 2022

I was referring to #657 (comment), #657 (comment), and #657 (comment). They're talking about managed addons.

Reading #657 (comment) again, perhaps I misparsed it, and the change applies to the kube-proxy managed addon on all clusters, it's only the unmanaged add-ons where the cluster version is relevant because it chooses which config map to go with initially and doesn't touch it again.

@mikestef9
Copy link
Contributor

That's right. If you upgrade to the 1.22 kube proxy managed add-on on an existing cluster upgraded to 1.22, it will contain the change to expose metrics

@zekena2
Copy link

zekena2 commented Jun 1, 2022

What about old versions? I've version 1.21 do I've to upgrade in order to monitor kube-proxy?

@zekena2
Copy link

zekena2 commented Jun 2, 2022

I tried editing the configMap kube-proxy-config and it got reverted to 127.0.0.1 and I tried creating a new configMap with the solution and pointing the daemonSet to the new one and still the daemonSet got reverted.
The docs claim that we can edit the configuration of the add-on starting from v1.18 and later but it seems false in the case of kube-proxy at least for me.
https://docs.aws.amazon.com/eks/latest/userguide/add-ons-configuration.html

@stevehipwell
Copy link

@zekena2 did you check if the field was being managed by EKS? AFAIK you can't use the managed kube-proxy add-on and change this value. You either need to manage it yourself or upgrade to v1.22.

@zekena2
Copy link

zekena2 commented Jun 3, 2022

Yes it seems all the config related stuff are managed by EKS which basically means that nothing is configurable. regardless of the metrics problem I should be able to configure the addon as I want.

@stevehipwell
Copy link

@zekena2 you could install the managed add-on for v1.21 and then disable it before patching. When you upgrade to v1.22 you can just re enable the managed add-on and it should just work.

@zekena2
Copy link

zekena2 commented Jun 3, 2022

Thanks @stevehipwell I got it working that way. I removed the plugin with checking preserve on cluster and then edited the configMap and it preserved. I still think it's not an ideal way to configure the addon.

@samox73
Copy link

samox73 commented Aug 16, 2022

According to https://docs.aws.amazon.com/eks/latest/userguide/kubernetes-versions.html starting from K8s 1.22+ on EKS, the metricsBindAddress should be set to expose metrics outside the pod:

Starting with Amazon EKS version 1.22, kube-proxy is configured by default to expose Prometheus metrics outside the pod. This behavior change addresses the request made in containers roadmap issue

However, in both our clusters (prod is on 1.22 while staging is on 1.23) the kube-proxy-config ConfigMap still reads:

metricsBindAddress: 127.0.0.1:10249

I tried installing the EKS add-on manually but that didn't change anything either.

@TBBle
Copy link

TBBle commented Aug 16, 2022

Were those clusters installed at an earlier version and upgraded? If they were upgraded from a version before the change, then they won't have changed the setting as part of the upgrade. See #657 (comment)

@samox73
Copy link

samox73 commented Aug 17, 2022

Were those clusters installed at an earlier version and upgraded? If they were upgraded from a version before the change, then they won't have changed the setting as part of the upgrade. See #657 (comment)

Yes, they were upgraded from 1.21, however, we have only been using the managed add-on.

@julianxhokaxhiu
Copy link

When I did upgrade my clusters, I removed the extension first and then I re-added it via the AWS EKS Web console, and it worked perfectly. Prometheus is able to fetch metrics from kube-proxy now. If you upgrade the add-on probably it won't touch the config, so try the remove and re-add approach.

@andrew-pickin-epi
Copy link

andrew-pickin-epi commented Nov 9, 2022

I have managed addon, and upgrading to 1.22 the resulting in metricsBindAddress: 0.0.0.0:10249.
However, I used the --force option when upgrading

@samox73
Copy link

samox73 commented Nov 16, 2022

I have managed addon, and upgrading to 1.22 the resulting in metricsBindAddress: 0.0.0.0:10249. However, I used the --force option when upgrading

@andrew-pickin-epi see the reply just above yours.

@andrew-pickin-epi
Copy link

I have managed addon, and upgrading to 1.22 the resulting in metricsBindAddress: 0.0.0.0:10249. However, I used the --force option when upgrading

@andrew-pickin-epi see the reply just above yours.

My point is you don't need to remove and re-apply. Using --force overrides existing setting and is the documneted way to upgrade as per the AWS documentation. https://docs.aws.amazon.com/eks/latest/userguide/managing-add-ons.html

@praveenchandran
Copy link

Quick fix :
Edit the kube-proxy Daemonset and pass an additional parameter to kube-proxy --metrics-bind-address 0.0.0.0:10249.
The pods will restart and the ports will be open.

@rfcku
Copy link

rfcku commented Jul 5, 2024

Quick fix : Edit the kube-proxy Daemonset and pass an additional parameter to kube-proxy --metrics-bind-address 0.0.0.0:10249. The pods will restart and the ports will be open.

Indeed this resolved also my issue, when updating from eks 1.22 to 1.23 the proxy-config still pointed to 127.0.0.1:10249, so after updating the configmap kube-proxy-config to point to 0.0.0.0:10249 and restarting the kube-proxy daemonset the issue was resolved.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
EKS Amazon Elastic Kubernetes Service Proposed Community submitted issue
Projects
None yet
Development

No branches or pull requests