Deploy Istio to the utility cluster #7341

Merged
merged 3 commits into from
Oct 4, 2024

Conversation

upodroid
Member

@upodroid upodroid commented Sep 25, 2024

This PR does a few things:

We are using Istio in the Utility cluster because:

  1. We need auth and gRPC load balancing (gRPC is required for multicluster observability), which I can't easily do with GCE Ingress.
  2. We deploy an oauth2 proxy and secure the Argo CD UI. It's now available for viewing over the internet for specific users.
  3. We'll be deploying Atlantis next for Terraform automation, and that needs public endpoints with traffic restricted to GitHub webhooks.
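
As a rough illustration of what "traffic restricted to GitHub webhooks" in item 3 could mean, here is a minimal Python sketch of a source-IP allowlist check. It is not how this PR implements the restriction. The function name `is_allowed_source` and the CIDR list are hypothetical: the ranges are placeholder documentation blocks, and a real setup would presumably use the webhook ranges GitHub publishes under the `hooks` key of its meta API.

```python
import ipaddress

# Placeholder ranges taken from the RFC 5737 documentation blocks.
# These are NOT GitHub's real webhook ranges; they are assumptions for illustration.
ALLOWED_WEBHOOK_CIDRS = ["192.0.2.0/24", "198.51.100.0/24"]


def is_allowed_source(ip, cidrs=ALLOWED_WEBHOOK_CIDRS):
    """Return True if ip falls inside any of the allowed CIDR ranges."""
    addr = ipaddress.ip_address(ip)
    return any(addr in ipaddress.ip_network(cidr) for cidr in cidrs)


if __name__ == "__main__":
    for source in ("192.0.2.17", "203.0.113.5"):
        verdict = "allowed" if is_allowed_source(source) else "denied"
        print(source, verdict)
```

In a mesh or gateway, the same idea would normally be expressed as policy configuration rather than application code; the sketch only shows the matching logic.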

Fixed a bug in istio/istio#53245

@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes, area/prow, sig/k8s-infra, sig/testing, and approved labels Sep 25, 2024
@k8s-ci-robot k8s-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Sep 25, 2024
@upodroid upodroid added the tide/merge-method-squash Denotes a PR that should be squashed by tide when it merges. label Sep 25, 2024
@upodroid upodroid requested review from dims, BenTheElder and ameukam and removed request for cjwagner and alvaroaleman September 25, 2024 17:06
@BenTheElder
Member

We need auth and gRPC load balancing (gRPC is required for multicluster observability), which I can't easily do with GCE Ingress.

Can you elaborate on the grpc bit for future reference? What exactly are we doing here?

We deploy an oauth2 proxy and secure the Argo CD UI. It's now available for viewing over the internet for specific users.

kubectl proxy ? /shrug

Somewhat concerned about keeping this patched.
We could use the managed addon?

@ameukam
Member

ameukam commented Sep 25, 2024

We'll be deploying Atlantis next for Terraform automation, and that needs public endpoints with traffic restricted to GitHub webhooks.

Did we get consensus on this? We explored this possibility, but I don't think we made a decision to roll out Atlantis. Or did I miss it?

@upodroid
Member Author

upodroid commented Sep 25, 2024

Can you elaborate on the grpc bit for future reference? What exactly are we doing here?

Thanos with multiple clusters: https://medium.com/@yakuphanbilgic3/thanos-for-multi-cluster-monitoring-in-aws-eks-clusters-using-s3-storage-09c684e05e2b. We need a unified Grafana observability stack with the metrics from all the K8s clusters, and Managed Prometheus would be insanely expensive.

Somewhat concerned about keeping this patched.

All of the programs involved are Go programs and much easier to keep patched than nginx. It's also one of the reasons why I kept the main Prow cluster stack simple, with minimal external components. There are a lot of powerful programs available that can make our lives easier.

Did we get consensus on this? We explored this possibility, but I don't think we made a decision to roll out Atlantis. Or did I miss it?

I'm planning on starting the exploration for that. FYI, Atlantis is the Terraform PR automation that we'll be configuring on this repo. https://www.runatlantis.io/

@upodroid upodroid force-pushed the istio branch 3 times, most recently from cd62178 to 3f9b082 Compare October 2, 2024 22:27
@BenTheElder
Member

All of the programs involved are Go programs and much easier to keep patched than nginx. It's also one of the reasons why I kept the main Prow cluster stack simple, with minimal external components. There are a lot of powerful programs available that can make our lives easier.

nginx is one binary and a couple of libraries with a very stable API though, so I'm not sure that's a winning comparison.

I'm not current on the state of Istio. I last used it a few years ago, and it was a pretty large install with non-trivial overhead that seemed aimed at use cases we haven't really needed here (like mTLS).

I trust that you've thought about these though :-)

I'm planning on starting the exploration for that. FYI, Atlantis is the Terraform PR automation that we'll be configuring on this repo. https://www.runatlantis.io/

I think Arnaud is objecting to this phrasing because it sounds like we've made a collective decision when we haven't spent much time on this yet, rather than necessarily disagreeing that this is something we should do. At least, that's how I read it.

@BenTheElder
Member

Also, nginx is only sitting in front of some of our pods, and despite there being no breaking changes to worry about, we have not done a good job keeping it patched, so ... that's actually why I'm a little concerned about deploying this versus using the cluster-provided gateway implementation and doing metrics collection with an agent. This is more to keep updated, and we don't have a lot of ops bandwidth.

@BenTheElder
Member

/lgtm
/approve
/hold

@k8s-ci-robot k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Oct 3, 2024
@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 3, 2024
@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: BenTheElder, upodroid

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@upodroid
Member Author

upodroid commented Oct 4, 2024

/hold cancel

@k8s-ci-robot k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Oct 4, 2024
@k8s-ci-robot k8s-ci-robot merged commit ad89a41 into kubernetes:main Oct 4, 2024
3 checks passed
@k8s-ci-robot k8s-ci-robot added this to the v1.32 milestone Oct 4, 2024