New method for providing configurable self-hosted LB/DNS/VIP for on-prem #524
Conversation
cc: @patrickdillon
What a small world! I stumbled on this today right before I planned to write a script to remove all the KNI stuff from the MCO configs, since I have external DNS/LB available. I'm a 👍 on this!
Great to see some work on giving the cluster admin more control over the LB/DNS stack. This is a highly desired feature for deployments on OpenStack.
Some thoughts inline.
As I mentioned in one of my comments, I'm still a little unclear about the plan for supporting external LBs. What I had in mind is not quite what I'm hearing from other people, so it might be good if we could have a sync meeting when I'm back from PTO.
/cc @Miciah

### Open Questions [optional]

- Is `cluster-hosted-net-services-operator` an acceptable name?
cluster-self-hosted-load-balancers-operator ?
In order to run the CHNSO components, the Kubelet on each node must first join the cluster by communicating with the control plane; however, the components provided by the CHNSO are what make that communication possible.
As a result, the following circular dependency must be addressed:
* Kubelet can't talk to the control plane until the CHNSO has started.
* CHNSO can't start until Kubelet can talk to the control plane.
this also applies to kube-controller-manager and kube-scheduler. They both use the API VIP to communicate with the API. In other words: without API VIP no deployments, no replicasets, no pods.
### Open Questions [optional]

- Is `cluster-hosted-net-services-operator` an acceptable name?
s/net/network ?
I know there was some objection to the "cluster-hosted" part of the name previously. What if we replaced that with "internal"? I think the main thing we're trying to communicate here is that on other platforms these are external services provided by the cloud, but for on-prem they need to be hosted internally.
I believe the "services" part was also considered redundant, so would something like "internal-network-operator" be a better option? It's shorter, at least. :-) The one major objection I could see to that is keepalived is providing an externally available VIP, and the internal name might be confusing in that context.
@yboaron, what I'm missing in the enhancement is:

### Suggested design

To support early clustering requirements, Keepalived will continue running as static pods through the MCO; additionally, a new dnsmasq service (also deployed through the MCO) will run on each node, while HAProxy, CoreDNS-MDNS and MDNS-publisher will run as static pods through the new operator.
This contradicts line 64. Who will run keepalived on masters?
Keepalived should be run by the MCO, and according to the description from line 64, services on masters will be managed by both the MCO and the new operator.
It seems OK to me.
Am I missing something?
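For readers less familiar with how the MCO piece of this design works, here is a rough sketch of the kind of MachineConfig that could deliver a keepalived static pod manifest to master nodes. The name, file path and payload below are illustrative only, not the actual manifests shipped by the installer or the MCO.

```yaml
apiVersion: machineconfiguration.openshift.io/v1
kind: MachineConfig
metadata:
  # Illustrative name; the real manifests rendered for on-prem platforms differ.
  name: 99-master-keepalived-example
  labels:
    machineconfiguration.openshift.io/role: master
spec:
  config:
    ignition:
      version: 3.2.0
    storage:
      files:
        # Static pod manifests placed here are started directly by kubelet,
        # without needing a reachable API server, which is what lets keepalived
        # come up before the VIP (and hence the API endpoint) exists.
        - path: /etc/kubernetes/manifests/keepalived.yaml
          mode: 420
          contents:
            source: data:text/plain;charset=utf-8;base64,<base64-encoded-static-pod-manifest>
```

Because static pods are managed by kubelet alone, delivering keepalived this way sidesteps the circular dependency described earlier in the thread.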
#### Self hosted API Loadbalancer

- The current self-hosted LB implementation (based on Keepalived and HAProxy) doesn't support graceful switchover, which means connections will break upon shutdown of the node holding the VIP.
- The self-hosted API loadbalancer will run similarly to the current mode.
why only similarly? Why not equally?
Will change that
`Progressing=False` when the `Infrastructure` resource is a platform
type other than openstack, baremetal, ovirt or vsphere.
- Update `ClusterOperator` DEGRADED field in accordance with the following healthchecks (in case the self-hosted stack is enabled):
  - single node holds the VIP
can there be multiple holding the VIP ? How would you know?
Ohh, my fault, I'll update this line
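To make the quoted status requirements more concrete, the operator's `ClusterOperator` conditions could end up looking roughly like the sketch below when the VIP healthcheck fails. The operator name, reasons and messages are invented for illustration; only the Progressing/Degraded condition types come from the quoted text and the standard ClusterOperator API.

```yaml
apiVersion: config.openshift.io/v1
kind: ClusterOperator
metadata:
  name: cluster-hosted-net-services   # placeholder; the operator name is still under discussion above
status:
  conditions:
    - type: Progressing
      status: "False"
      reason: AsExpected
    - type: Degraded
      status: "True"
      reason: VIPHealthCheckFailed                            # illustrative reason
      message: The API VIP is not held by exactly one node.   # illustrative message
    - type: Available
      status: "True"
```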
Below is a possible option for a CRD instance of this operator:

```yaml
apiVersion: clusterHostedNetServices.operator.io/v1alpha1
```
operator.openshift.io/v1
OK
spec:
  dns:
    nodesResolution: Enabled
    appsResolution: Enabled
what are apps? Do you mean services?
It's the .apps wildcard DNS record
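Putting the review feedback in this thread together, a revised version of the example CR could look roughly like this. The kind, the `loadBalancer` stanza and the concrete values are illustrative guesses assembled from fragments quoted elsewhere in this PR (the `apiintIpAddress` field and the 192.168.111.5 address appear in the section quoted just below); only the `operator.openshift.io/v1` apiVersion is the reviewer's actual suggestion.

```yaml
apiVersion: operator.openshift.io/v1   # apiVersion suggested above instead of the v1alpha1 group
kind: ClusterHostedNetServices         # illustrative kind; the operator name is still being debated
metadata:
  name: cluster
spec:
  dns:
    nodesResolution: Enabled    # resolve node names via the self-hosted DNS stack
    appsResolution: Enabled     # serve the *.apps wildcard record (not Kubernetes Services)
  loadBalancer:
    apiintIpAddress: 192.168.111.5   # illustrative; matches the api-int example quoted below
```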
- resolve node names and the .apps wildcard record
- resolve api-int to 192.168.111.5.
- Run the self-hosted Loadbalancer for the API only if apiintIpAddress is equal to the API-VIP value provided in the install-config file.
what is that API-VIP value during cluster runtime?
The VIP address provided in the install-config.yaml file
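For context, the API-VIP being referred to is the one set at install time. On the baremetal platform, for example, it is typically specified in `install-config.yaml` along these lines (the addresses are made up for the example):

```yaml
# Excerpt of an install-config.yaml for a bare metal IPI install (illustrative values)
platform:
  baremetal:
    apiVIP: 192.168.111.5      # becomes the api-int address served by the self-hosted LB
    ingressVIP: 192.168.111.4  # VIP used for *.apps ingress traffic
```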
In order to migrate the self-hosted LB to an external load balancer, the admin should:

- Provide a new IP address (!= API-VIP from install-config) pointing to the external load balancer front end.
is there no way to use a DNS name only?
I don't think it's possible, since there are kubeconfig files (e.g. /var/lib/kubelet/kubeconfig) pointing to the https://api-int.ostest.test.metalkube.org:6443 server
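To sketch what such a migration step might look like in practice, the admin-facing change could be an edit to the operator CR that repoints `apiintIpAddress` at the external load balancer's frontend address. The field name is reused from the illustrative design quoted above and is not a finalized API:

```yaml
spec:
  loadBalancer:
    # New address, different from the API-VIP in install-config, fronted by the
    # external load balancer; per the quoted design, the operator would then
    # stop running the self-hosted HAProxy/Keepalived pair for the API.
    apiintIpAddress: 192.168.111.20   # illustrative external LB frontend IP
```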
Removing hold now that processes keep running as static pods. /hold cancel
…r on-prem

In the current implementation, the self-hosted DNS/LB/VIP stack runs under the MCO umbrella by default on all on-prem platforms and there is no option to configure it. Some customers have their own external load balancing and DNS resolution; besides, there are cases in which some of the traffic used to provide these self-hosted services is disallowed in the customer's network. The outcome of such scenarios is that cluster resources are spent on providing unused services. This PR suggests a new configurable method for providing the self-hosted stack.
As this seems to have stalled: I have removed my hold, as the technical concerns about DaemonSet are gone after targeting static pods again. Hence, this is mainly a concern of the MCO team now and of general platform strategy. And speaking about my view of general platform strategy: this enhancement allows customers to use their own LB even on these on-prem platforms. I think this is the wrong direction. We should rather become or stay opinionated where we can. I don't see value for customers in having their own internal API LB. Instead, this increases support costs of the product. Note: I cannot comment on the need for the DNS part of this enhancement. As this is about general direction, I would like to hear the architects' opinion: @smarterclayton @derekwaynecarr
Sorry for the late response (it took time to clarify the detailed requirements around internal API/Ingress traffic with the PMs). After further discussions with PM, it was recently decided that until we have detailed requirements from customers regarding internal API/Ingress traffic, it would be possible for external API/Ingress traffic to run through an external LB, similar to what shiftstack is doing [0], while internal API/Ingress will continue running through the self-hosted stack. As for the on-prem infra DNS components (coredns-mdns and mdns-publisher), there is already ongoing work [1], [2] to replace them with dnsmasq (similar to what is described in the [3] enhancement). So, I assume I should close this PR. [0] https://docs.openshift.com/container-platform/4.7/networking/load-balancing-openstack.html#nw-osp-configuring-external-load-balancer_load-balancing-openstack
/close
@yboaron: Closed this PR. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.