insights: Insights-gateway #447
/assign @mfojtik @smarterclayton
From a business perspective it makes sense to me. To be absolutely clear, the components/operators using the c.r.c Insights gateway will be responsible for payload size and data privacy; the Insights gateway won't process the data in any way. Correct @iNecas ?
Correct @radekvokal
[APPROVALNOTIFIER] This PR is NOT APPROVED. This pull-request has been approved by: iNecas. The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing `/approve` in a comment.
Force-pushed from 096e077 to 355b418.
The drawbacks are all things I consider significant - I would probably lean towards a library approach (and expected it to be discussed here if there are gaps).
> Given insights-operator is already part of the OCP cluster and has proper credentials
> configured already as part of the installation, we're proposing for it to expose
> a proxy for other components to be able to send the payloads on their behalf.
Hrm - I would probably prefer these components remain decoupled architecturally. Insights is providing support data about the cluster - it really is not a generic operator (generic operators have a high bar).
There must be an extremely strong reason to expose a new generic API - it would probably be a better start to expose a library those operators can use rather than trying to provide a service. The justification for a service over a library would have to be very compelling.
Looping in @jhjaggars and especially @chambridge to perhaps fill in some gaps around requirements/justification and more thoughts on the library alternative.
A few things regarding the library approach:
- some targeted operators are implemented with ansible (https://github.com/project-koku/korekuta-operator/blob/5fe7938ee87ca08dec6181cc0e37083979def62c/roles/collect/tasks/main.yml#L339): the library doesn't seem to help much with that
- with the library approach, the operator would need access to secrets in the `openshift-config` namespace: would we want to allow this, or minimize access to that resource? Or would it mean the credentials need to be handled differently? The gateway approach limits the access to the upload only: nothing else is exposed.
- the library still doesn't provide a single place for an overview of what data is flowing to c.r.c.
- additional requirements around networking that we've already seen and solved in insights-operator (such as the service-specific proxy implemented in openshift/insights-operator@65e2183) would need to be implemented again or left as gaps.
As for extending the operator into a generic one, we could argue the scope is still narrowed to providing the c.r.c-related services, not being a single place for all outgoing traffic beyond the c.r.c ingress.
As mentioned at the bottom of this doc (https://github.com/openshift/enhancements/pull/447/files#diff-74d208115f9ccd14e14cb6d90d466a9eR196), long term we would like to see a generic solution, one that even insights-operator could use, provided by OCP core.
For the API, would adding `v1alpha` in the endpoint help with the concerns in any way? (A sketch of what a versioned upload could look like follows.)
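For illustration only, a minimal sketch of what a versioned upload from a client operator might look like; the gateway service name, the `/v1alpha/upload` path, and the content type are assumptions, not a final API:

```go
package main

import (
	"bytes"
	"fmt"
	"net/http"
)

func main() {
	// Hypothetical versioned endpoint exposed by the gateway inside the
	// cluster; bumping the version segment would make breaking changes
	// explicit to client operators.
	const uploadURL = "https://insights-gateway.openshift-insights.svc/v1alpha/upload"

	payload := bytes.NewReader([]byte("...archive bytes...")) // e.g. a tar.gz produced by the operator
	resp, err := http.Post(uploadURL, "application/vnd.redhat.openshift.periodic+tar", payload)
	if err != nil {
		fmt.Println("upload failed:", err)
		return
	}
	defer resp.Body.Close()
	fmt.Println("gateway responded with", resp.Status)
}
```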
The goal here is to create a secure channel into cloud.redhat.com for data that doesn't fit telemetry yet provides value to customers through applications built on c.r.c. Here's a discussion of why Cost Management can't meet the budget and cardinality requirements of the monitoring team, and also why the current dataflow into c.r.c is not optimal: https://docs.google.com/document/d/1gdxlc37-CMniwptdccob4C3fwrAZNgyOQFDNL5msCsk/edit#heading=h.v7h8j8wxgjo2
Taking a library approach solves a couple of the issues we want to address:
- abstracting the mechanics of talking to cloudDot (HTTP client configuration and URI handling)
- centralizing the proxy configuration
It doesn't address gaining access to the credentials necessary for authentication with cloudDot.
Additionally, a library approach grants us the ability to just do it, which I really like.
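To make the comparison concrete, here is a minimal sketch of what such a library's surface could look like; the import path, type, and function names are all hypothetical, since no such library exists yet:

```go
package main

import (
	"bytes"
	"context"
	"fmt"

	// Hypothetical import path for the proposed library.
	insights "github.com/openshift/insights-upload-client"
)

func main() {
	// The library would hide the mechanics discussed above: resolving the
	// cloud.redhat.com ingress URL, configuring the HTTP client (including
	// proxy settings), and attaching credentials on each request.
	client, err := insights.NewClientFromCluster(context.TODO())
	if err != nil {
		panic(err)
	}

	payload := bytes.NewReader([]byte("...archive bytes..."))
	if err := client.Upload(context.TODO(), "cost-management", payload); err != nil {
		fmt.Println("upload failed:", err)
	}
}
```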
I'm waiting for more feedback on this proposal from the OLM operators interested in this feature.
If we did go the library approach, where should the proxy configuration live?
Can someone describe how the library would work with an Ansible-based operator?
Would this be something we'd have to put into an Ansible collection? I'd also like to understand the impact on EXD: how would the library be built and made available for a certified operator release? I think the documentation for the library would need some information on how an operator would gather the pull secret config, plus any necessary RBAC role/cluster role needs.
But if all that is easy then I'm okay with a library approach, as it might have the side effect of not needing all the backporting into z-streams.
I assume the library would also take into account the cluster global proxy configuration?
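On the global proxy point: the cluster-wide proxy settings live in the cluster-scoped `Proxy` object of the `config.openshift.io/v1` API, which the library could read. A minimal sketch, assuming in-cluster credentials with permission to read that object (the surrounding wiring is illustrative):

```go
package main

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/rest"

	configclient "github.com/openshift/client-go/config/clientset/versioned"
)

func main() {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		panic(err)
	}
	client, err := configclient.NewForConfig(cfg)
	if err != nil {
		panic(err)
	}

	// The cluster-wide proxy configuration is a single cluster-scoped
	// object named "cluster".
	proxy, err := client.ConfigV1().Proxies().Get(context.TODO(), "cluster", metav1.GetOptions{})
	if err != nil {
		panic(err)
	}
	fmt.Println("httpsProxy:", proxy.Status.HTTPSProxy, "noProxy:", proxy.Status.NoProxy)
}
```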
For the ansible-operator, I can think of two ways (based more on how I think ansible-operator works, without much experience):
- exposing the library via an executable that would be available in the operator image and used instead of the `curl` it uses today. You would need to figure out how to add the binary to your image. The disadvantage would be that it would need to read the configuration on every upload, or deal with caching.
- running as a sidecar container, exposing a local endpoint to provide the gathering functionality
Regarding the sidecar: what do you think about this being the generic approach, regardless of whether it's an ansible operator or not, @smarterclayton @jhjaggars? This way we could also standardize on metrics, and it would actually feel like the k8s way. A rough sketch of the idea is below.
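A minimal sketch of the sidecar idea: the sidecar listens on localhost inside the pod, and the main container (the operator's `curl` call today) posts its payload there instead of talking to cloud.redhat.com directly. The upstream URL, port, and header handling here are illustrative assumptions, not a spec:

```go
package main

import (
	"fmt"
	"io"
	"net/http"
)

// Assumed c.r.c ingress endpoint; illustrative only.
const ingressURL = "https://cloud.redhat.com/api/ingress/v1/upload"

// forward relays a payload received on the pod-local endpoint to c.r.c,
// attaching the credentials so the main container never needs them.
func forward(w http.ResponseWriter, r *http.Request) {
	req, err := http.NewRequest(http.MethodPost, ingressURL, r.Body)
	if err != nil {
		http.Error(w, err.Error(), http.StatusInternalServerError)
		return
	}
	req.Header.Set("Content-Type", r.Header.Get("Content-Type"))
	// The sidecar, not the operator, attaches the token read from the
	// pull secret (see the RBAC discussion below).
	req.Header.Set("Authorization", "Bearer "+loadTokenFromPullSecret())

	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		http.Error(w, err.Error(), http.StatusBadGateway)
		return
	}
	defer resp.Body.Close()
	w.WriteHeader(resp.StatusCode)
	io.Copy(w, resp.Body)
}

// Placeholder for the pull-secret handling sketched later in this thread.
func loadTokenFromPullSecret() string { return "..." }

func main() {
	http.HandleFunc("/upload", forward)
	fmt.Println(http.ListenAndServe("127.0.0.1:8080", nil))
}
```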
What would be the advantages of using the library over the solution we have today, coded by ourselves? It seems that the long-term solution of an upload service will better address the air-gapped use cases, and would simplify credential management and RBAC configuration.
At least two advantages that I can think of:
- standardization of how to make the connection
- clear instructions for users to follow
The sidecar container approach (basically a service wrapper around the library) should address the ansible operator case. Would it be a big lift to include another image in the existing certification process for each operator?
Another question: could the OLM-managed operators create these bindings as part of the deployment process (or ask the administrator to do so)?
```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: some-olm-operator
  namespace: openshift-config
rules:
- apiGroups:
  - ""
  resources:
  - secrets
  resourceNames:
  - pull-secret
  - support
  verbs:
  - get
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: some-olm-operator
  namespace: openshift-config
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: some-olm-operator
subjects:
- kind: ServiceAccount
  name: operator
  namespace: some-olm-operator
```
If so, the sidecar could also pull the right token from the pull secret and watch for the changes.
The sidecar is probably my favorite one also, as it could standardize on things like tracked metrics. A sketch of the token handling is below.
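A minimal sketch of the token handling, assuming the Role above is in place; reading the `cloud.openshift.com` entry from the pull secret is how insights-operator obtains its token today, while the client wiring here is illustrative:

```go
package main

import (
	"context"
	"encoding/json"
	"fmt"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

// dockerConfigJSON mirrors the structure of the .dockerconfigjson payload.
type dockerConfigJSON struct {
	Auths map[string]struct {
		Auth string `json:"auth"`
	} `json:"auths"`
}

func main() {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(cfg)

	// Requires `get` on the pull-secret in openshift-config, as granted by
	// the Role above. A real sidecar would use a watch/informer to pick up
	// secret rotations instead of a one-shot Get.
	secret, err := client.CoreV1().Secrets("openshift-config").Get(
		context.TODO(), "pull-secret", metav1.GetOptions{})
	if err != nil {
		panic(err)
	}

	var dockerCfg dockerConfigJSON
	if err := json.Unmarshal(secret.Data[corev1.DockerConfigJsonKey], &dockerCfg); err != nil {
		panic(err)
	}
	fmt.Println("token for cloud.openshift.com:", dockerCfg.Auths["cloud.openshift.com"].Auth)
}
```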
> each needing additional data to be sent from the OCP clusters, such as:
>
> - [cost management](https://github.com/project-koku/korekuta-operator)
> - [subscription watch](https://github.com/chambridge/subscription-watch-operator)
We are generally moving towards all of this data being sent through telemetry in the future.
> - [cost management](https://github.com/project-koku/korekuta-operator)
> - [subscription watch](https://github.com/chambridge/subscription-watch-operator)
> - [marketplace](https://github.com/redhat-marketplace/redhat-marketplace-operator)
Over time, I expect all of this data to flow through telemetry once we have completed the necessary data interlock.
Issues go stale after 90d of inactivity. Mark the issue as fresh by commenting `/remove-lifecycle stale`. If this issue is safe to close now please do so with `/close`. /lifecycle stale
Stale issues rot after 30d of inactivity. Mark the issue as fresh by commenting `/remove-lifecycle rotten`. If this issue is safe to close now please do so with `/close`. /lifecycle rotten
Rotten issues close after 30d of inactivity. Reopen the issue by commenting `/reopen`. /close
@openshift-bot: Closed this PR. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.