Add enhancement proposal for result forwarding #69

Conversation

@rhmdnd commented Jul 15, 2022

This commit writes down the proposal for implementing a results
forwarding mechanism in the compliance operator.

Co-Authored-By: Juan Antonio Osorio juan.osoriorobles@eu.equinix.com

@rhmdnd (Author) commented Jul 25, 2022

/retest

Required: True
Type: string

Reference to a server certificate/keypair secret for mutual TLS.
Confused as to why we need the server cert; I thought the aggregator was a client only. Or is this secret meant for the gRPC service?

@rhmdnd (Author):

I thought this was for the gRPC server, but I'll let @mrogers950 and @JAORMX clarify.
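
For illustration, a minimal Go sketch of what the server side of that keypair secret could look like, assuming the secret is mounted into the gRPC server pod (the file paths, port, and service registration are hypothetical, not the operator's actual layout):

```go
package main

import (
	"crypto/tls"
	"crypto/x509"
	"log"
	"net"
	"os"

	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials"
)

func main() {
	// Server certificate and key from the mounted keypair secret
	// (paths are assumptions).
	cert, err := tls.LoadX509KeyPair("/etc/result-server/tls.crt", "/etc/result-server/tls.key")
	if err != nil {
		log.Fatalf("loading keypair: %v", err)
	}

	// CA bundle used to verify client certificates (the "mutual" part of mTLS).
	caPEM, err := os.ReadFile("/etc/result-server/ca.crt")
	if err != nil {
		log.Fatalf("reading CA bundle: %v", err)
	}
	caPool := x509.NewCertPool()
	caPool.AppendCertsFromPEM(caPEM)

	creds := credentials.NewTLS(&tls.Config{
		Certificates: []tls.Certificate{cert},
		ClientCAs:    caPool,
		ClientAuth:   tls.RequireAndVerifyClientCert,
	})

	lis, err := net.Listen("tcp", ":8443")
	if err != nil {
		log.Fatalf("listen: %v", err)
	}
	srv := grpc.NewServer(grpc.Creds(creds))
	// The result-forwarding service would be registered here.
	if err := srv.Serve(lis); err != nil {
		log.Fatalf("serve: %v", err)
	}
}
```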


#### `resultForwarding.provider`

Required: True

Possibly a dumb question, but what does 'Required' mean here? "Does not have a default and must be set"?
Shouldn't gRPC to the local CRD-creating service be the default?


I was even wondering if we should have a grpc-local provider as the default that would nominally talk gRPC, but over a UNIX socket in some shared volume rather than over TCP. This would allow us to get away with not requiring mTLS for the communication between the components.
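
A minimal Go sketch of that grpc-local idea, assuming a socket path on a shared volume (the path and the plaintext credentials are illustrative only):

```go
package localgrpc

import (
	"log"
	"net"

	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials/insecure"
)

// socketPath is an assumed location on a volume shared by the components.
const socketPath = "/var/run/compliance/results.sock"

// listenLocal serves the gRPC API on the UNIX socket instead of TCP.
func listenLocal() (*grpc.Server, net.Listener) {
	lis, err := net.Listen("unix", socketPath)
	if err != nil {
		log.Fatalf("listen on %s: %v", socketPath, err)
	}
	return grpc.NewServer(), lis
}

// dialLocal connects over the same socket; transport security is skipped
// because the traffic never leaves the pod, which is the point of the idea.
func dialLocal() *grpc.ClientConn {
	conn, err := grpc.Dial("unix://"+socketPath,
		grpc.WithTransportCredentials(insecure.NewCredentials()))
	if err != nil {
		log.Fatalf("dial %s: %v", socketPath, err)
	}
	return conn
}
```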

@rhmdnd (Author):

Possibly a dumb question, but what does 'Required' mean here? "Does not have a default and must be set"?
Shouldn't gRPC to the local CRD-creating service be the default?

Yeah - I think required means it must be specified by the user.

Maybe we need to go through the various options and pick the best default for backwards compatibility. A gRPC provider makes a lot of sense for the external storage case, but it requires the end user to know where they want to send results, something they don't necessarily have to care about today since results just go into etcd, right?


So of course the downside of having the gRPC service exposed over local sockets only is that we wouldn't test the network forwarding and would have more configurations to test.

With the grpc-over-network approach, what would the default CO deployment look like? I was imagining we'd have one more deployment with the gRPC server, a service, and then the gRPC client would use http://service as the endpoint (and since we'd know the service name, everything could just use defaults).
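
A sketch of that client side under those assumptions (the Service name, namespace, and port are made up; creds would come from the mTLS secret discussed above):

```go
package networkgrpc

import (
	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials"
)

// dialDefaultEndpoint connects to the hypothetical in-cluster Service by
// its DNS name; because the name is known ahead of time, the endpoint can
// simply default to it.
func dialDefaultEndpoint(creds credentials.TransportCredentials) (*grpc.ClientConn, error) {
	return grpc.Dial("result-server.openshift-compliance.svc.cluster.local:8443",
		grpc.WithTransportCredentials(creds))
}
```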


@mrogers950 do you have an opinion?


If it's true that we only need to forward the objects that the aggregator is responsible for (results and remediations, see my comment above), my thought is that our "local" mode would just be the existing CR creation, always on by default, and if resultForwarding.provider is set, there is also an attempt to forward. So resultForwarding.Provider would be optional, and we provide another option, resultForwarding.DisableLocal, that is false by default and only takes effect when set to true while a provider is also set.
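
A sketch of those semantics in Go (field names follow the comment; everything here is illustrative):

```go
package forwarding

// ResultForwarding mirrors the options discussed above.
type ResultForwarding struct {
	Provider     string // optional; empty means "do not forward"
	DisableLocal bool   // defaults to false
}

// shouldForward: forwarding is attempted only when a provider is set.
func (rf ResultForwarding) shouldForward() bool {
	return rf.Provider != ""
}

// shouldCreateLocalCRs: local CR creation stays on by default and can only
// be disabled when a provider is also set, so results are never silently
// dropped by a misconfiguration.
func (rf ResultForwarding) shouldCreateLocalCRs() bool {
	return !(rf.DisableLocal && rf.Provider != "")
}
```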


On the topic of the local gRPC traffic, since this is only results, I think that making a local provider and all would be overkill (requires the aggregator to break up into a client/server). For the evidence proposal that type of pattern makes more sense for what we want to accomplish there, since that's where our existing component client+servers are involved.


Hmm, I guess your proposal would also be safer (more stable) in the sense that the aggregator wouldn't change that drastically from a self-contained binary to a client/server.

Just FWIW, the reason I proposed the always-forwarder was that it felt architecturally (even if local) much cleaner, where the aggregator would be just a forwarder to a handler, because we'd have to implement the forwarding either way.


I'm not opposed to moving towards that if it makes sense for the rest of the forwarding features. We could still build the server implementation but keep it as a test component to run the aggregator client against.


That sounds like a good compromise. We need a testbed anyway, and having a test server that produces the CRDs would also make it easier to do e2e testing.

Good idea!

rhmdnd force-pushed the result-forwarding-enhancement branch from 67e4594 to d586a0f on August 2, 2022
to a configured implementation. In this case, the **aggregator** will act as a
gRPC client. The **resultserver** then becomes a gRPC server, subject to the
evidence forwarding provider; it will be renamed **evidence-persistor** and
will be responsible for writing results.
@rhmdnd (Author):

In this case, what does the aggregator currently use to pass results to the resultserver?

Or are we just saying that the aggregator will be a gRPC client, and an example server implementation would be resultserver or rhmdnd/compserv?

Collaborator:

The former.

Currently, the aggregator parses all the results coming as ConfigMaps from the nodes.

It "aggregates" them together, forming groups from the different machine pools. e.g. one result is applicable to a set of nodes, not just one node. This is done also to detect inconsistencies in pools: "One node deviated from the benchmark configuration, it should be fixed."

Finally, it takes that aggregated result and creates the CRDs based on this.

The idea is to change the last step of that model and turn the aggregator into a gRPC client that always forwards. This way the code becomes more general, and we can then focus on writing server implementations: be it the default one that creates CRDs, or compserv, which centralizes the results.
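
A rough Go sketch of that shape, purely illustrative (the type and field names are made up; client.Client is controller-runtime's client):

```go
package forwarding

import (
	"context"

	"google.golang.org/grpc"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// AggregatedResult stands in for the aggregator's per-pool output.
type AggregatedResult struct {
	CheckID string
	Status  string
	Nodes   []string
}

// ResultForwarder is the aggregator's new final step: it always forwards.
type ResultForwarder interface {
	Forward(ctx context.Context, results []AggregatedResult) error
}

// crForwarder keeps today's behaviour by "forwarding" into the cluster,
// i.e. creating the result/remediation CRs.
type crForwarder struct{ c client.Client }

// grpcForwarder streams the same payload to an external collector such as
// compserv.
type grpcForwarder struct{ conn *grpc.ClientConn }
```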

@rhmdnd (Author):

Thanks for the additional details. I attempted to clarify that in the wording.

rhmdnd force-pushed the result-forwarding-enhancement branch from d586a0f to 0e08eac on August 29, 2022

#### `resultForwarding.provider`

Required: True
Collaborator:

I'd say always requiring the gRPC traffic to go through the network would help in keeping things consistent, e.g. you wouldn't need a code path for creating a UNIX socket and a separate goroutine in the aggregator to handle it.


#### `resultForwarding.provider`

Required: True
Collaborator:

I also agree that we should choose a sane default that's backwards compatible; that is what users expect, and it would help with migration.



#### `resultForwarding.provider`

Required: True
@rhmdnd (Author):

It sounds like this should still be required, then?

Should we also have a field for marking a default value?

rhmdnd force-pushed the result-forwarding-enhancement branch from 0e08eac to 6f6c110 on August 30, 2022
rhmdnd force-pushed the result-forwarding-enhancement branch from 6f6c110 to b0822ea on August 30, 2022
rhmdnd force-pushed the result-forwarding-enhancement branch from b0822ea to 06ab0c6 on September 1, 2022
@jhrozek commented Sep 2, 2022 via email

@JAORMX (Collaborator) commented Sep 2, 2022

  • (unsure about this part) if the CM is not set and the autodetection is not possible, what then? A random string (that would have to be stable across re-runs)? Or just a hard fail?

I like the idea of having a random string. It would be ideal to have a way to identify a unique deployment.

@rhmdnd (Author) commented Sep 6, 2022

I like the idea of having a random string. It would be ideal to have a way to identify a unique deployment.

Ok, let me summarize to make sure I understand where we are:

The Compliance Operator will check for a ConfigMap within the operator namespace (e.g., openshift-compliance) that is specific to the Compliance Operator's needs (e.g., data.compliance_cluster_id = "b876fd70-b6fb-4c9d-9bb6-b53e87d10ac7"). If this ConfigMap doesn't exist when the Compliance Operator's gRPC forwarding implementation needs it, it will create it using a random string for the compliance_cluster_id.

Administrators can modify this value if they choose to (e.g., data.compliance_cluster_id = "us-east-pci-dss"). The Compliance Operator only cares that the value exists so that it can use it.

We could recommend that users populate this ConfigMap prior to using scan settings with gRPC forwarding if they want consistency from the beginning of the forwarding history.

Is that correct?
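
To make the summary concrete, a client-go sketch of that get-or-create behaviour (the ConfigMap name and the use of a UUID as the random string are assumptions; the key follows the comment above):

```go
package forwarding

import (
	"context"

	"github.com/google/uuid"
	corev1 "k8s.io/api/core/v1"
	apierrors "k8s.io/apimachinery/pkg/api/errors"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// complianceClusterID returns the cluster ID from the ConfigMap, creating
// the ConfigMap with a random value if it does not exist yet.
func complianceClusterID(ctx context.Context, c kubernetes.Interface) (string, error) {
	cms := c.CoreV1().ConfigMaps("openshift-compliance")
	cm, err := cms.Get(ctx, "compliance-cluster-id", metav1.GetOptions{})
	if apierrors.IsNotFound(err) {
		cm = &corev1.ConfigMap{
			ObjectMeta: metav1.ObjectMeta{Name: "compliance-cluster-id"},
			Data:       map[string]string{"compliance_cluster_id": uuid.NewString()},
		}
		cm, err = cms.Create(ctx, cm, metav1.CreateOptions{})
	}
	if err != nil {
		return "", err
	}
	// Administrators may overwrite the value (e.g. "us-east-pci-dss");
	// the operator only cares that it exists.
	return cm.Data["compliance_cluster_id"], nil
}
```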

@jhrozek left a comment

/lgtm
imo we should just continue iterating on the doc as we implement the API

@sheriff-rh (Collaborator) left a comment

Looks great - one small educational question. I like this proposal, seems like it would be very helpful.

Requires a network connection to forward results after each scan. The
Compliance Operator should validate the endpoint URL and fail early if it is
malformed. If the Compliance Operator cannot connect to the gRPC endpoint, it
should retry and issue an alert.
Collaborator:

This seems to answer the question posed on line 436, right?

@rhmdnd (Author):

Yeah - I think we could supply a configuration, through documentation, that tells users how to configure the compliance-operator to forward results and how to store them locally.

The safety net in that case would be that the results are saved somewhere in a persistent volume if the gRPC forwarding fails.
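
A sketch of that validate-early, retry-then-alert flow (the retry budget, backoff, and alerting hook are all placeholders):

```go
package forwarding

import (
	"context"
	"fmt"
	"net/url"
	"time"
)

// forwardWithRetry validates the endpoint up front, then retries the send
// with exponential backoff before giving up and alerting.
func forwardWithRetry(ctx context.Context, endpoint string, send func(context.Context) error) error {
	// Fail early on a malformed endpoint, before any scan work is wasted.
	if _, err := url.Parse(endpoint); err != nil {
		return fmt.Errorf("invalid endpoint %q: %w", endpoint, err)
	}

	backoff := time.Second
	var lastErr error
	for attempt := 0; attempt < 5; attempt++ { // retry budget is a placeholder
		if lastErr = send(ctx); lastErr == nil {
			return nil
		}
		select {
		case <-ctx.Done():
			return ctx.Err()
		case <-time.After(backoff):
			backoff *= 2
		}
	}
	// At this point an alert would be issued (metric, Kubernetes Event, ...);
	// the results still sit in the persistent volume as the safety net.
	return fmt.Errorf("forwarding to %s failed after retries: %w", endpoint, lastErr)
}
```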

openshift-ci bot commented Sep 20, 2022

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jhrozek, rhmdnd, sheriff-rh

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@rhmdnd (Author) commented Sep 27, 2022

/lgtm
imo we should just continue iterating on the doc as we implement the API

I'm on board with updating the document as we implement the API and configuration.

@JAORMX @mrogers950 @Vincent056 thoughts?

@sheriff-rh sheriff-rh removed their assignment Nov 3, 2022
@xiaojiey (Collaborator) commented Feb 7, 2023

/label qe-approved

openshift-merge-robot merged commit 0cf1555 into ComplianceAsCode:master on Feb 7, 2023