[processor/transform] Set attribute values using connection context #33288

ptodev · 2024-05-29T15:05:33Z

Component(s)

processor/transform

Is your feature request related to a problem? Please describe.

The attribute and resource processors have a from_context config argument which can be used to set attribute values using connection context:

  # Key specifies the attribute to act upon.
- key: <key>
  action: {insert, update, upsert}
  # FromContext specifies the context value to use to populate the attribute value. 
  # If the key is prefixed with `metadata.`, the values are searched
  # in the receiver's transport protocol additional information like gRPC Metadata or HTTP Headers. 
  # If the key is prefixed with `auth.`, the values are searched
  # in the authentication information set by the server authenticator. 
  # Refer to the server authenticator's documentation part of your pipeline for more information about which attributes are available.
  # If the key doesn't exist, no action is performed.
  # If the key has multiple values the values will be joined with `;` separator.
  from_context: <other key>

It would be nice to be able to do the same thing with the transform processor.

Describe the solution you'd like

The solution should work with all OTTL contexts, so probably we need to add a new OTTL function?

Describe alternatives you've considered

No response

Additional context

This feature will help reduce the need for attributes and resource processors.

Also, #18643 suggests that the transform processor may replace the attributes and resource processors. Having this feature would be a prerequisite for that.

The text was updated successfully, but these errors were encountered:

github-actions · 2024-05-29T15:05:52Z

Pinging code owners:

processor/transform: @TylerHelmuth @kentquirk @bogdandrutu @evan-bradley

See Adding Labels via Comments if you do not have permissions to add labels yourself.

github-actions · 2024-05-29T15:07:16Z

Pinging code owners for pkg/ottl: @TylerHelmuth @kentquirk @bogdandrutu @evan-bradley. See Adding Labels via Comments if you do not have permissions to add labels yourself.

github-actions · 2024-07-29T03:32:10Z

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

processor/transform: @TylerHelmuth @kentquirk @bogdandrutu @evan-bradley
pkg/ottl: @TylerHelmuth @kentquirk @bogdandrutu @evan-bradley

See Adding Labels via Comments if you do not have permissions to add labels yourself.

odubajDT · 2024-08-01T08:37:20Z

Hey @TylerHelmuth I would like to look at this issue

TylerHelmuth · 2024-08-06T22:35:24Z

@odubajDT @evan-bradley I'd like to talk about this more. We don't have syntax or a specific transformcontext to represent how to extract something from the connection context. I'd like to see some proposals on how we'd implement this before making a PR.

evan-bradley · 2024-08-07T13:45:26Z

Could we have that discussion on a draft PR? I would prefer to look at something concrete when considering possible designs. I think the high-level design is fairly straightforward (get a value from the context through a path/converter), and the implementation is what's under consideration.

evan-bradley · 2024-08-07T13:58:48Z

As an initial proposal, we'll want a ctx/context path that is required to be keyed by the path parser and calls (context.Context).Value with the key, and returns the return value (an any-typed value or nil).

As for the transform context, the most conventional way to do this would be to add context.Context as the first parameter to each NewTransformContext function, but this is something we can explore in a concrete implementation. Once the transform context has the context.Context reference, it's just a matter of retrieving it through a path.

If the implementation shows flaws, we can probably pull this off using a converter that relies on an interface implemented by the transform context, but I'd rather save this as a plan B.

TylerHelmuth · 2024-08-07T19:36:04Z

We keep breaking NewTransformContext functions as we need access to more data. Should we add an Option pattern?

evan-bradley · 2024-08-08T14:44:42Z

I think for now it's conceptually simpler for all data to be required. I think we will always have a context.Context object that can be used and can avoid the situation where certain context paths don't work in certain components.

In regards to breaking NewTransformContext, as non-ergonomic as it is from a function signature perspective, if we want everything to be required I'd rather break this at compilation time. The alternative is we implement a config struct or options pattern that we have to validate at runtime for containing the correct data, which would place the burden on users instead of component authors if something is required but not present.

TylerHelmuth · 2024-08-08T14:53:39Z

Ok lets try with just breaking it.

Are there any security concerns here? It seems likely you could do something like merge_maps(attributes, ctx, "upsert"), and expose keys.

evan-bradley · 2024-08-08T14:59:52Z

I dug into it and there's no way to iterate over context.Context without reflection. The path parser will need to require that you key the ctx path when using it. I think that should protect us from situations like the one you described, since if you ask for a key, it's your responsibility to know what's in it.

The one tricky thing is that any complex key/value types will be basically impossible to do without custom OTTL extensions. I'm not sure how we properly address this situation:

ctx := context.WithValue(ctx, StructKey{key: "key"}, CustomStructData{data: contextData})

Keying this would be impossible with OTTL, and even if you use e.g. a string key here instead, using the data is impossible unless you can e.g. convert the type to a map. I think this will just have to be a known caveat.

TylerHelmuth · 2024-08-08T15:13:10Z

Requiring an index should be enforceable by the ottl context path function. I agree we should only support string keys.

evan-bradley · 2024-08-08T15:33:44Z

I agree we should only support string keys.

We don't have to enforce this, (context.Context).Value takes an any type as a key and we can just pass a user-supplied value directly and return whatever Value returns (nil or the value). I was just pointing out that non-primitive typed keys won't work out-of-the-box.

Thinking on it, users could add Converters to provide support for complex key types, e.g. ctx[KeyFunc("key")] where KeyFunc returns whatever type they use to key context values. So I think we should be good to go here.

TylerHelmuth · 2024-08-08T15:36:51Z

In that case, I think we should be extremely strict about NOT adding any KeyFunc converters to pkg/ottlfuncs.

odubajDT · 2024-09-04T11:30:33Z

One additional question here, if I understand the implementation of attributes processor correctly, there is the possibility to extract context data only from the following context keys:

- "metadata."
- "auth."
- "client.address"

What in this case means connection context? Do we want to duplicate the exact same functionality or something else as well? Thanks in advance!

evan-bradley · 2024-09-04T13:10:00Z

What in this case means connection context?

The Collector passes a context.Context object down the pipeline alongside a payload, and usually receivers will include information about the network connection inside of this object.

Do we want to duplicate the exact same functionality or something else as well?

I think the fact that we have the ability to spread operations over multiple statements means that we don't have to special-case this kind of addressing. For example, we could have the following: ctx["auth"]["api-token"] instead of needing to write auth.api-token like it would be in the attributes processor.

github-actions · 2024-11-04T03:36:25Z

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

processor/transform: @TylerHelmuth @kentquirk @bogdandrutu @evan-bradley
pkg/ottl: @TylerHelmuth @kentquirk @bogdandrutu @evan-bradley

See Adding Labels via Comments if you do not have permissions to add labels yourself.

TylerHelmuth · 2024-11-15T19:32:07Z

@djaglowski and I talked a bit about this at kubecon. We think we can get away with including the context.Context in the TransformContext and then adding specific paths to each ottl context to support accessing specific things in the context.Context. Something like request.http to get the map off incoming request headers.

djaglowski · 2024-12-02T18:47:49Z

One additional detail which @TylerHelmuth and I discussed about how OTTL paths could refer to contexts:

As mentioned elsewhere in the thread, context values can be added in several different ways and each way requires a different access pattern in code. I recently added support in the routing connector to route based on either grpc or http request context, so there is an example here where we fetch values from the context using two different mechanisms. At the same time, it shows how it may be useful to automatically check all known mechanisms for the value, since data may be received in various ways but then sent to the same component for processing.

Based on these use cases, the idea we came up with that each known mechanism for accessing values in the context would have its own Getter implementation, but that a top-level mechanism would also be available, which would search through all known mechanisms in some pre-defined order. e.g.

request.http["foo"] # gets value of "foo" from go.opentelemetry.io/collector/client.Metadata
request.grpc["foo"] # gets value of "foo" from google.golang.org/grpc/metadata
request["foo"] # gets value of "foo" from google.golang.org/grpc/metadata, but if not found there, then tries go.opentelemetry.io/collector/client.Metadata

ptodev added enhancement New feature or request needs triage New item requiring triage labels May 29, 2024

github-actions bot added the processor/transform Transform processor label May 29, 2024

TylerHelmuth added pkg/ottl and removed needs triage New item requiring triage labels May 29, 2024

ptodev mentioned this issue May 29, 2024

Tempo: Include OpenTelemetry Resource Processor as an extra pipeline for trace configuration grafana/alloy#454

Open

github-actions bot added the Stale label Jul 29, 2024

evan-bradley removed the Stale label Aug 1, 2024

evan-bradley assigned odubajDT Aug 1, 2024

github-actions bot added the Stale label Nov 4, 2024

evan-bradley removed the Stale label Nov 13, 2024

TylerHelmuth mentioned this issue Nov 25, 2024

[ottl] Demonstrate simpler context constructors #36188

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[processor/transform] Set attribute values using connection context #33288

[processor/transform] Set attribute values using connection context #33288

ptodev commented May 29, 2024 •

edited

Loading

github-actions bot commented May 29, 2024

github-actions bot commented May 29, 2024

github-actions bot commented Jul 29, 2024

odubajDT commented Aug 1, 2024

TylerHelmuth commented Aug 6, 2024

evan-bradley commented Aug 7, 2024 •

edited

Loading

evan-bradley commented Aug 7, 2024

TylerHelmuth commented Aug 7, 2024

evan-bradley commented Aug 8, 2024

TylerHelmuth commented Aug 8, 2024

evan-bradley commented Aug 8, 2024

TylerHelmuth commented Aug 8, 2024

evan-bradley commented Aug 8, 2024

TylerHelmuth commented Aug 8, 2024

odubajDT commented Sep 4, 2024

evan-bradley commented Sep 4, 2024

github-actions bot commented Nov 4, 2024

TylerHelmuth commented Nov 15, 2024 •

edited

Loading

djaglowski commented Dec 2, 2024 •

edited

Loading

[processor/transform] Set attribute values using connection context #33288

[processor/transform] Set attribute values using connection context #33288

Comments

ptodev commented May 29, 2024 • edited Loading

Component(s)

Is your feature request related to a problem? Please describe.

Describe the solution you'd like

Describe alternatives you've considered

Additional context

github-actions bot commented May 29, 2024

github-actions bot commented May 29, 2024

github-actions bot commented Jul 29, 2024

odubajDT commented Aug 1, 2024

TylerHelmuth commented Aug 6, 2024

evan-bradley commented Aug 7, 2024 • edited Loading

evan-bradley commented Aug 7, 2024

TylerHelmuth commented Aug 7, 2024

evan-bradley commented Aug 8, 2024

TylerHelmuth commented Aug 8, 2024

evan-bradley commented Aug 8, 2024

TylerHelmuth commented Aug 8, 2024

evan-bradley commented Aug 8, 2024

TylerHelmuth commented Aug 8, 2024

odubajDT commented Sep 4, 2024

evan-bradley commented Sep 4, 2024

github-actions bot commented Nov 4, 2024

TylerHelmuth commented Nov 15, 2024 • edited Loading

djaglowski commented Dec 2, 2024 • edited Loading

ptodev commented May 29, 2024 •

edited

Loading

evan-bradley commented Aug 7, 2024 •

edited

Loading

TylerHelmuth commented Nov 15, 2024 •

edited

Loading

djaglowski commented Dec 2, 2024 •

edited

Loading