Composite propagators: Clarify the behavior of multiple extractor #496

rghetia · 2020-02-28T19:36:46Z

In a composite propagator there can be more than one Extractor for trace context. The specification doe not indicate how to handle extraction from two different propagators if both of them extracts valid trace context. In current implementation in Go the second extractor overwrites the first. So the last one wins. There are two problems with this.

The Last trace context is invalid and the first trace context is valid - Even though the first context is valid it will not use it.
Both of them are valid - The last context will be used but that may not be counter intuitive to the user. Also it involves unnecessary processing. Extracting is not cheap.

dyladan · 2020-02-28T21:03:32Z

IMO propagators should be validating extracted contexts to ensure they are valid before setting them on the context. I could see this being important for use cases where the propagation format spec changes however. For instance, if Trace Context releases a version 2, you may want to have a v1 propagator as a fallback if the v2 propagator fails or vice versa.

Flarna · 2020-03-02T07:23:46Z

Maybe we should pass any extracted SpanContext down to the next propagators in chain to allow them to decide if they should extract and overwrite,...?

Oberon00 · 2020-03-02T15:47:48Z

I think this already happens implicitly because each propagator gets as input the context returned by the previous one. However, there is no way to know for the propagator whether any value left there was set by a another composite propagator component or is the value from the parent context, which should be overwritten.

I think you should only add orthogonal components to an aggregate propagator. A new special ChainedPropagator API is probably required to coordinate multiple propagators competing for the same Context key.

carlosalberto · 2020-03-02T17:31:58Z

So in theory there should only be a propagator for each cross-cutting concern (tracing, correlation context), so they work on only one 'slot' of the Context.

What you mention has been known as 'fallback' propagators, as a stack-alike Propagator that contains different propagators for the same concern. An example would be a fallback Propagator that contains both a TraceContext and a B3 propagator, acting as a single propagator.

In such case, it would work like:

extract(ctx, carrier)
  for propagator in contained_propagators
    new_ctx = propagator.extract(ctx, carrier)
    if tracing_api.get_span_context(new_ctx) is not null
      return new_ctx
...

tracing_propagator = create_fallback_propagator(trace_context_propagator, b3propagator)

I'm wondering if we should add a section on this case (probably yes).

Flarna · 2020-03-02T17:42:37Z

As setting of propagators is possible via API which may have a lot independent clients it's maybe hard to ensure that there are no duplicates. e.g. HttpTextPropagator seem to be reused in grpc (and maybe other protocols having text header support) so each OTel plugin may add one instance of such a processor.

rghetia · 2020-03-02T18:13:06Z

A composite extractor with a policy would work to satisfy all use cases. For example,

type ExtractPolicy int

const {
    StopOnValid
    ExtractAll
}

type struct CompositeExtractorWithPolicy {
    extractors []*extractor
    policy ExtractPolicy
    extractValdiator func(context.Context) bool
}
func (cewp *CompositeExtractorPolicy) Extract(ctx context.Context, supplier HTTPSupplier) (context.Context) {
    for extractor := range cewp.extractors {
       newCtx := extractor.Extract(ctx, supplier)
       valid := cewp.extractValidator(newCtx)
       if cewp.policy == StopOnValid {
          if valid {
            return newCtx
          }
       }
    }
}

Flarna · 2020-03-02T18:17:30Z

@rghetia But wouldn't this result in a propagator for one concern may stops extraction of another, unrelated concern?

rghetia · 2020-03-02T18:18:29Z

@rghetia But wouldn't this result in a propagator for one concern may stops extraction of another, unrelated concern?

you just supply two sets for propagator with policy, one for each concern.

pavolloffay · 2020-05-26T12:21:15Z

We know that only a single context/key can be extracted. The precedence order should be configurable by the user of the system/SDK. Then if the extraction fails for any reason the SDK could "try again" with the next extractor.

carlosalberto · 2020-05-26T13:17:35Z

I'm doing related work around this, so will take it from here.

carlosalberto · 2020-06-17T16:37:14Z

Please see open-telemetry/opentelemetry-java#1339 as a prototype for this.

A new StackPropagator is added (name to be refined) as a custom propagator that takes a list of Trace propagators and, upon extraction, stops when the first successful operation happens, else trying with the next propagator.

Two questions arise:

Do we want this as a general purpose Propagator? I see us only needing this for the many existing tracing formats, but if we need to have a general purpose one, we should provide a policy/function (as the one Rahul mentions) to let the propagator know when to stop iterating over the propagators.
To simplify things, upon failed extraction the propagators won't set an invalid/nil/empty value in Context - this saves us a few allocations happening under failed extraction, but also may allow 'leaking' of lingering, old active Span instances - but this is an error in itself anyway.

Thoughts? cc @dyladan @pavolloffay @Oberon00

carlosalberto · 2020-07-02T00:08:42Z

Now that #671 is merged, wondering whether we should close this issue, or should we add such MultiPropagator to the Specification? @Oberon00 @dyladan

Oberon00 · 2020-07-07T07:47:52Z

I think #671 solved the issue I pointed out in my comment #496 (comment). If we think a MultiPropagator is something that is generally useful, I think we should add it to the spec. Trying to specify it in detail might surface more issues.

carlosalberto · 2020-07-10T14:47:32Z

@Oberon00 For the record: I think specifing a MultiPropagator for tracing should be the way to start, given that this multiple-format support seems to be a tracing-only issue (for now). Any opinion on that?

Oberon00 · 2020-07-10T15:58:31Z

I'm not sure how the concrete concern plays a role when defining a multi-propagator? I would think of something like

class MultiHttpTextPropagator implements HttpTextPropagator {
  public MultiHttpTextPropagator(Collection<HttpTextPropagator> extractors, Collection<HttpTextPropagator> injectors) { ... };

  @Override
  public <Carrier> void extract(Context target, Carrier c, Getter<Carrier> g) {...} // Call all extractors (unconditionally)
  @Override
  public <Carrier> void inject(Context target, Carrier c, Setter<Carrier> s) {...} // Call all injectors (unconditionally)
}

If we want to not try all extractors everytime we'd just need to pass some Predicate<Context> (or function from old context + new context -> boolean) to the constructor that tells us whether to continue.

andrewhsu · 2020-09-03T14:52:54Z

Having a look at this P1 issue with @carlosalberto, should this really be a P2 because it looks like an addition to the API, not a breaking change that should go in earlier? Thinks java is the only one that has this atm.

Oberon00 · 2020-09-03T14:57:32Z

See my comment above #496 (comment):

If we think a MultiPropagator is something that is generally useful, I think we should add it to the spec. Trying to specify it in detail might surface more issues.

Those "more issues" might require breaking changes to fix them.

andrewhsu · 2020-09-09T16:00:42Z

From the issue triage mtg today, talked with @carlosalberto and looks like with the PR #671 this issue can now be bumped down to P2 which means it is not necessarily a blocking change for freezing trace api.

jkwatson · 2020-09-14T17:06:07Z

I'm confused if a MultiPropagator is different than the Composite Propagator already in the spec. Is this an additional thing, or a clarification of the existing one?

Oberon00 · 2020-09-14T17:10:58Z

I think it's about multiple propagators for the same concern. E.g. B3+W3C trace propagators at once.

jkwatson · 2020-09-14T17:35:23Z

Ah, so Composite is multiple-concerns and Multi is multiple for a single concern. Got it.

aabmass · 2021-05-13T17:23:13Z

Looking at the Java TraceMultiPropagator, I think there is still an issue.

Upon extraction, this propagator invokes HttpTextFormat#extract() for every registered
trace propagator, returning immediately when a successful extraction happened.

Not all span context propagators encode the full W3C TraceContext, for example Jaeger format doesn't encode tracestate. If the Jaeger propagator extracted first and you immediately return, the tracestate won't be propagated on.

Also, what if the trace flags are not matching e.g. a load balancer before your service decides to set the sampled flag, but only sets it on the Jaeger header. Then if the Jaeger propagator is configured in TraceMultiPropagator to run after the W3C propagator, the sampled flag is lost.

Maybe the TraceMultiPropagator should merge all the resulting span contexts if they have the same trace ID + span ID?

jkwatson · 2021-05-13T17:29:18Z

Looking at the Java TraceMultiPropagator, I think there is still an issue.

Upon extraction, this propagator invokes HttpTextFormat#extract() for every registered
trace propagator, returning immediately when a successful extraction happened.

Not all span context propagators encode the full W3C TraceContext, for example Jaeger format doesn't encode tracestate. If the Jaeger propagator extracted first and you immediately return, the tracestate won't be propagated on.

Also, what if the trace flags are not matching e.g. a load balancer before your service decides to set the sampled flag, but only sets it on the Jaeger header. Then if the Jaeger propagator is configured in TraceMultiPropagator to run after the W3C propagator, the sampled flag is lost.

Maybe the TraceMultiPropagator should merge all the resulting span contexts if they have the same trace ID + span ID?

TraceMultiPropagator doesn't exist in the 1.x run of Otel java. You want to look at this: https://github.com/open-telemetry/opentelemetry-java/blob/main/context/src/main/java/io/opentelemetry/context/propagation/MultiTextMapPropagator.java#L57-L67

aabmass · 2021-05-13T17:53:16Z

That just looks like the regular Composite Propagator mentioned in the spec. That has the issue of overwriting still, right?

jkwatson · 2021-05-13T18:17:19Z

That just looks like the regular Composite Propagator mentioned in the spec. That has the issue of overwriting still, right?

Yes. I just wanted to make sure that the old thing you pointed at wasn't a relevant part of the conversation, since it was deleted before 1.0.0 was released.

austinlparker · 2024-04-23T20:43:15Z

If this issue is still relevant, please open a new issue with more detail.

mayurkale22 mentioned this issue Feb 28, 2020

feat: add composite propagator open-telemetry/opentelemetry-js#821

Merged

rghetia mentioned this issue Feb 28, 2020

ExtractHTTP should stop after extracting first valid trace context open-telemetry/opentelemetry-go#501

Closed

This was referenced May 21, 2020

PROPAGATORS=b3,tracecontext has confusing behavior open-telemetry/opentelemetry-java-instrumentation#425

Closed

Document span context extraction precedence in multi format open-telemetry/opentelemetry-java#1273

Closed

carlosalberto self-assigned this May 26, 2020

carlosalberto mentioned this issue Jun 17, 2020

MultiTracePropagator implementation open-telemetry/opentelemetry-java#1339

Merged

carlosalberto mentioned this issue Jun 24, 2020

Do not write a value in Context upon failed extraction. #671

Merged

carlosalberto added area:api Cross language API specification issue spec:context Related to the specification/context directory labels Jun 30, 2020

carlosalberto added the release:required-for-ga Must be resolved before GA release, or nice to have before GA label Jul 2, 2020

Oberon00 mentioned this issue Jul 7, 2020

Configuring propagators by environment variable #680

Closed

carlosalberto added the priority:p1 Highest priority level label Jul 24, 2020

andrewhsu added priority:p2 Medium priority level and removed priority:p1 Highest priority level labels Sep 9, 2020

Oberon00 mentioned this issue Sep 24, 2020

Document precedence order is multiple context propagators are used open-telemetry/opentelemetry-java#1698

Merged

andrewhsu added release:after-ga Not required before GA release, and not going to work on before GA and removed priority:p2 Medium priority level release:required-for-ga Must be resolved before GA release, or nice to have before GA labels Sep 25, 2020

iNikem mentioned this issue Apr 6, 2021

Allow multiple context propagators open-telemetry/opentelemetry-dotnet-instrumentation#104

Merged

aabmass mentioned this issue May 13, 2021

Propagator return original context if failed to extract GoogleCloudPlatform/opentelemetry-operations-python#139

Merged

austinlparker added the triage:rejected:declined label Apr 23, 2024

austinlparker unassigned carlosalberto Apr 23, 2024

austinlparker closed this as completed Apr 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Composite propagators: Clarify the behavior of multiple extractor #496

Composite propagators: Clarify the behavior of multiple extractor #496

rghetia commented Feb 28, 2020 •

edited

Loading

dyladan commented Feb 28, 2020

Flarna commented Mar 2, 2020

Oberon00 commented Mar 2, 2020

carlosalberto commented Mar 2, 2020

Flarna commented Mar 2, 2020

rghetia commented Mar 2, 2020

Flarna commented Mar 2, 2020

rghetia commented Mar 2, 2020

pavolloffay commented May 26, 2020

carlosalberto commented May 26, 2020

carlosalberto commented Jun 17, 2020

carlosalberto commented Jul 2, 2020

Oberon00 commented Jul 7, 2020

carlosalberto commented Jul 10, 2020

Oberon00 commented Jul 10, 2020 •

edited

Loading

andrewhsu commented Sep 3, 2020

Oberon00 commented Sep 3, 2020

andrewhsu commented Sep 9, 2020

jkwatson commented Sep 14, 2020

Oberon00 commented Sep 14, 2020

jkwatson commented Sep 14, 2020

aabmass commented May 13, 2021

jkwatson commented May 13, 2021

aabmass commented May 13, 2021

jkwatson commented May 13, 2021

austinlparker commented Apr 23, 2024

Composite propagators: Clarify the behavior of multiple extractor #496

Composite propagators: Clarify the behavior of multiple extractor #496

Comments

rghetia commented Feb 28, 2020 • edited Loading

dyladan commented Feb 28, 2020

Flarna commented Mar 2, 2020

Oberon00 commented Mar 2, 2020

carlosalberto commented Mar 2, 2020

Flarna commented Mar 2, 2020

rghetia commented Mar 2, 2020

Flarna commented Mar 2, 2020

rghetia commented Mar 2, 2020

pavolloffay commented May 26, 2020

carlosalberto commented May 26, 2020

carlosalberto commented Jun 17, 2020

carlosalberto commented Jul 2, 2020

Oberon00 commented Jul 7, 2020

carlosalberto commented Jul 10, 2020

Oberon00 commented Jul 10, 2020 • edited Loading

andrewhsu commented Sep 3, 2020

Oberon00 commented Sep 3, 2020

andrewhsu commented Sep 9, 2020

jkwatson commented Sep 14, 2020

Oberon00 commented Sep 14, 2020

jkwatson commented Sep 14, 2020

aabmass commented May 13, 2021

jkwatson commented May 13, 2021

aabmass commented May 13, 2021

jkwatson commented May 13, 2021

austinlparker commented Apr 23, 2024

rghetia commented Feb 28, 2020 •

edited

Loading

Oberon00 commented Jul 10, 2020 •

edited

Loading