OpenTelemetry TraceIdRatioBased sampler requirements following OTEP 235 #4166

jmacd · 2024-07-29T22:36:23Z

Changes

Updates Trace SDK and TraceState handling specifications with OTEP 235 sampling thresholds. This PR depends on #4162 to introduce the concept of Trace Randomness. This PR is the second part of two, it focuses on thresholds.

Revise TraceIdRatioBased algorithm section. The existing TODO implies this is not a breaking change.
Change text about TraceIdRatioBased construction
Move text about TraceIdRatioBased description (leave unmodified).

The content of OTEP 235 was revised for clarity by @kalyanaj in open-telemetry/oteps#261. I've heavily copied from the final text in that still-unmerged OTEP. I introduced new content explaining how to compute thresholds from probabilities with use of variable precision, referring to the OTel Collector-Contrib pkg/sampling reference implementation. The new (Golang) demonstration code is validated here, https://go.dev/play/p/7eLM6FkuoA5.

A proof of concept for this specification along with #4162 can be found in open-telemetry/opentelemetry-go#5645.

Part of #3602.

Product of the Sampling SIG members @kentquirk @kalyanaj @oertl @PeterF778 and myself.

…ng OTEP 235.

specification/trace/tracestate-probability-sampling.md

jmacd · 2024-07-30T15:37:03Z

Feedback from the OTel Spec SIG meeting discussion cc/ @jsuereth:

Please add a migration guide to explain how transitioning samplers will work; in particular, it's not safe to begin using non-root independent sampling until TraceIdRatioBased samplers are replaced everywhere in a trace. Until then, only safe to continue using ParentBased sampling w/ root TraceIdRatioBased decision.

Update: 68fa270

github-actions · 2024-08-07T03:17:34Z

This PR was marked stale due to lack of activity. It will be closed in 7 days.

…ication into jmacd/otep235

specification/trace/tracestate-handling.md

…ication into jmacd/otep235

This reduces the number of lines of diff in PR 4166, which replaces the entire `tracestate-probability-sampling.md` file with new contents. Part of #4166. ## Changes Move a file, place a link to it and explain that a change is in progress.

jmacd · 2024-08-15T14:51:43Z

@kalyanaj @PeterF778 @oertl @kentquirk Please take another look at this PR, especially the file tracestate-probability-sampling.md which now reads as a new file, not as a major rewrite. The contents are derived from open-telemetry/oteps#261.

jmacd · 2024-08-15T14:52:54Z

@open-telemetry/specs-trace-approvers @open-telemetry/specs-approvers @open-telemetry/technical-committee this PR has reached consensus in the Sampling SIG, we have multiple prototypes implemented, and we are looking for final approvals.

specification/trace/sdk.md

specification/trace/tracestate-handling.md

specification/trace/tracestate-probability-sampling.md

Co-authored-by: Tobias Bachert <git@b-privat.de>

…ication into jmacd/otep235

specification/trace/sdk.md

Co-authored-by: Trask Stalnaker <trask.stalnaker@gmail.com>

…ication into jmacd/otep235

…rv sub-key and th sub-key

jmacd · 2024-10-21T21:42:50Z

@PeterF778 @oertl @kentquirk @kalyanaj @yuanyuanzhao3, please take another look.

92876f9
1855839
44c8190

github-actions · 2024-10-29T03:18:51Z

This PR was marked stale due to lack of activity. It will be closed in 7 days.

…ication into jmacd/otep235

jmacd · 2024-10-30T22:19:30Z

~~I've made a small but substantial change based on feedback from @yuanyuanzhao3, see e6dc409~~.

I will place this PR in draft while the Sampling SIG discusses at least one more detail raised by @yuanyuanzhao3, which is (in my words) a concern about the ambiguity between th:0 and unset th. In earlier debates and discussions over this feature, we have discussed a "zero adjusted count" value for the threshold, but we removed this feature when we transitioned from "acceptance threshold" to "rejection threshold". In the rejection threshold formulation, we can't represent zero adjusted count, this will be discussed in tomorrow's Sampling SIG.

PeterF778 · 2024-10-31T04:21:49Z

specification/trace/sdk.md


 #### AlwaysOn

 * Returns `RECORD_AND_SAMPLE` always.
 * Description MUST be `AlwaysOnSampler`.
+* If the incoming TraceState has a valid OpenTelemetry TraceState `th` sub-key, the the returned TraceState is unmodified.


That part would be correct if the AlwaysOnSampler decided to sample by looking at the parent sampled flag. But it is not supposed to. It is expected to always sample, also when the parent was not sampled. Thus th:0 is always correct.

Agreed--
As discussed in the SIG today, we are missing a specification for a ConsistentParentBased sampler. We should probably not use the AlwaysOn sampler in this case.

Assertion: Consistent probability samplers should not inspect the sampled flag.

ConsistentParentBased: when it is invoked in child context, it simply copies the Th value and returns the sampled flag as its decision. If there is a sampled flag and no th value: leave th unset and respect the sampled flag. This is an error case.

I will revert commit e6dc409

Then, I will make a new PR to specify the ConsistentParentBased sampler.

The issue is that span metrics in this service would be skewed.

No, if service B uses AlwaysOnSampler, all of its spans will get sampled (regardless of the sampling decisions for service A upstream). This might mean incomplete traces, but the metrics for service B will be correct with th:0.

…metry#4168) This reduces the number of lines of diff in PR 4166, which replaces the entire `tracestate-probability-sampling.md` file with new contents. Part of open-telemetry#4166. ## Changes Move a file, place a link to it and explain that a change is in progress.

This reverts commit e6dc409.

jsuereth · 2024-11-05T13:04:10Z

specification/trace/sdk.md

+* For root spans, always sample a new context.
+* For child spans, take the decision of the parent context.
+
+By using the ParentBased sampler by default, users can change sampling


Nit: this should be an "aside" or non-normative callout. I'm not sure we have precedence here, but could you move this to some different markdown structure, perhaps > quote ?

jsuereth · 2024-11-05T13:08:06Z

specification/trace/sdk.md

+
+When a TraceIdRatioBased Sampler makes a decision for a non-root Span
+using TraceID randomness, but the Trace random flag was not set, the
+SDK SHOULD issue a one-time warning statement in its log with a


nit: one-time warning - You may need to include criteria of what consistutes a "time", i.e. is it once per line of code, once per span name...

I think this is a bit too vague to implement consistently.

github-actions · 2024-11-13T03:18:07Z

This PR was marked stale due to lack of activity. It will be closed in 7 days.

…ication into jmacd/otep235

jmacd mentioned this pull request Jul 29, 2024

Prototype for W3C Trace Context Level 2 support in TraceIDRatioBased sampler open-telemetry/opentelemetry-go#5645

Closed

OpenTelemetry trace SDK requirements for probability sampling followi…

0524a3d

…ng OTEP 235.

jmacd force-pushed the jmacd/otep235 branch from eb65467 to 0524a3d Compare July 29, 2024 22:57

jmacd marked this pull request as ready for review July 29, 2024 23:24

jmacd requested review from a team July 29, 2024 23:24

github-actions bot assigned jack-berg Jul 29, 2024

jmacd mentioned this pull request Jul 29, 2024

Update 'rv' value generation based on randomness flag + editorial changes to improve clarity open-telemetry/oteps#261

Open

linebreaks

c5453f8

jmacd mentioned this pull request Jul 30, 2024

Rename the experimental probability sampling specification #4168

Merged

jmacd commented Jul 30, 2024

View reviewed changes

specification/trace/tracestate-probability-sampling.md Show resolved Hide resolved

github-actions bot added the Stale label Aug 7, 2024

jmacd mentioned this pull request Aug 7, 2024

Randomness requirements following W3C Trace Context level 2 #4162

Draft

5 tasks

jmacd added 2 commits August 7, 2024 15:13

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

25a61fd

…ication into jmacd/otep235

Add a migration section

68fa270

PeterF778 reviewed Aug 7, 2024

View reviewed changes

specification/trace/tracestate-handling.md Outdated Show resolved Hide resolved

github-actions bot removed the Stale label Aug 8, 2024

jmacd added 2 commits August 15, 2024 07:18

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

51f9794

…ication into jmacd/otep235

lowercase hex

ba5a47b

jmacd added 3 commits August 15, 2024 07:46

spec-compliance-matrix.md

49673b7

merge w/ removed file

e51bea6

chlog

4afe1c7

kalyanaj reviewed Aug 15, 2024

View reviewed changes

kentquirk approved these changes Aug 20, 2024

View reviewed changes

specification/trace/tracestate-probability-sampling.md Show resolved Hide resolved

jmacd and others added 5 commits October 4, 2024 13:55

more overview

6e29b0e

nuance

77b51f8

Update specification/trace/tracestate-probability-sampling.md

a61fbdd

Co-authored-by: Tobias Bachert <git@b-privat.de>

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

59c329d

…ication into jmacd/otep235

Merge branch 'jmacd/otep235' of github.com:jmacd/opentelemetry-specif…

d21f341

…ication into jmacd/otep235

trask reviewed Oct 11, 2024

View reviewed changes

specification/trace/sdk.md Outdated Show resolved Hide resolved

specification/trace/sdk.md Outdated Show resolved Hide resolved

jmacd and others added 5 commits October 16, 2024 16:51

Apply suggestions from code review

4e05267

Co-authored-by: Trask Stalnaker <trask.stalnaker@gmail.com>

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

d65ea09

…ication into jmacd/otep235

Use consistent terminology with 4162, e.g., OpenTelemetry TraceState …

92876f9

…rv sub-key and th sub-key

Specify a compatibility warning for transition

1855839

asymmetrical

44c8190

TOC

66d190f

github-actions bot added the Stale label Oct 29, 2024

jmacd added 2 commits October 30, 2024 15:13

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

0aacc19

…ication into jmacd/otep235

AlwaysOn should respect sampling threshold

e6dc409

github-actions bot removed the Stale label Oct 31, 2024

PeterF778 reviewed Oct 31, 2024

View reviewed changes

Revert "AlwaysOn should respect sampling threshold"

c75a010

This reverts commit e6dc409.

jmacd marked this pull request as draft November 1, 2024 20:52

jsuereth approved these changes Nov 5, 2024

View reviewed changes

github-actions bot added the Stale label Nov 13, 2024

jmacd removed the Stale label Nov 13, 2024

jmacd added 2 commits November 13, 2024 13:07

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

87fb314

…ication into jmacd/otep235

do not change AlwaysOnSampler spec

f3693fc

jmacd mentioned this pull request Nov 13, 2024

Consistent delegating sampler, alternative to ParentBased #4294

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpenTelemetry TraceIdRatioBased sampler requirements following OTEP 235 #4166

OpenTelemetry TraceIdRatioBased sampler requirements following OTEP 235 #4166

jmacd commented Jul 29, 2024 •

edited

Loading

jmacd commented Jul 30, 2024 •

edited

Loading

github-actions bot commented Aug 7, 2024

jmacd commented Aug 15, 2024

jmacd commented Aug 15, 2024

jmacd commented Oct 21, 2024

github-actions bot commented Oct 29, 2024

jmacd commented Oct 30, 2024 •

edited

Loading

PeterF778 Oct 31, 2024

jmacd Oct 31, 2024

yuanyuanzhao3 Oct 31, 2024

PeterF778 Oct 31, 2024

jsuereth Nov 5, 2024

jsuereth Nov 5, 2024

github-actions bot commented Nov 13, 2024

OpenTelemetry TraceIdRatioBased sampler requirements following OTEP 235 #4166

Are you sure you want to change the base?

OpenTelemetry TraceIdRatioBased sampler requirements following OTEP 235 #4166

Conversation

jmacd commented Jul 29, 2024 • edited Loading

Changes

jmacd commented Jul 30, 2024 • edited Loading

github-actions bot commented Aug 7, 2024

jmacd commented Aug 15, 2024

jmacd commented Aug 15, 2024

jmacd commented Oct 21, 2024

github-actions bot commented Oct 29, 2024

jmacd commented Oct 30, 2024 • edited Loading

PeterF778 Oct 31, 2024

Choose a reason for hiding this comment

jmacd Oct 31, 2024

Choose a reason for hiding this comment

yuanyuanzhao3 Oct 31, 2024

Choose a reason for hiding this comment

PeterF778 Oct 31, 2024

Choose a reason for hiding this comment

jsuereth Nov 5, 2024

Choose a reason for hiding this comment

jsuereth Nov 5, 2024

Choose a reason for hiding this comment

github-actions bot commented Nov 13, 2024

jmacd commented Jul 29, 2024 •

edited

Loading

jmacd commented Jul 30, 2024 •

edited

Loading

jmacd commented Oct 30, 2024 •

edited

Loading