Need a detailed description of the rate_sampling policy under the tailSampling processor #35419

DraegoG · 2024-09-25T13:52:58Z

Component(s)

processor/tailsampling

Describe the issue you're reporting

While reading the README of the Tail Sampling Processor, I could not work out what the use case of the rate_sampling policy is or how it works internally.
I found a discussion in #1797 that mentions the same problem, but I do not see a conclusion there.

Can someone please explain how the rate_sampling policy is meant to be used? (We can add the explanation to the README as well.)

I also think the README should explain the use of each policy in more detail and list the values supported by their parameters. For example, the ottl_condition policy has a parameter called error_mode, but its supported values are not explained anywhere.

github-actions · 2024-09-25T13:57:08Z

Pinging code owners:

processor/tailsampling: @jpkrohling

See Adding Labels via Comments if you do not have permissions to add labels yourself.

github-actions · 2024-11-25T03:37:51Z

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

processor/tailsampling: @jpkrohling

See Adding Labels via Comments if you do not have permissions to add labels yourself.

atoulme · 2024-12-07T00:17:14Z

@jpkrohling please review?

jpkrohling · 2024-12-09T12:16:41Z

rate_limiting limits the number of spans that get sampled. It only considers the number of spans already let through in the current time frame (one second, at the moment) and drops everything else once that limit has been reached. You can use it together with the AND sampler (policy type and) to say, for example, that you want at most 10k spans/second for a specific noisy service, while retaining all other spans for less noisy services.

    policies: # this acts as an "or": if at least one policy has a SAMPLE decision, we sample
      [
        {
          name: rate-limited-tenant,
          type: and,
          and:
            {
              and_sub_policy:
                [
                  {
                    name: tenant-noisy,
                    type: string_attribute,
                    string_attribute: { key: tenant, values: ["noisy"] },
                  },
                  {
                    name: rate-limit-noisy,
                    type: rate_limiting,
                    rate_limiting: { spans_per_second: 100 },
                  },
                ],
            },
        },
      ]

I'm adding a summary of this to the readme as well.
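The per-second budget described above can be pictured with a small Go sketch. This is an illustration only, not the processor's source: the `rateLimiter` type, its fields, the `Evaluate` signature, and the choice to treat the limit as inclusive are all assumptions made for the example.

```go
package main

// Decision is a hypothetical stand-in for a policy's verdict on one trace.
type Decision int

const (
	NotSampled Decision = iota
	Sampled
)

// rateLimiter keeps a span budget for the current one-second window.
// All names are illustrative and not taken from the processor's code.
type rateLimiter struct {
	spansPerSecond int64 // budget per window
	currentSecond  int64 // Unix second the counter belongs to
	spansInSecond  int64 // spans already let through in that second
}

// Evaluate receives the current Unix second and the span count of a trace.
// The clock is passed in so the behaviour is easy to reason about and test.
func (r *rateLimiter) Evaluate(nowUnix, spanCount int64) Decision {
	// A new second starts a fresh budget.
	if nowUnix != r.currentSecond {
		r.currentSecond = nowUnix
		r.spansInSecond = 0
	}
	// Assumption: a trace is dropped if it would push the window over the limit.
	if r.spansInSecond+spanCount > r.spansPerSecond {
		return NotSampled
	}
	r.spansInSecond += spanCount
	return Sampled
}
```

With `spansPerSecond: 100`, a 60-span trace is sampled, a second 60-span trace in the same second is dropped, and the budget is available again in the next second.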


mugli · 2024-12-12T18:50:54Z

@jpkrohling One question about rate_limiting that wasn't clear to me from the docs (or from reading the unit tests): is there any unintended effect from using this policy at the top level (without an and policy)?

I assume it limits the maximum rate of spans after all other top-level policies have been evaluated. But does it also do the reverse, that is, decide to sample a (potentially uninteresting) trace because the current rate is still below the limit, even when all the other policies decide not to sample it?
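To make the question concrete, here is a toy Go model of the two setups. It is not the processor's code and does not settle how the processor behaves. It assumes that top-level policies combine as an "or" (as the comment in the config above says), that an and policy stops at the first sub-policy that does not sample, and that a rate limiter votes to sample while it is under budget. `Policy`, `Trace`, `tenantIs`, `budget`, `allOf`, and `anyOf` are hypothetical names.

```go
package main

// Decision is a hypothetical stand-in for a policy's verdict on one trace.
type Decision int

const (
	NotSampled Decision = iota
	Sampled
)

// Trace carries only what this toy model needs.
type Trace struct {
	Tenant string
	Spans  int64
}

// Policy is a hypothetical evaluation function.
type Policy func(t Trace) Decision

// tenantIs samples traces whose tenant matches name.
func tenantIs(name string) Policy {
	return func(t Trace) Decision {
		if t.Tenant == name {
			return Sampled
		}
		return NotSampled
	}
}

// budget samples until limit spans are used up (single window, no clock).
func budget(limit int64) Policy {
	var used int64
	return func(t Trace) Decision {
		if used+t.Spans > limit {
			return NotSampled
		}
		used += t.Spans
		return Sampled
	}
}

// allOf models "and": it stops at the first sub-policy that does not sample,
// so a budget placed after a filter is only charged for matching traces.
func allOf(subs ...Policy) Policy {
	return func(t Trace) Decision {
		for _, p := range subs {
			if p(t) != Sampled {
				return NotSampled
			}
		}
		return Sampled
	}
}

// anyOf models the top-level "or": every policy votes, one Sampled is enough.
func anyOf(policies ...Policy) Policy {
	return func(t Trace) Decision {
		result := NotSampled
		for _, p := range policies {
			if p(t) == Sampled {
				result = Sampled
			}
		}
		return result
	}
}
```

Under these assumptions, `anyOf(tenantIs("vip"), budget(100))` samples a small trace from any tenant while budget remains, which is the "reverse" effect asked about, whereas `anyOf(allOf(tenantIs("noisy"), budget(100)))` only ever samples traces from the noisy tenant.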

…pen-telemetry#36721) Fixes open-telemetry#35419 Signed-off-by: Juraci Paixão Kröhling <juraci@kroehling.de> Signed-off-by: Juraci Paixão Kröhling <juraci@kroehling.de> Co-authored-by: Curtis Robert <crobert@splunk.com>

DraegoG added the needs triage label Sep 25, 2024

github-actions bot added the processor/tailsampling label Sep 25, 2024

DraegoG mentioned this issue Sep 25, 2024

Discuss the possibility of deprecating the tail-based sampling processor #1797

Closed

crobert-1 added the documentation label Sep 25, 2024


github-actions bot added the Stale label Nov 25, 2024


atoulme added the question and waiting-for-code-owners labels and removed the Stale and needs triage labels Dec 7, 2024

jpkrohling self-assigned this Dec 9, 2024

jpkrohling removed the waiting-for-code-owners label Dec 9, 2024

jpkrohling added a commit to jpkrohling/opentelemetry-collector-contrib that referenced this issue Dec 9, 2024

[chore][processor/tailsampling] Better description for rate_limiting

ff36081

Fixes open-telemetry#35419 Signed-off-by: Juraci Paixão Kröhling <juraci@kroehling.de>

jpkrohling mentioned this issue Dec 9, 2024

[chore][processor/tailsampling] Better description for rate_limiting #36721

Merged

codeboten closed this as completed in #36721 Jan 8, 2025

codeboten closed this as completed in e57a0d9 Jan 8, 2025
