improve trace_context performance #4721

xiehuc · 2023-11-16T03:03:33Z

Propagation runs on every request, whether or not the request is sampled. The previous implementation of Propagation did not prioritize memory management and allocated a lot of temporary heap memory. This made it inefficient and, in some scenarios, wasted too much CPU.

For example, even when a request is not sampled, startServerSpan consumes 25% of the CPU time for the entire request.

Typical server span Propagation (profile image not included):

Typical client span Propagation (profile image not included):

The profiles show that the poor performance of Propagation is mainly due to regular expressions and fmt.Sprintf.

This PR significantly improves Propagation performance in the following ways:

* Use strings.Builder with pre-allocated memory
* Use hex.Decode into a stack-based [N]byte array instead of hex.DecodeString, which allocates a temporary heap-based byte slice
* Use strings.Cut instead of strings.Split
* Avoid using regex
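
As a rough illustration of the first two items on the injection side, here is a minimal sketch (not the code in this PR) that builds a W3C `traceparent` value with a pre-sized strings.Builder and hex.Encode into a fixed-size buffer. The name `formatTraceparent` and its parameters are hypothetical and exist only for this example.

```go
package main

import (
	"encoding/hex"
	"fmt"
	"strings"
)

// formatTraceparent is a hypothetical helper, not the function used in the PR.
// It renders "00-<trace-id>-<span-id>-<flags>" without fmt.Sprintf.
func formatTraceparent(traceID [16]byte, spanID [8]byte, flags byte) string {
	var sb strings.Builder
	// version(2) + '-' + trace-id(32) + '-' + span-id(16) + '-' + flags(2) = 55 bytes.
	sb.Grow(2 + 1 + 32 + 1 + 16 + 1 + 2)

	// One fixed-size scratch buffer is reused for every hex field.
	var buf [32]byte

	sb.WriteString("00-")
	hex.Encode(buf[:32], traceID[:])
	sb.Write(buf[:32])
	sb.WriteByte('-')
	hex.Encode(buf[:16], spanID[:])
	sb.Write(buf[:16])
	sb.WriteByte('-')
	flagByte := [1]byte{flags}
	hex.Encode(buf[:2], flagByte[:])
	sb.Write(buf[:2])
	return sb.String()
}

func main() {
	traceID := [16]byte{0x4b, 0xf9, 0x2f, 0x35, 0x77, 0xb3, 0x4d, 0xa6,
		0xa3, 0xce, 0x92, 0x9d, 0x0e, 0x0e, 0x47, 0x36}
	spanID := [8]byte{0x00, 0xf0, 0x67, 0xaa, 0x0b, 0xa9, 0x02, 0xb7}
	fmt.Println(formatTraceparent(traceID, spanID, 0x01))
}
```

Because the final length is known up front, the single Grow call should leave the returned string as the only buffer the builder needs; whether the scratch buffer stays on the stack is up to the compiler's escape analysis.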

All unit tests pass. Benchmarks:

New logic:

goos: linux
goarch: amd64
pkg: go.opentelemetry.io/otel/propagation
cpu: Intel(R) Xeon(R) Platinum 8255C CPU @ 2.50GHz
BenchmarkInject/SampledSpanContext-16            3489816               344.4 ns/op            96 B/op          3 allocs/op
BenchmarkInject/WithoutSpanContext-16           19206630                60.12 ns/op           16 B/op          1 allocs/op
BenchmarkExtract/Sampled-16                      2094048               576.1 ns/op           160 B/op          4 allocs/op
BenchmarkExtract/BogusVersion-16                 8635071               137.9 ns/op            16 B/op          1 allocs/op
BenchmarkExtract/FutureAdditionalData-16         2073709               577.8 ns/op           160 B/op          4 allocs/op
PASS
ok      go.opentelemetry.io/otel/propagation    7.692s

Old logic:

goos: linux
goarch: amd64
pkg: go.opentelemetry.io/otel/propagation
cpu: Intel(R) Xeon(R) Platinum 8255C CPU @ 2.50GHz
BenchmarkInject/SampledSpanContext-16            1289794               926.0 ns/op           224 B/op         11 allocs/op
BenchmarkInject/WithoutSpanContext-16           18479284                60.12 ns/op           16 B/op          1 allocs/op
BenchmarkExtract/Sampled-16                       760831              1423 ns/op             320 B/op          6 allocs/op
BenchmarkExtract/BogusVersion-16                 6141056               191.4 ns/op            16 B/op          1 allocs/op
BenchmarkExtract/FutureAdditionalData-16          704563              1586 ns/op             320 B/op          6 allocs/op
PASS
ok      go.opentelemetry.io/otel/propagation    6.938s

The sampled Inject and Extract cases are roughly 2.5 to 2.7 times faster, with about half the bytes allocated per operation.

* using strings.Builder instead of fmt.Sprint
* using strings.Cut instead of strings.Split
* using hex.Decode instead of hex.DecodeString
* avoid using regexp

linux-foundation-easycla · 2023-11-16T03:03:38Z

The committers listed above are authorized under a signed CLA.

✅ login: xiehuc / name: xiehuc (88c89b1, 960afe9, 0746142, 46f8df2, 6ea74ff, 770fb29)
✅ login: hanyuancheung / name: Chester Cheung (8769f2f, b822a32)
✅ login: pellared / name: Robert Pająk (685e13a)

codecov · 2023-11-16T03:11:54Z

Codecov Report

Merging #4721 (770fb29) into main (204be61) will increase coverage by 0.1%.
The diff coverage is 100.0%.

Additional details and impacted files

@@           Coverage Diff           @@
##            main   #4721     +/-   ##
=======================================
+ Coverage   81.8%   81.9%   +0.1%     
=======================================
  Files        224     224             
  Lines      18113   18116      +3     
=======================================
+ Hits       14817   14847     +30     
+ Misses      3000    2982     -18     
+ Partials     296     287      -9

| Files | Coverage Δ |
|---|---|
| propagation/trace_context.go | `96.6% <100.0%> (+31.1%)` ⬆️ |

hanyuancheung

LGTM👍

MrAlias · 2023-11-16T16:22:57Z

Similar to #4722, please use benchstat to make comparisons between benchmarks in your description.

Also, the increase in allocations is concerning. The impact on GC is not captured in the benchmarks but will be when it is run on real systems.

xiehuc · 2023-11-17T01:52:51Z

> Similar to #4722, please use benchstat to make comparisons between benchmarks in your description.
>
> Also, the increase in allocations is concerning. The impact on GC is not captured in the benchmarks but will be when it is run on real systems.

The new allocations are on the stack, so there is less GC pressure than with the old implementation.

xiehuc · 2023-11-17T02:07:23Z

goos: linux
goarch: amd64
pkg: go.opentelemetry.io/otel/propagation
cpu: Intel(R) Xeon(R) Platinum 8255C CPU @ 2.50GHz
                                │ /tmp/old.txt │            /tmp/new.txt             │
                                │    sec/op    │   sec/op     vs base                │
Inject/SampledSpanContext-16       916.0n ± 0%   341.2n ± 0%  -62.75% (p=0.000 n=10)
Inject/WithoutSpanContext-16       61.79n ± 1%   59.72n ± 1%   -3.35% (p=0.000 n=10)
Extract/Sampled-16                1398.0n ± 0%   574.8n ± 0%  -58.88% (p=0.000 n=10)
Extract/BogusVersion-16            190.4n ± 0%   138.6n ± 0%  -27.23% (p=0.000 n=10)
Extract/FutureAdditionalData-16   1553.0n ± 0%   574.8n ± 1%  -62.99% (p=0.000 n=10)
geomean                            471.9n        247.7n       -47.50%

                                │ /tmp/old.txt │             /tmp/new.txt             │
                                │     B/op     │    B/op     vs base                  │
Inject/SampledSpanContext-16       224.00 ± 0%   96.00 ± 0%  -57.14% (p=0.000 n=10)
Inject/WithoutSpanContext-16        16.00 ± 0%   16.00 ± 0%        ~ (p=1.000 n=10) ¹
Extract/Sampled-16                  320.0 ± 0%   160.0 ± 0%  -50.00% (p=0.000 n=10)
Extract/BogusVersion-16             16.00 ± 0%   16.00 ± 0%        ~ (p=1.000 n=10) ¹
Extract/FutureAdditionalData-16     320.0 ± 0%   160.0 ± 0%  -50.00% (p=0.000 n=10)
geomean                             89.90        57.51       -36.03%
¹ all samples are equal

                                │ /tmp/old.txt │             /tmp/new.txt             │
                                │  allocs/op   │ allocs/op   vs base                  │
Inject/SampledSpanContext-16       11.000 ± 0%   3.000 ± 0%  -72.73% (p=0.000 n=10)
Inject/WithoutSpanContext-16        1.000 ± 0%   1.000 ± 0%        ~ (p=1.000 n=10) ¹
Extract/Sampled-16                  6.000 ± 0%   4.000 ± 0%  -33.33% (p=0.000 n=10)
Extract/BogusVersion-16             1.000 ± 0%   1.000 ± 0%        ~ (p=1.000 n=10) ¹
Extract/FutureAdditionalData-16     6.000 ± 0%   4.000 ± 0%  -33.33% (p=0.000 n=10)
geomean                             3.308        2.169       -34.43%
¹ all samples are equal

xiehuc · 2023-11-17T02:09:41Z

Original benchmark results:
old.txt
new.txt

xiehuc · 2023-11-17T02:10:03Z

@MrAlias please check again, thanks

MrAlias · 2023-11-17T15:24:40Z

It seems like I was confusing old vs new in my last comment. The benchstat output was helpful in clarifying this.

xiehuc · 2023-11-18T01:37:09Z

@MrAlias all suggestions have been addressed, please check again, thanks

xiehuc · 2023-11-29T02:20:55Z

Benchmark results for the latest code:

goos: linux
goarch: amd64
pkg: go.opentelemetry.io/otel/propagation
cpu: Intel(R) Xeon(R) Platinum 8255C CPU @ 2.50GHz
                                │ /tmp/old.txt │            /tmp/new2.txt            │
                                │    sec/op    │   sec/op     vs base                │
Inject/SampledSpanContext-16       916.0n ± 0%   335.7n ± 1%  -63.35% (p=0.000 n=10)
Inject/WithoutSpanContext-16       61.79n ± 1%   60.83n ± 0%   -1.55% (p=0.000 n=10)
Extract/Sampled-16                1398.0n ± 0%   586.4n ± 1%  -58.05% (p=0.000 n=10)
Extract/BogusVersion-16            190.4n ± 0%   137.0n ± 0%  -28.07% (p=0.000 n=10)
Extract/FutureAdditionalData-16   1553.0n ± 0%   579.0n ± 0%  -62.72% (p=0.000 n=10)
geomean                            471.9n        248.6n       -47.32%

                                │ /tmp/old.txt │            /tmp/new2.txt             │
                                │     B/op     │    B/op     vs base                  │
Inject/SampledSpanContext-16       224.00 ± 0%   96.00 ± 0%  -57.14% (p=0.000 n=10)
Inject/WithoutSpanContext-16        16.00 ± 0%   16.00 ± 0%        ~ (p=1.000 n=10) ¹
Extract/Sampled-16                  320.0 ± 0%   160.0 ± 0%  -50.00% (p=0.000 n=10)
Extract/BogusVersion-16             16.00 ± 0%   16.00 ± 0%        ~ (p=1.000 n=10) ¹
Extract/FutureAdditionalData-16     320.0 ± 0%   160.0 ± 0%  -50.00% (p=0.000 n=10)
geomean                             89.90        57.51       -36.03%
¹ all samples are equal

                                │ /tmp/old.txt │            /tmp/new2.txt             │
                                │  allocs/op   │ allocs/op   vs base                  │
Inject/SampledSpanContext-16       11.000 ± 0%   3.000 ± 0%  -72.73% (p=0.000 n=10)
Inject/WithoutSpanContext-16        1.000 ± 0%   1.000 ± 0%        ~ (p=1.000 n=10) ¹
Extract/Sampled-16                  6.000 ± 0%   4.000 ± 0%  -33.33% (p=0.000 n=10)
Extract/BogusVersion-16             1.000 ± 0%   1.000 ± 0%        ~ (p=1.000 n=10) ¹
Extract/FutureAdditionalData-16     6.000 ± 0%   4.000 ± 0%  -33.33% (p=0.000 n=10)
geomean                             3.308        2.169       -34.43%
¹ all samples are equal

xiehuc · 2023-11-29T02:21:26Z

new2.txt

xiehuc · 2023-11-29T02:21:47Z

@MrAlias all comments have been addressed, please check again, thanks

improve trace_context performance

88c89b1

* using strings.Builder instead of fmt.Sprint
* using strings.Cut instead of strings.Split
* using hex.Decode instead of hex.DecodeString
* avoid using regexp

xiehuc marked this pull request as ready for review November 16, 2023 03:08

xiehuc requested review from MrAlias, Aneurysm9, evantorrie, XSAM, dashpole, MadVikingGod, pellared, hanyuancheung and dmathieu as code owners November 16, 2023 03:08

fix code style

960afe9

hanyuancheung reviewed Nov 16, 2023

fix build

0746142

hanyuancheung approved these changes Nov 16, 2023

update changelog

46f8df2

MrAlias reviewed Nov 17, 2023

refine code

6ea74ff

hanyuancheung and others added 3 commits November 18, 2023 16:23

Merge branch 'main' into propagation

8769f2f

Merge branch 'main' into propagation

b822a32

Merge branch 'main' into propagation

685e13a

MrAlias approved these changes Nov 28, 2023

update comment

770fb29

MrAlias merged commit 0405492 into open-telemetry:main Nov 29, 2023
25 checks passed

MrAlias added this to the v1.22.0 milestone Jan 11, 2024

MadVikingGod mentioned this pull request Jan 11, 2024

Release v1.22.0/v0.45.0 #4821

Merged
