Fix instrumentation for Kafka Streams 2.6+ #2951

tylerbenson · 2021-07-22T16:44:25Z

The signature of the method we're instrumenting changed in 2.6 from process() to process(long). By removing the no-arg constraint, we now instrument either method.

I've also extended the test to verify the latest version of the library. In order to do so, I had to split the tests to resolve a compilation issue.

Inspired by open-telemetry/opentelemetry-java-instrumentation#3438. Thanks @GuillaumeWaignier!

richardstartin · 2021-07-22T16:55:33Z

What do the checkpoint timelines look like?

tylerbenson · 2021-07-22T17:02:56Z

Here's output from running locally. Not quite sure how to interpret it though:

Activity checkpoints by thread ordered by time
Test worker:                                                                                                   |-startSpan/1-|-suspend/1-|----------|-----------|-------------|-----------|-------------|-------------|-----------|-----------|----------|-----------|----------|-----------|-------------|-----------|
kafka-producer-network-thread | producer-1:                                                                    |-------------|-----------|-resume/1-|-endSpan/1-|-------------|-----------|-------------|-------------|-----------|-----------|----------|-----------|----------|-----------|-------------|-----------|
test-application-278e57c6-a8cf-4522-8406-0cb56bebd042-StreamThread-1:                                          |-------------|-----------|----------|-----------|-startSpan/2-|-endSpan/2-|-startSpan/3-|-startSpan/4-|-suspend/4-|-endSpan/3-|----------|-----------|----------|-----------|-------------|-----------|
kafka-producer-network-thread | test-application-278e57c6-a8cf-4522-8406-0cb56bebd042-StreamThread-1-producer: |-------------|-----------|----------|-----------|-------------|-----------|-------------|-------------|-----------|-----------|-resume/4-|-endSpan/4-|-resume/3-|-endTask/3-|-------------|-----------|
-C-1:                                                                                                          |-------------|-----------|----------|-----------|-------------|-----------|-------------|-------------|-----------|-----------|----------|-----------|----------|-----------|-startSpan/5-|-endSpan/5-|

richardstartin · 2021-07-22T17:18:47Z

Span 3 should have been suspended on test-application-278e57c6-a8cf-4522-8406-0cb56bebd042-StreamThread-1 before being resumed on kafka-producer-network-thread | test-application-278e57c6-a8cf-4522-8406-0cb56bebd042-StreamThread-1-producer, but this isn't a new problem.
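The ordering rule described above (a span must be suspended before it is resumed elsewhere) can be checked mechanically. The following is a minimal sketch, not the project's checkpoint validator: the class and method names are hypothetical, and the event list is simply the timeline above read column by column.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

class CheckpointOrderSketch {
  // Returns the ids of spans that were resumed without a preceding suspend.
  // Events use the "kind/spanId" form shown in the timeline, in time order.
  static List<String> resumedWithoutSuspend(List<String> events) {
    Set<String> suspended = new HashSet<>();
    List<String> violations = new ArrayList<>();
    for (String event : events) {
      String[] parts = event.split("/");
      String kind = parts[0];
      String spanId = parts[1];
      if (kind.equals("suspend")) {
        suspended.add(spanId);
      } else if (kind.equals("resume") && !suspended.remove(spanId)) {
        // resume seen, but no matching suspend was recorded earlier
        violations.add(spanId);
      }
    }
    return violations;
  }

  public static void main(String[] args) {
    // The timeline from the local run, flattened across threads.
    List<String> timeline = Arrays.asList(
        "startSpan/1", "suspend/1", "resume/1", "endSpan/1",
        "startSpan/2", "endSpan/2", "startSpan/3", "startSpan/4",
        "suspend/4", "endSpan/3", "resume/4", "endSpan/4",
        "resume/3", "endTask/3", "startSpan/5", "endSpan/5");
    System.out.println(resumedWithoutSuspend(timeline)); // prints [3]
  }
}
```

Under this rule only span 3 is flagged, which matches the observation in the comment.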

dougqh · 2021-07-27T14:47:07Z

...n/java/datadog/trace/instrumentation/kafka_streams/KafkaStreamsProcessorInstrumentation.java

@@ -103,7 +102,7 @@ public StopInstrumentation() {
    @Override
    public void adviceTransformations(AdviceTransformation transformation) {
      transformation.applyAdvice(
-          isMethod().and(isPublic()).and(named("process")).and(takesArguments(0)),
+          isMethod().and(isPublic()).and(named("process")),


Just a question: is this how we want to handle this?

I'm concerned this opens us up to overmatching if `process` is overloaded at a later time.
I think we'd be better off having a match for each signature that we want to instrument.

I'm interested to hear other opinions. I'm also not going to block this PR for this reason alone.

mcculls · 2021-07-29

There hasn't been much change to this method: previously it took no arguments, now it takes a single long. Looking at the class and how this method is used, I think it's unlikely that this method will be overloaded, but of course not impossible.

In other instrumentations we've generally gone for looser method matching if it's an internal method of an implementation class. If the advice were simple then I wouldn't be concerned, but this is one case where we're grabbing activeSpan / activeScope and trusting it is the right thing to finish/close. So if we did ever match overloaded (or 'bridge') methods then this would risk closing both the scope we wanted to close and any surrounding scope...

So I would be happier to see more of an explicit match here, or alternatively an extra check in StopSpanAdvice that we're closing the right span/scope, even just a check of the type. (Note there is work planned to address this separately by adding methods to the scope manager, but that work still needs to be scheduled.)
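To make the risk concrete, here is a minimal sketch of the difference between closing whatever scope is active and first checking that it is the expected one. Everything in it is hypothetical: `Scope`, `stopUnchecked`, `stopChecked`, and the operation name `streams.process` are illustrative stand-ins, not the tracer's types or the actual StopSpanAdvice.

```java
import java.util.ArrayDeque;
import java.util.Deque;

class ScopeCheckSketch {
  // Hypothetical stand-in for a tracer scope; not the real tracer type.
  static final class Scope {
    final String operationName;
    boolean closed;

    Scope(String operationName) {
      this.operationName = operationName;
    }
  }

  // Most recently activated scope sits at the head of the deque.
  private final Deque<Scope> active = new ArrayDeque<>();

  Scope activate(String operationName) {
    Scope scope = new Scope(operationName);
    active.push(scope);
    return scope;
  }

  // Unchecked stop: closes whatever happens to be active.
  void stopUnchecked() {
    Scope scope = active.poll();
    if (scope != null) {
      scope.closed = true;
    }
  }

  // Checked stop: closes the active scope only if it is the one we expect.
  boolean stopChecked(String expectedOperationName) {
    Scope scope = active.peek();
    if (scope == null || !scope.operationName.equals(expectedOperationName)) {
      return false;
    }
    active.pop();
    scope.closed = true;
    return true;
  }

  public static void main(String[] args) {
    ScopeCheckSketch tracer = new ScopeCheckSketch();
    Scope outer = tracer.activate("outer");
    tracer.activate("streams.process");
    // Simulate the exit advice running twice, as it would if two overloads matched.
    tracer.stopChecked("streams.process");
    tracer.stopChecked("streams.process");
    System.out.println("outer closed: " + outer.closed); // prints "outer closed: false"
  }
}
```

With `stopUnchecked` called twice in the same situation, the surrounding `outer` scope would be closed as well, which is the failure mode described above.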

tylerbenson · 2021-07-29

Would you prefer that I change it to an `or` that checks for either number of args?

mcculls · 2021-07-29

I'm ok with that since we're going to fix the StopSpanAdvice soon.

tylerbenson · 2021-07-29

Ok, I've updated the matcher.
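The trade-off discussed in this thread can be illustrated without the instrumentation framework. The sketch below is not the merged matcher: it uses plain reflection and `java.util.function.Predicate` as a stand-in for the matcher DSL, and `HypotheticalTask` with its two-argument overload is invented to show what a name-only match would pick up if the method were overloaded later.

```java
import java.lang.reflect.Method;
import java.lang.reflect.Modifier;
import java.util.Arrays;
import java.util.function.Predicate;

class MatcherSketch {
  // Hypothetical class: process() and process(long) mirror the two signatures
  // named in this PR; process(long, String) is an invented future overload.
  static class HypotheticalTask {
    public boolean process() { return true; }
    public boolean process(long wallClockTime) { return true; }
    public boolean process(long wallClockTime, String unrelated) { return true; }
  }

  // Equivalent in spirit to "is public and named process".
  static final Predicate<Method> LOOSE =
      m -> Modifier.isPublic(m.getModifiers()) && m.getName().equals("process");

  // Adds an "or" over the accepted argument counts (zero or one).
  static final Predicate<Method> EXPLICIT =
      LOOSE.and(m -> m.getParameterCount() == 0 || m.getParameterCount() == 1);

  static long countMatches(Predicate<Method> matcher, Class<?> type) {
    return Arrays.stream(type.getDeclaredMethods()).filter(matcher).count();
  }

  public static void main(String[] args) {
    System.out.println("loose: " + countMatches(LOOSE, HypotheticalTask.class));       // prints "loose: 3"
    System.out.println("explicit: " + countMatches(EXPLICIT, HypotheticalTask.class)); // prints "explicit: 2"
  }
}
```

The name-only predicate also matches the invented overload, while the argument-count predicate matches only the two known signatures. A stricter variant could additionally require that the single parameter is a `long`.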

Fix instrumentation for Kafka Streams 2.6+

87c4af0

tylerbenson requested a review from a team as a code owner July 22, 2021 16:44

tylerbenson mentioned this pull request Jul 22, 2021

Remove kafka-streams-0.11 latestDepTest limits open-telemetry/opentelemetry-java-instrumentation#3451

Closed

dougqh reviewed Jul 27, 2021

mcculls approved these changes Jul 29, 2021

richardstartin approved these changes Jul 29, 2021

Make matcher more specific rather than remove arg matcher.

aa611e7

tylerbenson merged commit ee5187d into master Jul 29, 2021

tylerbenson deleted the tyler/kafka-streams-latest branch July 29, 2021 17:04

github-actions bot added this to the 0.84.0 milestone Jul 29, 2021

tylerbenson mentioned this pull request Aug 23, 2021

Kafka Streams: remove span/scope from task #3019

Merged
