[APM] Display related trace logs in the trace sample content #67611

formgeist · 2020-05-28T12:44:05Z

Blocked: An embeddable log component from Logs team is required.

Summary

Based on the design solution in elastic/apm#179

We're currently linking out to the Logs app to display the related logs for a specific trace.id, but it would be a better experience to show the logs in the context of the trace sample immediately available from within the APM app.

Design proposal

Figma prototype link

Possible API

<LogStream 
 timestamp="1590690626648" 
 filter={encodeURIComponent('trace.id:"0570667f4e27e2cac0d6c5b311c65918"')} 
/>

Prerequisites for starting implementation

There are a number of things that can be seen as dependencies for this feature, and this work should be planned ahead when we prioritize it.

A way to display the log events in a similar style as the Logs stream in the Logs app. Currently the log stream is not embeddable.
The embedded view would also need to support actions like "View log line in context" and "View log details" which would link to the Logs app with the relevant views.
Ways of adding our own data columns to the logs viewer e.g. the service legend (see screenshots).


elasticmachine · 2020-05-28T12:44:07Z

Pinging @elastic/apm-ui (Team:apm)

tbragin · 2020-05-28T16:49:35Z

Big +1 from me.

This seems to be blocked on the Logs Stream app becoming embeddable.

@elastic/logs-metrics-ui do we have an issue we're tracking around converting Logs Stream app to an embeddable component?

cc @roncohen @alvarolobato @mukeshelastic @cyrille-leclerc

weltenwort · 2020-05-28T17:55:51Z

do we have an issue we're tracking around converting Logs Stream app to an embeddable component?

Not yet - the approach probably highly depends on how exactly the embedding UI would have to control and inject the source configuration.

sorenlouv · 2020-05-28T18:45:29Z

@weltenwort makes sense. What do you think about a simple first version that takes the same parameters as the link from apm to logs (timestamp and filter)

<LogStream 
 timestamp="1590690626648" 
 filter={encodeURIComponent('trace.id:"0570667f4e27e2cac0d6c5b311c65918"')} 
/>

In the first version I think it's fine if it's just a static number of log lines. If users want to see more lines, they can click "Open in Logs" which will take them to the logs app.

sorenlouv · 2020-05-28T18:48:06Z

An alternative approach could be to specify the time range:

<LogStream 
 startTime="1590690626648" 
 endTime="1590690636648" 
 filter={encodeURIComponent('trace.id:"0570667f4e27e2cac0d6c5b311c65918"')} 
/>

graphaelli · 2020-05-29T21:58:56Z

@formgeist and I came up with two other contexts (besides per-trace) to consider:

per-transaction logs - similar to marks but with the full abilities of a log statement
per-span logs - common in OT, but without specialized APIs and event types.

In both cases, the data should actually be small so a more minimal UI might be appropriate, handy if an embeddable logging ui component is a ways off. @formgeist is exploring the possibilities in parallel with this issue.

sorenlouv · 2020-05-30T20:16:39Z

per-span logs

Do we annotate log lines with transaction.id/span.id - I thought we only did this for trace.id ?

weltenwort · 2020-06-02T10:49:45Z

The complicated part of embedding the logs is the source configuration, i.e. which log indices to look at and which columns to show. I see two options for how to implement it:

We refactor the log stream (and underlying APIs) to work with a dynamically passed source configuration. This configuration would have to be passed to the embedded log stream component and via the link URL.
APM statically injects a (hidden) source configuration via the server-side plugin contract and references that when embedding and linking to the Logs UI. (This is what stack monitoring already does when linking.)

The former is more powerful but also requires more effort and handling of edge-cases. The latter is more restricted but requires fewer changes on the logs stream side.

felixbarny · 2020-06-02T11:09:07Z

per-span logs

Do we annotate log lines with transaction.id/span.id - I thought we only did this for trace.id ?

In the Java agent, we add transaction.id and trace.id. If possible, I'd like to avoid span-scoped logs (adding span.id to logs) as the performance overhead for that would be significantly higher. Given that the transaction.id and the timestamp should tell you most of the time in which span the log happened, the cost/benefit ratio seems like it's not worth it to me.

sorenlouv · 2020-06-02T22:16:37Z

@weltenwort The first approach definitely sounds more appealing to me since it's stateless and therefore typically less error-prone but if the other approach is significantly easier on your part we might have to go that route.

Either way, how can APM know which log indices to link to?

sorenlouv · 2020-06-02T22:18:10Z

Thanks @felixbarny . Not having span scoped logs sounds okay to me.

weltenwort · 2020-06-03T08:49:03Z

Either way, how can APM know which log indices to link to?

And how can the Logs UI know? One way out might be to rely on the new indexing strategy, from which we could derive a convention about the data type and namespace. Does the APM config have the concept of a namespace?

sorenlouv · 2020-06-03T10:50:46Z

One way out might be to rely on the new indexing strategy, from which we could derive a convention about the data type and namespace. Does the APM config have the concept of a namespace?

I haven't spent any time looking into the new indexing strategy for apm. "namespace" is the last part like {type}-{dataset}-{namespace}, right?
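For illustration, a minimal sketch of reading the namespace out of a name that follows the `{type}-{dataset}-{namespace}` pattern mentioned above. `parseDataStreamName` is a hypothetical helper, not an existing Kibana or APM function, and it assumes that the type and namespace contain no hyphens.

```typescript
// Hypothetical helper: splits a name of the form {type}-{dataset}-{namespace}.
// Assumption: type is everything before the first hyphen and namespace is
// everything after the last hyphen; the dataset is whatever sits in between.
interface DataStreamName {
  type: string;
  dataset: string;
  namespace: string;
}

function parseDataStreamName(name: string): DataStreamName | undefined {
  const first = name.indexOf("-");
  const last = name.lastIndexOf("-");

  // Reject names with fewer than three non-empty parts.
  if (first <= 0 || last <= first + 1 || last === name.length - 1) {
    return undefined;
  }

  return {
    type: name.slice(0, first),
    dataset: name.slice(first + 1, last),
    namespace: name.slice(last + 1),
  };
}
```

With this sketch, a name such as `logs-nginx.access-default` would yield the namespace `default`, while a legacy pattern like `filebeat-*` would not match the three-part convention in a meaningful way.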

graphaelli · 2020-06-03T14:57:08Z

I would expect APM to use the same log source the logs UI is using - there is a single data source configuration for logs UI right? That is, logs will be in filebeat-* now, logs-* soon, plus the user can customize them.

I'd like to avoid span-scoped logs (adding span.id to logs) as the performance overhead for that would be significantly higher

This is common with Jaeger and through opentracing and users are already doing their own log correlation using span.id so I don't think the two - displaying span scoped logs and logging span.id by agents by default - need to be interdependent. I'd like to see what the design looks like and have a discussion on priorities before discarding this idea.

jasonrhodes · 2020-07-01T22:03:11Z

I'm thinking through some options for how we can make this as simple as possible. I don't think most apps in Kibana are going to want logs from specific indices, but rather "just please give me 'the logs' that match these ECS-based and/or time-based criteria" so I think we'll need to be able to accommodate that based on a very simple index-based convention.

sorenlouv · 2020-07-01T22:28:59Z

so I think we'll need to be able to accommodate that based on a very simple index-based convention.

Does that mean querying the indices that are already specified in the Logs Settings? I think that is what the user would expect.

jasonrhodes · 2020-07-02T01:21:42Z

@sqren with the way things are set up today, yes I think that's the sanest and simplest approach. If a user has created a separate space in order to look at a specific subsection of logs with a much more custom source, the logs component may not work as well in that use case, but we may be able to live with that.

I may be forgetting some problems with this approach but I don't think we need to make the indices customizable for a component like this.

sorenlouv · 2020-07-02T06:29:08Z

I don't think we need to make the indices customizable for a component like this.

Agreed, I think it should simply read from the indices specified in Log Settings. In that case what do you think about a Log component with the following interface?

<LogStream 
 timestamp="1590690626648" 
 filter={encodeURIComponent('trace.id:"0570667f4e27e2cac0d6c5b311c65918"')} 
/>

weltenwort · 2020-07-06T10:34:25Z

In that case what do you think about a Log component with the following interface?

The log stream is now limited using a time range as well (for performance and uniformity reasons). So we have three timestamps: startTime, endTime and a timestamp for the log line of interest. How about the following semantics:

| `startTime` | `endTime` | `timestamp` |
| --- | --- | --- |
| given | given | `endTime` |
| `timestamp` - 1 day | `timestamp` + 1 day | given |
| given | given | given, but clamped to `[startTime, endTime]` |
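A minimal sketch of how the semantics in the table above could be resolved in code. The function name `resolveLogStreamTimes`, the interfaces, and the use of numeric epoch milliseconds (rather than the string props in the earlier examples) are assumptions for illustration, not the actual component API. Combinations the table does not cover are rejected rather than guessed.

```typescript
// Hypothetical prop shape; the names follow the discussion, not a confirmed API.
interface LogStreamTimeProps {
  startTime?: number; // epoch milliseconds
  endTime?: number; // epoch milliseconds
  timestamp?: number; // epoch milliseconds of the log line of interest
}

interface ResolvedTimes {
  startTime: number;
  endTime: number;
  timestamp: number;
}

const ONE_DAY_MS = 24 * 60 * 60 * 1000;

function resolveLogStreamTimes(props: LogStreamTimeProps): ResolvedTimes {
  const { startTime, endTime, timestamp } = props;

  if (startTime !== undefined && endTime !== undefined) {
    if (startTime > endTime) {
      throw new Error("startTime must not be after endTime");
    }
    // Row 1: no timestamp given, so fall back to endTime.
    // Row 3: timestamp given, so clamp it into [startTime, endTime].
    const target =
      timestamp === undefined
        ? endTime
        : Math.min(Math.max(timestamp, startTime), endTime);
    return { startTime, endTime, timestamp: target };
  }

  if (startTime === undefined && endTime === undefined && timestamp !== undefined) {
    // Row 2: only the timestamp is given, so use a window of one day on each side.
    return {
      startTime: timestamp - ONE_DAY_MS,
      endTime: timestamp + ONE_DAY_MS,
      timestamp,
    };
  }

  // The table does not define any other combination.
  throw new Error("Unsupported combination of startTime, endTime and timestamp");
}
```

For example, calling it with only `timestamp: 1590690626648` would produce a two-day window centered on that timestamp.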

The filter wouldn't have to be URI-encoded if passed as a prop.

For the sake of keeping things organized, could we formulate the requirements as a response to #70513?

sorenlouv · 2020-07-06T10:57:53Z

and a timestamp for the log line of interest

What is the purpose of timestamp when startTime and endTime are given? Will lines that occur at that timestamp be highlighted?

The filter wouldn't have to be URI-encoded if passed as a prop.

sgtm 👍

For the sake of keeping things organized, could we formulate the requirements as a response to #70513?

Sure, I'll add that

weltenwort · 2020-07-06T12:37:20Z

What is the purpose of timestamp when startTime and endTime is given? Will lines that occur at that timestamp be highlighted?

It's the time that the log stream should be scrolled to. A [startTime, endTime] interval of one hour, for example, could contain millions of lines so we'd have to know which time to scroll to initially.

sorenlouv · 2020-07-06T12:42:07Z

Okay, makes sense 👍

cyrille-leclerc · 2020-07-23T09:41:41Z

Would it make sense to add span.id in the log messages emitted by elastic/ecs-logging libraries to have finer correlation between the distributed trace waterfall view and the log message?

CC @felixbarny @alex-fedotyev

Notes

span.id has just been added to ECS (schemas/tracing.yml: add span.id ecs#882)
Sample log message emitted by ECS Logging 0.4.0 https://gist.github.com/cyrille-leclerc/df77175685fd3dc4c053fe060d04e29c

felixbarny · 2020-07-23T11:04:28Z

It might but I don't think it's worth it given the alternatives and overhead.

Adding the span ID would significantly increase the overhead, especially if there are many spans (some users might do 100s or 1000s of DB queries per transaction). Depending on the Agent and how it integrates with the loggers, this overhead also occurs if there are no log statements associated with a span.

As we already add the transaction.id and also have the timestamp, we can make a pretty good guess as to which span a log belongs to.

Which UI features do you have in mind that could benefit from a span.id in the logs? Are you thinking to add a logs tab to spans? If so, would it be good enough to show all transaction logs and to highlight those that were logged within the span's duration?
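A rough sketch of that last idea: take all logs of a transaction and flag the ones whose timestamp falls within a span's duration. The `LogEntry` and `SpanWindow` shapes and the `markLogsInSpan` function are hypothetical names for illustration and do not correspond to existing APM or Logs UI types.

```typescript
// Hypothetical shapes for illustration only.
interface LogEntry {
  timestampMs: number; // epoch milliseconds of the log line
  message: string;
}

interface SpanWindow {
  startMs: number; // epoch milliseconds at which the span started
  durationMs: number; // span duration in milliseconds
}

type HighlightedLogEntry = LogEntry & { highlighted: boolean };

// Returns every transaction log, flagging those logged while the span was active.
// Both ends of the span window are treated as inclusive.
function markLogsInSpan(logs: LogEntry[], span: SpanWindow): HighlightedLogEntry[] {
  const endMs = span.startMs + span.durationMs;
  return logs.map((log) => ({
    ...log,
    highlighted: log.timestampMs >= span.startMs && log.timestampMs <= endMs,
  }));
}
```

Note that this is only a time-based guess: if several spans overlap in time, a log line would be highlighted for each of them, which is the trade-off against logging span.id explicitly.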

cyrille-leclerc · 2020-07-30T15:40:53Z

FYI we are doing additional design work with @formgeist on this topic.

afgomez · 2020-09-08T08:43:15Z

👋! We have made a first version of an embeddable logs component in #76262

Please take a look and play with it. Feel free to ask me any questions or feature requests :)

sorenlouv · 2020-09-08T12:50:34Z

Awesome @afgomez ! I'll ping you later this week and get it implemented in APM. Looks like it's going to be very easy. Thanks for doing this!

sorenlouv · 2020-10-08T13:10:00Z

Replaced by #79995

formgeist added blocked Team:APM All issues that need APM UI Team support labels May 28, 2020

formgeist mentioned this issue May 28, 2020

Embedding logs into the APM UI elastic/apm#179

Closed

sorenlouv added [zube]: Inbox v7.10.0 and removed [zube]: Inbox labels Jun 3, 2020

jasonrhodes mentioned this issue Jul 1, 2020

[R&D] Logs embeddables: what are your use cases? #70513

Closed

sorenlouv added the [zube]: (7.10) Planned for release label Jul 15, 2020

jasonrhodes mentioned this issue Jul 21, 2020

[Logs UI] Simple Logs component POC #72629

Closed

afgomez mentioned this issue Aug 21, 2020

Implement embeddable <LogStream /> component #75650

Closed

5 tasks

sorenlouv self-assigned this Sep 8, 2020

sorenlouv added [zube]: In Progress and removed [zube]: (7.10) Planned for release blocked v7.10.0 labels Sep 8, 2020

sorenlouv removed their assignment Sep 9, 2020

sorenlouv added [zube]: Backlog v7.11.0 and removed [zube]: (7.10) Planned for release labels Sep 9, 2020

sorenlouv assigned nehaduggal Sep 9, 2020

sorenlouv added [zube]: (7.11) and removed [zube]: Backlog labels Sep 23, 2020

sorenlouv closed this as completed Oct 8, 2020

zube bot added [zube]: Done and removed [zube]: (7.11) labels Oct 8, 2020

graphaelli removed the [zube]: Done label Oct 21, 2020

bmorelli25 mentioned this issue Oct 29, 2020

[APM] docs: Trace logs integration #82057

Closed

sorenlouv mentioned this issue Nov 3, 2020

[APM] Display related trace logs #79995

Closed

formgeist mentioned this issue Dec 15, 2020

[APM] Discuss: Configure columns in the Logstream component used in the Trace detail view #85947

Open
