Add SimpleConcurrentProcessor for Logs, clarify existing export concurrency #4163

cijothomas · 2024-07-25T15:27:26Z

Fixes most of #4134 (Tracing signal not addressed in this PR)

Changes

Clarifies explicitly that the existing built-in processors should not invoke Export() concurrently. This was already the intention (from what I gather!), but not explicitly listed.
Adds SimpleConcurrentProcessor (happy to get alternative name suggestions), which can call Export() concurrently.

.NET, Rust, C++ have their SimpleProcessor already matching this.
Java, Go, Python does not. I believe they can fix the SimpleProcessor implementation to match this. This should be treated as a bug fix. Since SimpleProcessors were meant to be used in test/debug scenarios (stdout exporters), the extra perf hit should not be a concern. If it is indeed a concern, then users can be advised to switch to SimpleConcurrentProcessor.

(I also considered the possibility of adding extra capability to existing SimpleProcessor, but I believe it is better to have a dedicated one to avoid any confusion about "Simple" being in used for high-perf, prod scenarios. Also, adding more capabilities may make SimpleProcessor more complex without much gains.)

Additional background/context:
.NET, Rust, C++ had the equivalent of SimpleConcurrentProcessors for quite a while and was used when exporting to OS native tracing systems like ETW (Windows), Linux user_events. Exporting to these systems are done for scenarios requiring very high performance, and the existing Simple/Batch processors does not allow such high performance, necessitating the need of an official exporting processor.

(Note: A similar PR can be done for tracing sdk as well, but want to first try out in Log SDK. Once this is done, will make a follow up PR to trace sdk as well.)

Related issues #
Related OTEP(s) #
Links to the prototypes (when adding or changing features)
CHANGELOG.md file updated for non-trivial changes
spec-compliance-matrix.md updated if necessary

specification/logs/sdk.md

pellared · 2024-07-26T06:39:49Z

specification/logs/sdk.md

@@ -427,6 +429,21 @@ the configured `processor`.

 * `processor` - processor to be isolated.

+#### Simple Concurrent processor


I strongly suggest NOT using the term "concurrent" as usually it conveys that there is some of synchronization behind. E.g. "concurrent collections" are collections that can be used in multithreaded code. In some languages there is an idiom that if the type has Concurrent in its name, the method calls are synchronized.

My naming proposal is "Passthrough processor"

This resonates with me too -- the use of "concurrent" suggests some kind of attention to concurrency, a limit of some sort in addition to offering concurrency. Maybe the name could be AsynchronousProcessor -- starts the export and returns success immediately.

I admit this worries me a bit -- is the exporter expected to implement some kind of limit, to avoid running out of memory? Can this be composed with the BatchSpanProcessor? What will control whether the BatchSpanProcessor drops spans vs. this processor consuming unlimited amounts of data?

Here, we have an OTel Collector processor that offers unlimited concurrency, but subject to a limit on the total amount of pending data, I consider it a potential solution to the problems posed above: https://github.com/open-telemetry/otel-arrow/tree/main/collector/processor/concurrentbatchprocessor

Maybe the name could be AsynchronousProcessor -- starts the export and returns success immediately.

That is not the intend. The intended exporters here not just 'starts' the export. It does everything inline (i.e serialization, writing to destination.), and then return success/failure. (from what I know, such exporters are typically writing to ETW, user-events)

is the exporter expected to implement some kind of limit, to avoid running out of memory?

No. Such exporters (etw/user-events) does not buffer anything. The logRecord is serialized and handed over to destination inline. ETW/user-events system is backed by OperatingSystem kernel memory, and they have ample mechanisms to keep memory in control, but those are outside the scope of the exporter.

@cijothomas any thoughts about my naming proposal?

My other proposals are

"Direct processor" meaning that it directly calls the exporter.

"Adapter processor" meaning that it is only an adapter and has no logic/ synchronization

naming is hard 🤣 I am not sure if any of the alternate suggestions are significantly better. I'll keep thinking and hope we get more suggestions.

specification/logs/sdk.md

pellared

👍

specification/logs/sdk.md

pellared · 2024-07-26T07:00:21Z

specification/logs/sdk.md

-Depending on the implementation the result of the export may be returned to the
+`Export` MAY be called concurrently for the same exporter instance if paired
+with [Simple Concurrent Processor](#simple-concurrent-processor). Each exporter
+implementation MUST document whether it supports concurrent `Export` calls, and


We cannot force custom implementation on how they document their implementations

It is not only about exporting but also ForceFlush and Shutdown

Suggested change

implementation MUST document whether it supports concurrent `Export` calls, and

implementation provided by the SDK MUST document whether it is concurrent safe, and

We cannot enforce, but the spec wording is to make sure any person authoring own exporter should follow this spec. If they don't follow, then there could be undesired behavior.
Alternatively, we can word it in such a way that "It must be documented to the exporter authors that they should document the exporters' concurrency characteristics....

I believe this was part of why it was went with that the processor should called all exporters the same and not have to worry about concurrency.

@tsloughter, the users could still use processors that does not require the exporters to be concurrent safe. But we would give a more performant option for cases where exporter are concurrent safe and can be used synchronously.

@cijothomas, the alternative could be that such exporter packages (like ETW or user_events exporters) would provide their own processors which are "tailored" to their exporters. It can give more flexibility and not increase the SDK functionality surface.

the alternative could be that such exporter packages (like ETW or user_events exporters) would provide their own processors which are "tailored" to their exporters

You are right! That is exactly what we have been doing for years. But that does not mean spec shouldn't support them.
The increase in SDK functionality surface is necessary, to support such scenarios. The spec already has wording that state implementations must have simple and batch (meaning others are optional), so additional processors don't put extra burden on implementations, unless they chose to support.

What is more (I have forgotten tot call it out explictly) I think that e.g. for ETW we can simply provide a single EtwProcessor which does the exporting. There is no need for batching or synchronization so there is no need to provide an Exporter interface implementation.

I think that e.g. for ETW we can simply provide a single EtwProcessor which does the exporting

Yes that is also possible. (In certain ways, OTel Rust does that. Its etwexporter is not really an exporter, but a ~processor).
But I think it is best to model exporters (the thing which does serialization, export telemetry to outside the process) as exporters itself consistently.

But I think it is best to model exporters

I am not really sure as these exporters are emitting batches. And for use cases like ETW and user_events you probably prefer to operate on "single" log record.

They are exporters! (They do the job of serializing and transferring the telemetry to an external entity.) Whether it gets a batch of 1 item or multiple is purely based on choice of processor used. (when used with SimpleProcessor, even OTLPExporter gets a batch of single item only).

specification/logs/sdk.md

tsloughter · 2024-07-26T11:29:31Z

Deleted my last comment because I had misread what was and wasn't in the current spec for Logs and don't want to confuse anyone reading these later.

Exporting can be done concurrently, what is the reason for needing a "ConcurrentProcessor"?

cijothomas · 2024-07-26T14:58:44Z

@tsloughter

Exporting can be done concurrently, what is the reason for needing a "ConcurrentProcessor"?

I don't think it is possible to achieve what we want today. We need exporter.export() method called synchronously from the processor. More specifically, when logger.logsomething("hello", foo=bar); statement returns, the LogRecord for that is already sent to processor, and to exporter and serialized and exported! Put differently, if the program crashes the moment after logsomething() returns, the user can be assured that no telemetry is lost due to being stuck in in-memory buffers.

tsloughter · 2024-07-26T17:56:16Z

@cijothomas right, and it is called synchronously. It is the exporter that handles concurrency, or lack there of. By default SimpleProcessor and builtin exporter must not return from export for spans and logs in order to support cases like you describe and it is relied on by users of like aws lambda, but it remains the exporter where concurrency is handled, not the processor.

tsloughter · 2024-07-26T18:56:17Z

Talking on Slack I now see what SimpleConcurrentProcessor is meant to achieve and will give here my description of it in case anyone else messed up like me and jumped to the conclusion it was meant to achieve the already hashed out "concurrent export" functionality:

The issue arises when you have N processes/threads each calling logger.log(record) concurrently and not wanting to block on any others data being sent but does want to block on their record being sent.

To repeat here what I said on slack, I would not do this with a SimpleConcurrentProcessor. I want to look at the existing examples given of exporters before I describe what I'd do in Erlang since it may just be more confusing than useful, but it would keep the restriction on Export intact and not add a SimpleConcurrentProcessor -- as well as have the ability to build on top a way to alleviate lock contention between threads on use of the SimpleProcessor through a SimpleProcessorPool.

cijothomas · 2024-07-26T21:24:01Z

I want to look at the existing examples given of exporters before I describe what I'd do in Erlang

OTel .NET:
Processor: https://github.com/open-telemetry/opentelemetry-dotnet-contrib/blob/main/src/OpenTelemetry.Exporter.Geneva/Internal/ReentrantExportProcessor.cs#L31-L34
Exporter: https://github.com/open-telemetry/opentelemetry-dotnet-contrib/blob/main/src/OpenTelemetry.Exporter.Geneva/MsgPackExporter/MsgPackLogExporter.cs#L116

OTel Rust:
Processor: https://github.com/open-telemetry/opentelemetry-rust-contrib/blob/main/opentelemetry-etw-logs/src/logs/reentrant_logprocessor.rs#L35-L37
Exporter : https://github.com/open-telemetry/opentelemetry-rust-contrib/blob/main/opentelemetry-etw-logs/src/logs/exporter.rs#L356

cijothomas · 2024-07-26T21:25:26Z

alleviate lock contention between threads on use of the SimpleProcessor through a SimpleProcessorPool.

I am not sure what is SimpleProcessorPool.

jmacd · 2024-07-31T15:41:08Z

Relates to #3616

pellared · 2024-08-01T09:33:44Z

My own previous suggestion:

It is not only about exporting but also ForceFlush and Shutdown

I double-checked and this would be a a breaking change in the specification.

Right now, it is allowed to call Shutdown or ForceFlush in parallel with Export. However, calls to Export has to be synchronized.

I created #4173 based on this PR to separate clarification of the export concurrency from proposing a new exporting processor.

github-actions · 2024-08-13T03:17:28Z

This PR was marked stale due to lack of activity. It will be closed in 7 days.

github-actions · 2024-08-24T03:17:15Z

Closed as inactive. Feel free to reopen if this PR is still being worked on.

Add SimpleConcurrentProcessor, clarify existing export concurrency

f2b9c32

cijothomas requested review from a team July 25, 2024 15:27

github-actions bot assigned reyang Jul 25, 2024

cijothomas and others added 2 commits July 25, 2024 08:30

toc

8ecc57b

Merge branch 'main' into cijothomas/processor

71114f8

lalitb reviewed Jul 25, 2024

View reviewed changes

specification/logs/sdk.md Outdated Show resolved Hide resolved

pellared reviewed Jul 26, 2024

View reviewed changes

Update specification/logs/sdk.md

de530e8

pellared reviewed Jul 26, 2024

View reviewed changes

cijothomas added 2 commits July 26, 2024 09:28

Merge branch 'main' into cijothomas/processor

109f817

add review suggestions

8d70036

Merge branch 'main' into cijothomas/processor

e445921

cijothomas mentioned this pull request Jul 30, 2024

Clarification on SimpleProcessor concurrency #4134

Closed

arminru changed the title ~~Add SimpleConcurrentProcessor, clarify existing export concurrency~~ Add SimpleConcurrentProcessor for Logs, clarify existing export concurrency Jul 30, 2024

This was referenced Aug 1, 2024

Clarify logs export concurrency #4172

Closed

Clarify logs export concurrency #4173

Merged

lalitb mentioned this pull request Aug 5, 2024

Eliminate Dynamic Dispatching in Log Pipeline for Performance Optimization open-telemetry/opentelemetry-rust#1942

Open

Merge branch 'main' into cijothomas/processor

a40ab74

github-actions bot added the Stale label Aug 13, 2024

github-actions bot closed this Aug 24, 2024

cijothomas mentioned this pull request Sep 26, 2024

Allow Simple/Batch processor to concurrently invoke Exporter.Export() #4231

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add SimpleConcurrentProcessor for Logs, clarify existing export concurrency #4163

Add SimpleConcurrentProcessor for Logs, clarify existing export concurrency #4163

cijothomas commented Jul 25, 2024 •

edited

Loading

pellared Jul 26, 2024

jmacd Jul 26, 2024

cijothomas Jul 26, 2024

cijothomas Jul 26, 2024

pellared Jul 29, 2024 •

edited

Loading

cijothomas Jul 30, 2024

pellared left a comment

pellared Jul 26, 2024

cijothomas Jul 26, 2024

tsloughter Jul 26, 2024

pellared Jul 31, 2024

cijothomas Jul 31, 2024

pellared Aug 1, 2024

cijothomas Aug 1, 2024

pellared Aug 1, 2024 •

edited

Loading

cijothomas Aug 5, 2024

tsloughter commented Jul 26, 2024

cijothomas commented Jul 26, 2024

tsloughter commented Jul 26, 2024

tsloughter commented Jul 26, 2024

cijothomas commented Jul 26, 2024

cijothomas commented Jul 26, 2024

jmacd commented Jul 31, 2024

pellared commented Aug 1, 2024 •

edited

Loading

github-actions bot commented Aug 13, 2024

github-actions bot commented Aug 24, 2024

		@@ -427,6 +429,21 @@ the configured `processor`.

		* `processor` - processor to be isolated.

		#### Simple Concurrent processor

	implementation MUST document whether it supports concurrent `Export` calls, and
	implementation provided by the SDK MUST document whether it is concurrent safe, and

Add SimpleConcurrentProcessor for Logs, clarify existing export concurrency #4163

Add SimpleConcurrentProcessor for Logs, clarify existing export concurrency #4163

Conversation

cijothomas commented Jul 25, 2024 • edited Loading

Changes

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pellared Jul 29, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pellared left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pellared Aug 1, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tsloughter commented Jul 26, 2024

cijothomas commented Jul 26, 2024

tsloughter commented Jul 26, 2024

tsloughter commented Jul 26, 2024

cijothomas commented Jul 26, 2024

cijothomas commented Jul 26, 2024

jmacd commented Jul 31, 2024

pellared commented Aug 1, 2024 • edited Loading

github-actions bot commented Aug 13, 2024

github-actions bot commented Aug 24, 2024

cijothomas commented Jul 25, 2024 •

edited

Loading

pellared Jul 29, 2024 •

edited

Loading

pellared Aug 1, 2024 •

edited

Loading

pellared commented Aug 1, 2024 •

edited

Loading