feat(telemetry): add OpenTelemetry instrumentation with Aspire Dashboard support #6629
base: dev
Conversation
…andard attribute names
…r gRPC trace export
Change experimental.openTelemetry config from boolean to union type supporting both boolean and object with enabled/endpoint fields. This allows users to configure custom OTLP endpoints for Aspire Dashboard integration while maintaining backward compatibility with boolean config.
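For illustration, the union shape described above might look roughly like this (type and field names here are a sketch, not the actual config schema):

// Sketch only: a plain boolean toggle, or an object with an explicit OTLP endpoint.
type OpenTelemetryConfig =
  | boolean
  | {
      enabled?: boolean
      // e.g. a custom OTLP endpoint such as the Aspire Dashboard's gRPC port
      endpoint?: string
    }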
…tion
Add telemetry module with:
- Config interface and resolveConfig() for endpoint resolution
- init() function with NodeSDK, LoggerProvider, trace/log exporters
- shutdown() for graceful cleanup
- withSpan() helper for span creation with error handling
- isEnabled(), getTracer(), getLogger() utility functions
- SeverityMap for log level mapping
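As a rough idea of the withSpan() helper mentioned above, a minimal sketch against @opentelemetry/api (the tracer name and error-handling details are assumptions, not the actual module):

import { trace, SpanStatusCode } from "@opentelemetry/api"

// Run a callback inside an active span, record any exception, and always end the span.
async function withSpan<T>(name: string, fn: () => Promise<T>): Promise<T> {
  const tracer = trace.getTracer("opencode")
  return tracer.startActiveSpan(name, async (span) => {
    try {
      return await fn()
    } catch (err) {
      span.recordException(err as Error)
      span.setStatus({ code: SpanStatusCode.ERROR })
      throw err
    } finally {
      span.end()
    }
  })
}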
Integrate OpenTelemetry log emission into the Log module. When telemetry is enabled, all log messages (debug/info/warn/error) are emitted to the OTLP endpoint alongside file-based logging.
- Lazy-load telemetry module to avoid circular dependency
- Guard against recursive calls during module initialization
- Emit logs with proper severity levels using Telemetry.SeverityMap
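A hedged sketch of the severity mapping and log emission using the @opentelemetry/api-logs Logs API (only SeverityMap and the level names come from the commit above; the rest is illustrative):

import { logs, SeverityNumber } from "@opentelemetry/api-logs"

// Map the Log module's levels onto OTel severity numbers.
const SeverityMap = {
  debug: SeverityNumber.DEBUG,
  info: SeverityNumber.INFO,
  warn: SeverityNumber.WARN,
  error: SeverityNumber.ERROR,
} as const

// Emit one record to whatever LoggerProvider was registered globally during init().
function emit(level: keyof typeof SeverityMap, message: string, attributes?: Record<string, string>) {
  logs.getLogger("opencode").emit({
    severityNumber: SeverityMap[level],
    severityText: level,
    body: message,
    attributes,
  })
}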
- Initialize telemetry in yargs middleware after Log.init()
- Check OTEL_EXPORTER_OTLP_ENDPOINT env var or config.experimental.openTelemetry
- Register SIGTERM and SIGINT handlers for graceful shutdown
- Call Telemetry.shutdown() in finally block before process.exit()
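The generic shape of that lifecycle, as a sketch with a bare NodeSDK standing in for the real Telemetry module (the actual middleware wiring differs):

import { NodeSDK } from "@opentelemetry/sdk-node"

const sdk = new NodeSDK()
sdk.start()

// Flush exporters if the process is signalled, mirroring the SIGTERM/SIGINT handlers above.
for (const signal of ["SIGTERM", "SIGINT"] as const) {
  process.on(signal, () => {
    sdk.shutdown().finally(() => process.exit(0))
  })
}

async function run(work: () => Promise<void>) {
  try {
    await work()
  } finally {
    // Graceful shutdown before the normal exit path.
    await sdk.shutdown()
  }
}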
# Conflicts:
#	bun.lock
#	packages/opencode/package.json
Add the standard OpenTelemetry endpoint environment variable to the Flag namespace for use in config loading to consolidate telemetry enablement checks.
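Presumably something as small as exposing the env var from one place (illustrative; the real Flag namespace may look different):

export namespace Flag {
  // Standard OTLP endpoint env var, read once so config loading and telemetry agree.
  export const OTEL_EXPORTER_OTLP_ENDPOINT = process.env["OTEL_EXPORTER_OTLP_ENDPOINT"]
}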
… var checks
Since OTEL_EXPORTER_OTLP_ENDPOINT is now applied at config load time (Phase 2), the CLI entry points no longer need to check the env var directly. This removes the conditional that skipped config loading when the env var was set.
…isEnabled()
- Replace inline env var and config check with Telemetry.isEnabled() helper
- Remove unused Config import since telemetry config is now consolidated
- This ensures consistent telemetry enablement logic via single source of truth
The OTEL_EXPORTER_OTLP_ENDPOINT env var is now applied to config at load time (in config/config.ts), so resolveConfig no longer needs to check it directly. This simplifies the function to only handle the config object.
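For illustration only, the simplified resolution might reduce to something like this (the default endpoint comes from the PR description below; the actual function may differ):

type OpenTelemetrySetting = boolean | { enabled?: boolean; endpoint?: string }

// Only the config value is inspected; OTEL_EXPORTER_OTLP_ENDPOINT is already merged at load time.
function resolveConfig(value?: OpenTelemetrySetting) {
  if (!value) return { enabled: false }
  if (value === true) return { enabled: true, endpoint: "http://localhost:4317" }
  return { enabled: value.enabled ?? true, endpoint: value.endpoint ?? "http://localhost:4317" }
}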
Add unit tests for Telemetry.resolveConfig and config loading behavior:
- Test resolveConfig handles boolean/object/undefined inputs correctly
- Test config loading from file with boolean and object openTelemetry config
- Test openTelemetry defaults to undefined when not configured
- Test OTEL_EXPORTER_OTLP_ENDPOINT env var override behavior
Update plan.md to mark testing task as completed.
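Such a test could look roughly like this with bun:test, assuming the resolveConfig sketch above (the expected values are illustrative, not the committed assertions):

import { describe, expect, test } from "bun:test"

describe("Telemetry.resolveConfig", () => {
  test("true enables telemetry with the default endpoint", () => {
    expect(resolveConfig(true)).toEqual({ enabled: true, endpoint: "http://localhost:4317" })
  })

  test("undefined leaves telemetry disabled", () => {
    expect(resolveConfig(undefined)).toEqual({ enabled: false })
  })

  test("object form can override the endpoint", () => {
    expect(resolveConfig({ endpoint: "http://localhost:4318" })).toEqual({
      enabled: true,
      endpoint: "http://localhost:4318",
    })
  })
})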
  })
  logs.setGlobalLoggerProvider(loggerProvider)

  sdk = new NodeSDK({
My recommendation (and what's somewhat conventional) is that the app implementation is responsible for setting up this exporter machinery, and then, if the app is using a library that has existing otel instrumentation, you enable that. For example, ai-sdk provides otel instrumentation. If you use the openai sdk or claude sdk directly, you'd leverage that instrumentation.
The main issue I could imagine (assuming you don't want to be churning on setting all the attributes to work well across vendors) is that the attribute naming for llm-related spans is still a bit of a mess, as everyone is trying to figure out how to consistently name all these attributes.
If you're just collecting traces for performance's sake and don't care about llm/eval, then all these traces will show up just fine in any trace viewer with the span operation names you've defined. The use case I mostly care about is shipping the signals to a tool like langfuse. Those tools expect specific names to show things like sessionID, llm generation, tool call, etc.
The different vendors are working on making the core attributes more uniform, but it's not there yet, so personally I'd try to punt most of that churn onto something like the ai sdk.
Here's a quick snapshot of the landscape of attribute definitions:
- Openllmetry has a reasonable collection here https://github.com/traceloop/openllmetry/blob/main/packages/opentelemetry-semantic-conventions-ai/opentelemetry/semconv_ai/__init__.py
- Here's what ai-sdk collects https://ai-sdk.dev/docs/ai-sdk-core/telemetry#collected-data
- Langfuse span operation types https://langfuse.com/docs/observability/features/observation-types and trace attributes https://langfuse.com/integrations/native/opentelemetry#trace-level-attributes
- The emerging OTEL semantic conventions (still very incomplete)
- https://opentelemetry.io/docs/specs/semconv/gen-ai/
- https://opentelemetry.io/docs/specs/semconv/gen-ai/gen-ai-agent-spans/ (e.g. operation types still don't have standard naming here yet)
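To make the attribute question concrete, here is a tiny example using a few of the emerging gen_ai.* attributes from the OTel semantic conventions (the model name and token counts are made up for illustration):

import { trace } from "@opentelemetry/api"

// Span name follows the draft "{operation} {model}" convention.
const span = trace.getTracer("example").startSpan("chat claude-sonnet-4")
span.setAttributes({
  "gen_ai.operation.name": "chat",
  "gen_ai.request.model": "claude-sonnet-4",
  "gen_ai.usage.input_tokens": 1200,
  "gen_ai.usage.output_tokens": 85,
})
span.end()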
agreed on the running application chooses the exporter.
i'll check more but running opencode cli should configure the exporter.
also tbh this otel work is just to support development/profiling/debugging for now.
i'll check those emerging standards for attributes to see if I can consolidate.
for now any span/attribute is good and we can easily rename later.
I'll check your other comments later but I'm sure you saw I pushed a big refactor to clean up the implementation to be more like a decorator/using pattern to remove heaps of noise.
yep, saw the cleanup / refactor. This comment summarizes whatever still applies after your refactor.
agreed on the running application chooses the exporter.
i'll check more but running opencode cli should configure the exporter.
Yep, I think we're saying the same thing here. The current experimental_telemetry just enables the ai sdk instrumentation but doesn't start an exporter. So mainly calling out that the biggest missing piece is that something needs to start the exporter.
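For reference, enabling the ai-sdk instrumentation per call looks roughly like this (model and functionId are placeholder values; spans only flow somewhere once an exporter has been started, e.g. via NodeSDK as in the example further down):

import { generateText } from "ai"
import { openai } from "@ai-sdk/openai"

// Per-call telemetry flag from the ai-sdk; see the telemetry docs linked above.
const result = await generateText({
  model: openai("gpt-4o-mini"),
  prompt: "Summarize this diff",
  experimental_telemetry: { isEnabled: true, functionId: "opencode.chat" },
})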
also tbh this otel work is just to support development/profiling/debugging for now.
That makes sense. Mostly calling out that if you retain the ai-sdk enabling, you don't need to re-instrument llm calls, tool calls etc., since those are already done for you and will have the most up-to-date evolving attributes, so the same traces become useful for building agentic engineering evals or workflow review (the part I'm actually more excited about).
That obviously doesn't prevent wrapping those ai SDK calls in your own spans to get even finer-grained instrumentation.
That said, if you're mostly interested in instrumentation for performance profiling, I'd definitely consider setting up the node.js otel auto-instrumentation and metrics. E.g. you'll probably find at least having metrics around GC stats like runs and pauses useful.
crude example:

import { NodeSDK } from "@opentelemetry/sdk-node";
import { ConsoleSpanExporter } from "@opentelemetry/sdk-trace-node";
import { PeriodicExportingMetricReader, ConsoleMetricExporter } from "@opentelemetry/sdk-metrics";
import { getNodeAutoInstrumentations } from "@opentelemetry/auto-instrumentations-node";

const sdk = new NodeSDK({
  traceExporter: new ConsoleSpanExporter(),
  metricReader: new PeriodicExportingMetricReader({
    exporter: new ConsoleMetricExporter(),
  }),
  instrumentations: [
    getNodeAutoInstrumentations()
  ],
});
sdk.start();
I've got some more work to make it perfect - but agree on your points.
i'll double check the ai sdk vs my custom spans.
the aspire dashboard looked pretty nice but i'll check if there are exact double-ups
yup I'll add the typical instrumentation for node, even seeing if our underlying opentui/zig stuff can have instrumentation.
I'll see what makes the first cut vs what gets added to the PR later. the team will check this out over the next few weeks
FWIW, if you wanted to spot check stuff against another otel collector and trace view, you can run a langfuse stack fully locally
git clone https://github.com/langfuse/langfuse.git
cd langfuse
docker compose up
everything (web app and otel collector endpoint) is available on localhost:3000.
OpenTelemetry support for OpenCode is pending upstream approval.

Link to tracking PR: anomalyco/opencode#6629

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Adds experimental OpenTelemetry support for debugging and observability.
What
- bun run dev:otel
- opencode-cli vs opencode-server
- key=value
- context + exception stack traces

Enabling OpenTelemetry

~/.config/opencode/opencode.json:

{ "experimental": { "openTelemetry": true } }

cd packages/opencode
bun run dev:otel

The OTEL_EXPORTER_OTLP_ENDPOINT env var controls the endpoint (defaults to http://localhost:4317).

Images