[Feature] Implement Persistence and Tracing for CrewAI BYO Agents #951
Conversation
Force-pushed from b378d5a to 2c52e01.
EItanya left a comment:
Mostly looking good, just nits so far
Force-pushed from 409c0b8 to 8bbad04.
Reviewed snippet:

```yaml
secretKeyRef:
  name: kagent-google
  key: GOOGLE_API_KEY
name: kagent-openai
```
Have you tested with both LLM providers? Did you notice any differences for spans between the two?
For Gemini it seems the instrumentation library does not instrument all the spans, but for OpenAI it works fine.
Force-pushed from e6220bf to 0d70579.
Signed-off-by: Jet Chiang <jetjiang.ez@gmail.com>
Force-pushed from 0d70579 to a843e4a.
Pull Request Overview
This PR implements persistence and tracing support for CrewAI BYO (Bring Your Own) agents in the KAgent platform. It adds session-based memory storage for CrewAI crews and flow state persistence for CrewAI flows, along with OpenTelemetry tracing integration.
- Implements session-scoped memory storage for CrewAI crews using the LongTermMemory interface
- Adds flow state persistence for CrewAI flows with checkpointing capabilities
- Integrates OpenTelemetry tracing with `opentelemetry-instrumentation-crewai`
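The session-scoped memory idea can be sketched as follows. Names here are illustrative, not the actual `_memory.py` interface, and a plain shared dict stands in for the kagent backend: the point is only that every memory item is keyed by session ID, so crews in different sessions never see each other's long-term memories.

```python
from typing import Any


class SessionScopedMemoryStorage:
    """Sketch of session-scoped long-term memory (hypothetical names;
    the real KagentMemoryStorage talks to the kagent backend)."""

    def __init__(self, session_id: str, backend: dict[str, list[dict[str, Any]]]):
        self.session_id = session_id
        self.backend = backend

    def save(self, value: str, metadata: dict[str, Any]) -> None:
        # Every item is stored under this session's ID.
        self.backend.setdefault(self.session_id, []).append(
            {"value": value, "metadata": metadata}
        )

    def load(self) -> list[dict[str, Any]]:
        # Only items written by the same session are ever returned.
        return self.backend.get(self.session_id, [])


shared: dict[str, list[dict[str, Any]]] = {}
a = SessionScopedMemoryStorage("session-a", shared)
b = SessionScopedMemoryStorage("session-b", shared)
a.save("user prefers haiku", {"source": "crew"})
print(len(a.load()), len(b.load()))  # → 1 0
```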
Reviewed Changes
Copilot reviewed 28 out of 30 changed files in this pull request and generated 5 comments.
| File | Description |
|---|---|
| python/samples/crewai/research-crew/src/research_crew/crew.py | Adds commented memory configuration option |
| python/samples/crewai/research-crew/src/research_crew/config/agents.yaml | Removes hardcoded LLM configuration |
| python/samples/crewai/poem_flow/src/poem_flow/main.py | Implements persistence with @persist decorator and flow state continuation |
| python/packages/kagent-crewai/src/kagent/crewai/_state.py | New KagentFlowPersistence class for CrewAI flow state management |
| python/packages/kagent-crewai/src/kagent/crewai/_memory.py | New KagentMemoryStorage class for session-scoped memory |
| python/packages/kagent-crewai/src/kagent/crewai/_executor.py | Updates executor to support memory and flow persistence |
| python/packages/kagent-crewai/src/kagent/crewai/_a2a.py | Adds CrewAI OpenTelemetry instrumentation |
| go/internal/httpserver/handlers/crewai.go | New Go handler for CrewAI memory and flow state endpoints |
| go/internal/database/models.go | Adds database models for CrewAI memory and flow state |
| go/test/e2e/invoke_api_test.go | Adds comprehensive e2e test for CrewAI agent with persistence |
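As a rough sketch of what flow-state checkpointing involves (hypothetical names; the real `KagentFlowPersistence` delegates to the Go backend), each checkpoint upserts the serialized flow state keyed by flow ID, so a restarted flow can resume from its last saved state. SQLite stands in for the kagent database here:

```python
import json
import sqlite3
from typing import Any, Optional


class FlowStatePersistence:
    """Illustrative checkpoint store; not the actual kagent implementation."""

    def __init__(self, path: str = ":memory:"):
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS flow_states (flow_id TEXT PRIMARY KEY, state TEXT)"
        )

    def save_state(self, flow_id: str, state: dict[str, Any]) -> None:
        # Upsert so each checkpoint replaces the previous one for this flow.
        self.conn.execute(
            "INSERT INTO flow_states (flow_id, state) VALUES (?, ?) "
            "ON CONFLICT(flow_id) DO UPDATE SET state = excluded.state",
            (flow_id, json.dumps(state)),
        )

    def load_state(self, flow_id: str) -> Optional[dict[str, Any]]:
        row = self.conn.execute(
            "SELECT state FROM flow_states WHERE flow_id = ?", (flow_id,)
        ).fetchone()
        return json.loads(row[0]) if row else None


store = FlowStatePersistence()
store.save_state("flow-1", {"step": "generate_poem", "sentence_count": 3})
store.save_state("flow-1", {"step": "save_poem", "sentence_count": 3})
print(store.load_state("flow-1"))  # → {'step': 'save_poem', 'sentence_count': 3}
```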
@EItanya for the existing E2E tests, would it be a good idea to also use Artifacts instead of history when checking the agent response, since artifacts are the usual way of returning task output? I tried it on the ADK inline / declarative agents and it passes.
Signed-off-by: Jet Chiang <jetjiang.ez@gmail.com>
Reviewed snippet:

```python
# Check if a TracerProvider already exists (e.g., set by CrewAI)
current_provider = trace.get_tracer_provider()
if isinstance(current_provider, TracerProvider):
    # TracerProvider already exists, just add our processor to it
```
I think we also need to add our labels here, e.g. `Resource({"service.name": "kagent"})`.
CrewAI already creates a tracing provider that uses `crewAI-telemetry` as its service name (https://github.com/crewAIInc/crewAI/blob/8d93361cb305e39638cb6b1c257c572e129ac9a1/src/crewai/telemetry/telemetry.py). I tried disabling CrewAI's built-in tracing provider so we create our own kagent one, but that resulted in no spans being sent.
A reliable workaround I found is to create a custom span processor wrapper that overwrites the resource service name before exporting:

```python
from typing import Optional

from opentelemetry.context import Context
from opentelemetry.sdk.resources import Resource
from opentelemetry.sdk.trace import ReadableSpan, Span, SpanProcessor
from opentelemetry.sdk.trace.export import BatchSpanProcessor, SpanExporter


class ResourceOverrideSpanProcessor(SpanProcessor):
    """A span processor that overrides the resource attributes before exporting."""

    def __init__(self, exporter: SpanExporter, resource: Resource):
        self.batch_processor = BatchSpanProcessor(exporter)
        self.override_resource = resource

    def on_start(self, span: Span, parent_context: Optional[Context] = None) -> None:
        self.batch_processor.on_start(span, parent_context)

    def on_end(self, span: ReadableSpan) -> None:
        # Override the resource before passing to the batch processor.
        # This ensures the service.name is always the one in the resource (i.e. kagent).
        span._resource = self.override_resource
        self.batch_processor.on_end(span)

    def shutdown(self) -> None:
        self.batch_processor.shutdown()

    def force_flush(self, timeout_millis: int = 30000) -> bool:
        return self.batch_processor.force_flush(timeout_millis)
```

And we can add it like this:

```python
exporter = OTLPSpanExporter(endpoint=trace_endpoint)
processor = ResourceOverrideSpanProcessor(exporter, resource)
existing_crewai_trace_provider.add_span_processor(processor)
```

This would work for any framework that already creates an OTEL tracer provider, and all of its spans will be renamed to kagent. How does this sound?
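The override-before-delegate pattern can be sanity-checked without the OpenTelemetry SDK. All classes below are stand-ins, not the real OTel types; the sketch only shows that the wrapped processor ever observes the overridden resource:

```python
class StubSpan:
    """Stand-in for a span carrying a resource attribute."""
    def __init__(self, resource: str):
        self._resource = resource


class RecordingProcessor:
    """Stands in for BatchSpanProcessor; records what it sees on_end."""
    def __init__(self):
        self.seen = []

    def on_end(self, span: StubSpan) -> None:
        self.seen.append(span._resource)


class ResourceOverride:
    """Same shape as the wrapper above, using the stub types."""
    def __init__(self, inner: RecordingProcessor, resource: str):
        self.inner = inner
        self.override_resource = resource

    def on_end(self, span: StubSpan) -> None:
        # Rewrite the resource before delegating to the inner processor.
        span._resource = self.override_resource
        self.inner.on_end(span)


inner = RecordingProcessor()
proc = ResourceOverride(inner, "kagent")
proc.on_end(StubSpan("crewAI-telemetry"))
print(inner.seen)  # → ['kagent']
```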
It's interesting, because this should probably be configurable. I think let's merge this as is and we can potentially revisit.
Signed-off-by: Jet Chiang <jetjiang.ez@gmail.com>
Force-pushed from 9a104fa to 990efd9.
This PR is a follow-up for #920

Features
- Creates Go handlers for `/crewai` routes for storing and retrieving memory items
- Updates the database handler to store and retrieve long-term memory items from CrewAI Crew and state for CrewAI Flow
- Adds a custom memory store to agents before execution to allow for session-based persistence
- Updates samples to show usage of memory by setting `memory=True` and persistence via `@persist()`
- Supports tracing with `opentelemetry-instrumentation-crewai` for CrewAI-specific spans, as suggested in code review

Tests
An e2e test is added for the CrewAI Poem flow sample agent using the mock LLM server. The test case creates the agent resource, creates the mock LLM server using the mock response in `invoke_creawi_agent.json`, and tests synchronous, streaming, and persistence behavior for the agent. It requires the agent container to be built and pushed to the registry (by running `make poem-flow-sample` or the Dockerfile directly in `samples/crewai/poem_flow`).

The following changes are made to helper functions in the e2e test:
1. `runSyncTest` accepts an optional `contextID` to be included in the mock message to test session persistence
2. `runSyncTest` accepts an optional `useArtifacts` argument to indicate whether the expected output should be checked in the history messages or in the artifact returned by the A2A server, since the A2A protocol specifies that `Artifacts` are the standard way to convey the final outputs of a task
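The artifact-vs-history check can be sketched in Python (the actual helper is Go code in `go/test/e2e/invoke_api_test.go`; the field names below loosely follow the A2A task shape and are illustrative):

```python
from typing import Any


def response_contains(response: dict[str, Any], expected: str, use_artifacts: bool) -> bool:
    """Check whether the expected output appears in the task's artifacts
    (the standard place for final outputs) or in the message history."""
    if use_artifacts:
        parts = [p for a in response.get("artifacts", []) for p in a.get("parts", [])]
    else:
        parts = [p for m in response.get("history", []) for p in m.get("parts", [])]
    return any(expected in p.get("text", "") for p in parts)


# Illustrative A2A-style task: the final output lives in an artifact,
# while history holds the conversation messages.
task = {
    "artifacts": [{"parts": [{"text": "a poem about kubernetes"}]}],
    "history": [{"parts": [{"text": "write me a poem"}]}],
}
print(response_contains(task, "poem about", use_artifacts=True))   # → True
print(response_contains(task, "poem about", use_artifacts=False))  # → False
```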