docs: Add agentic RAG architecture documentation #701

manavgup · 2025-11-27T18:07:56Z

Summary

Add comprehensive architecture documentation for the Agentic RAG Platform. These documents
establish the design foundation for transforming RAG Modulo into a fully agentic system.

Documents Added (6 files, ~3,700 lines)

| Document | Description |
|---|---|
| `agentic-ui-architecture.md` | React component hierarchy, state management, API integration |
| `backend-architecture-diagram.md` | Backend architecture with Mermaid diagrams |
| `mcp-integration-architecture.md` | MCP client/server strategy, PR comparison |
| `rag-modulo-mcp-server-architecture.md` | RAG as MCP server (tools, resources, auth) |
| `search-agent-hooks-architecture.md` | 3-stage agent pipeline architecture |
| `system-architecture.md` | Complete system architecture overview |

Architecture Highlights

3-Stage Agent Pipeline

User Query → Pre-Search Agents → RAG Search → Post-Search Agents → Generation → Response Agents

Pre-search: Query expansion, translation, intent classification
Post-search: Re-ranking, deduplication, enrichment
Response: Artifact generation (PowerPoint, PDF, charts) in parallel
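
The flow above can be sketched as a small asyncio pipeline. This is only an illustration of the ordering (sequential pre-search and post-search agents, parallel response agents); `PipelineContext`, `run_pipeline`, and the stub agents are hypothetical names, not code from the architecture documents.

```python
import asyncio
from dataclasses import dataclass, field
from typing import Awaitable, Callable, Dict, List


@dataclass
class PipelineContext:
    """Hypothetical state object passed through every stage."""
    query: str
    documents: List[str] = field(default_factory=list)
    answer: str = ""
    artifacts: Dict[str, str] = field(default_factory=dict)


Agent = Callable[[PipelineContext], Awaitable[None]]


async def expand_query(ctx: PipelineContext) -> None:
    # Pre-search stub: stands in for query expansion.
    ctx.query = f"{ctx.query} (expanded)"


async def fake_search(ctx: PipelineContext) -> None:
    # Stand-in for the real RAG search; returns a duplicate on purpose.
    ctx.documents = ["doc-b", "doc-a", "doc-a"]


async def deduplicate(ctx: PipelineContext) -> None:
    # Post-search stub: drop duplicates while keeping order.
    ctx.documents = list(dict.fromkeys(ctx.documents))


async def generate(ctx: PipelineContext) -> None:
    # Stand-in for LLM generation.
    ctx.answer = f"Answer to '{ctx.query}' from {len(ctx.documents)} documents"


def make_artifact_agent(kind: str) -> Agent:
    # Response-stage stub: each agent writes one artifact under its own key.
    async def agent(ctx: PipelineContext) -> None:
        ctx.artifacts[kind] = f"{kind} for: {ctx.answer}"
    return agent


async def run_pipeline(
    query: str, pre: List[Agent], post: List[Agent], response: List[Agent]
) -> PipelineContext:
    ctx = PipelineContext(query=query)
    for agent in pre:            # pre-search agents run in order
        await agent(ctx)
    await fake_search(ctx)
    for agent in post:           # post-search agents run in order
        await agent(ctx)
    await generate(ctx)
    # Response agents run concurrently, as the summary describes.
    await asyncio.gather(*(agent(ctx) for agent in response))
    return ctx
```

Calling `asyncio.run(run_pipeline("what is RAG?", [expand_query], [deduplicate], [make_artifact_agent("pdf"), make_artifact_agent("chart")]))` returns a context with two deduplicated documents and one artifact per response agent.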

MCP Integration

Client: Consume external tools via Context Forge
Server: Expose rag_search, rag_ingest, etc. to Claude Desktop
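
To make the server role concrete, here is a minimal standard-library sketch of dispatching a JSON-RPC `tools/call` request to named tools. It does not use the MCP SDK and is not the server described in `rag-modulo-mcp-server-architecture.md`; the tool argument names (`query`, `top_k`, `collection`, `text`) and the placeholder results are assumptions for illustration.

```python
import json
from typing import Any, Callable, Dict

# Hypothetical in-process registry: tool name -> handler returning a text payload.
TOOLS: Dict[str, Callable[..., str]] = {}


def tool(name: str):
    """Register a function under a tool name."""
    def register(fn: Callable[..., str]) -> Callable[..., str]:
        TOOLS[name] = fn
        return fn
    return register


@tool("rag_search")
def rag_search(query: str, top_k: int = 3) -> str:
    # Placeholder results; a real server would call the search service.
    return json.dumps({"query": query, "results": [f"chunk-{i}" for i in range(top_k)]})


@tool("rag_ingest")
def rag_ingest(collection: str, text: str) -> str:
    # Placeholder; a real server would hand the text to the ingestion pipeline.
    return json.dumps({"collection": collection, "ingested_chars": len(text)})


def handle_request(request: Dict[str, Any]) -> Dict[str, Any]:
    """Dispatch one JSON-RPC 2.0 request of method 'tools/call'."""
    req_id = request.get("id")
    if request.get("method") != "tools/call":
        return {"jsonrpc": "2.0", "id": req_id,
                "error": {"code": -32601, "message": "Method not found"}}
    params = request.get("params", {})
    name = params.get("name")
    fn = TOOLS.get(name)
    if fn is None:
        return {"jsonrpc": "2.0", "id": req_id,
                "error": {"code": -32602, "message": f"Unknown tool: {name}"}}
    text = fn(**params.get("arguments", {}))
    # Tool output is wrapped as a list of text content items.
    return {"jsonrpc": "2.0", "id": req_id,
            "result": {"content": [{"type": "text", "text": text}]}}
```

A client such as Claude Desktop would reach these tools through a transport layer (stdio or HTTP), which this sketch omits.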

Implementation Roadmap

These documents guide:

PR #695, "feat: SPIFFE/SPIRE Integration Architecture for Agent Identity" (SPIFFE/SPIRE agent identity)
PR #671, "feat(mcp): Add MCP Gateway integration for tool invocation and enrichment" (MCP Gateway client)
Issue #697, "feat: Implement SearchService 3-stage agent execution hooks" (Agent execution hooks)
Issue #698, "feat: Expose RAG Modulo as MCP Server" (MCP Server)
Issue #699, "feat: Agentic UI components for agent configuration and artifacts" (Agentic UI)

Test Plan

All markdown files lint-clean
Only documentation files in this PR (no code changes)
Team review for architectural decisions

Closes #696

🤖 Generated with Claude Code

Add comprehensive architecture documentation for the Agentic RAG Platform:

- agentic-ui-architecture.md: React component hierarchy, state management, and API integration for agent features
- backend-architecture-diagram.md: Overall backend architecture with Mermaid diagrams showing service layers and data flow
- mcp-integration-architecture.md: MCP client/server integration strategy, PR comparison (#671 vs #684), and Context Forge integration
- rag-modulo-mcp-server-architecture.md: Exposing RAG capabilities as MCP server with tools (rag_search, rag_ingest, etc.) and resources
- search-agent-hooks-architecture.md: 3-stage agent pipeline (pre-search, post-search, response) with database schema and execution flow
- system-architecture.md: Complete system architecture overview with technology stack and data flows

These documents guide implementation of:

- PR #695 (SPIFFE/SPIRE agent identity)
- PR #671 (MCP Gateway client)
- Issue #697 (Agent execution hooks)
- Issue #698 (MCP Server)
- Issue #699 (Agentic UI)

Closes #696

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

github-actions · 2025-11-27T18:08:10Z

🚀 Development Environment Options

This repository supports Dev Containers for a consistent development environment.

Option 1: GitHub Codespaces (Recommended)

Create a cloud-based development environment:

Click the green Code button above
Select the Codespaces tab
Click Create codespace on docs/agentic-architecture-696-clean
Wait 2-3 minutes for environment setup
Start coding with all tools pre-configured!

Option 2: VS Code Dev Containers (Local)

Use Dev Containers on your local machine:

Install Docker Desktop
Install VS Code
Install the Dev Containers extension
Clone this PR branch locally
Open in VS Code and click "Reopen in Container" when prompted

Option 3: Traditional Local Setup

Set up the development environment manually:

# Clone the repository
git clone https://github.com/manavgup/rag_modulo.git
cd rag_modulo
git checkout docs/agentic-architecture-696-clean

# Initialize development environment
make dev-init
make dev-build
make dev-up
make dev-validate

Available Commands

Once in your development environment:

make help           # Show all available commands
make dev-validate   # Validate environment setup
make test-atomic    # Run atomic tests
make test-unit      # Run unit tests
make lint           # Run linting

Services Available

When running make dev-up:

This automated message helps reviewers quickly set up the development environment.

github-actions · 2025-11-27T18:12:44Z

Code Review: Agentic RAG Architecture Documentation

I've completed a comprehensive review of PR #701, which adds ~3,700 lines of architecture documentation across 6 new files. This is a documentation-only PR with no code changes.

Summary

Comprehensive architecture documentation for transforming RAG Modulo into a fully agentic system.

Files Added:

agentic-ui-architecture.md (1,470 lines)
backend-architecture-diagram.md (517 lines)
mcp-integration-architecture.md (200 lines)
rag-modulo-mcp-server-architecture.md (689 lines)
search-agent-hooks-architecture.md (416 lines)
system-architecture.md (425 lines)

Strengths

Excellent Documentation Quality - Clear diagrams, comprehensive code examples, well-organized
Thorough 3-Stage Agent Pipeline - Pre-search, Post-search, Response agents with clear separation
Complete Component Specifications - TypeScript/Python interfaces, YAML specs, database schemas
Security & Identity Integration - SPIFFE/SPIRE, CBAC, authentication flows documented
Implementation Guidance - Links to related PRs, phased roadmap

Areas for Improvement

1. Frontend Component Complexity

Proposes 22 new React components - recommend MVP subset (5-8 components)
Risk: Frontend could balloon to 5,000+ lines
Suggest phased approach: Core display → Config UI → Advanced features

2. MCP Integration Complexity

Dual MCP roles (client + server) adds complexity
Recommend: Prioritize MCP server first (~40% scope reduction)
Server enables Claude Desktop integration immediately

3. Database Schema Evolution

New tables (agent_configs, collection_agents) lack migration strategy
Missing: Alembic migration script, rollback plan, backward compatibility

4. Error Handling

High-level strategy mentioned but not fully specified
Need: Retry policies, circuit breaker thresholds, user notification strategy
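
As one possible shape for the circuit breaker thresholds requested here, the sketch below disables an agent after a number of consecutive failures and allows a single trial call once a cool-down has elapsed. The class name, the default threshold of 3, and the 30-second cool-down are assumptions, not values from the reviewed documents.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class CircuitOpenError(RuntimeError):
    """Raised when a call is rejected because the breaker is open."""


class CircuitBreaker:
    def __init__(
        self,
        failure_threshold: int = 3,        # assumed default
        reset_after_s: float = 30.0,       # assumed default
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.failure_threshold = failure_threshold
        self.reset_after_s = reset_after_s
        self._clock = clock
        self._failures = 0
        self._opened_at: Optional[float] = None

    def call(self, fn: Callable[[], T]) -> T:
        if self._opened_at is not None:
            if self._clock() - self._opened_at < self.reset_after_s:
                raise CircuitOpenError("agent temporarily disabled")
            # Cool-down elapsed: allow one trial call (half-open state).
            # A single further failure will reopen the breaker.
            self._opened_at = None
            self._failures = self.failure_threshold - 1
        try:
            result = fn()
        except Exception:
            self._failures += 1
            if self._failures >= self.failure_threshold:
                self._opened_at = self._clock()
            raise
        self._failures = 0  # any success closes the breaker fully
        return result
```

Wrapping each agent invocation in `breaker.call(...)` gives the user-notification layer a single exception type, `CircuitOpenError`, to translate into a "feature temporarily unavailable" message.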

5. Performance Concerns

3-stage pipeline adds latency without quantification
Need: Latency budgets, SLOs, performance testing plan
Suggest: Pre-search <500ms, Post-search <1s, Response <5s
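
The suggested budgets could be enforced per stage with `asyncio.wait_for`, falling back to the stage's unmodified input when the budget is exceeded. This is a sketch under that assumption; `STAGE_BUDGETS_S` and `run_with_budget` are hypothetical names, and the fallback policy is one choice among several.

```python
import asyncio
from typing import Any, Awaitable, Dict, Optional

# Budgets taken from the suggestion above, in seconds.
STAGE_BUDGETS_S: Dict[str, float] = {
    "pre_search": 0.5,
    "post_search": 1.0,
    "response": 5.0,
}


async def run_with_budget(
    stage: str,
    work: Awaitable[Any],
    fallback: Any,
    budgets: Optional[Dict[str, float]] = None,
) -> Any:
    """Await `work`, but return `fallback` if the stage budget is exceeded."""
    limits = STAGE_BUDGETS_S if budgets is None else budgets
    try:
        return await asyncio.wait_for(work, timeout=limits[stage])
    except asyncio.TimeoutError:
        # wait_for cancels the work on timeout; the pipeline continues
        # with the fallback value instead of failing the whole request.
        return fallback
```

For a pre-search agent, the fallback would typically be the original query, so a slow expansion step degrades quality rather than availability.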

6. Missing Sections

Testing strategy (E2E, mocks, integration tests)
Observability details (span names, metrics, dashboards)
Agent marketplace (discovery, versioning, quality review)
Deployment strategy (rollout plan, feature flags)

Potential Issues

Agent Context Overhead - Passing full conversation history + documents could consume significant memory
Agent Chaining - Sequential execution means one failure blocks chain, compounds latency
Identity Complexity - Two identity systems (SPIFFE + Bearer tokens) complex to implement
Frontend State - Real-time updates, WebSocket management not specified
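
One way to keep a single failing agent from blocking the chain is to treat each agent as optional: on error, log it and pass the previous value through. The sketch below assumes agents are plain functions from value to value; `run_chain` is a hypothetical helper, not part of the documented design.

```python
import logging
from typing import Any, Callable, List, Tuple

logger = logging.getLogger("agent_chain")


def run_chain(value: Any, agents: List[Callable[[Any], Any]]) -> Tuple[Any, List[str]]:
    """Run agents in order; a failing agent is skipped instead of aborting."""
    skipped: List[str] = []
    for agent in agents:
        try:
            value = agent(value)
        except Exception:
            # Keep the last good value and record which agent was skipped.
            logger.exception("agent %s failed; passing input through", agent.__name__)
            skipped.append(agent.__name__)
    return value, skipped
```

Returning the skipped names lets the caller surface degraded results to the user. This addresses the blocking concern only; the compounded latency of sequential execution remains.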

Code Quality Assessment

| Category | Rating | Notes |
|---|---|---|
| Documentation | 5/5 | Extremely thorough |
| Architecture | 4/5 | Minor complexity concerns |
| Implementation | 4/5 | Missing testing/deployment |
| Security | 4/5 | SPIFFE solid, auth complexity |
| Performance | 3/5 | Needs SLOs |
| Error Handling | 3/5 | Lacks concrete policies |

Overall: 4/5 - Excellent foundation, needs refinement

Recommendations

Priority 1: Simplify MVP scope (remove ~40% of components)
Priority 2: Add implementation checklists
Priority 3: Specify performance budgets and SLOs
Priority 4: Document rollout strategy

Final Verdict

APPROVE with recommendations for refinement before implementation.

Excellent architectural foundation with thorough documentation. Before starting implementation:

Simplify MVP scope
Add performance budgets and SLOs
Specify error handling policies
Include database migration strategy
Add testing and deployment checklists

Estimated effort: MVP 4-6 weeks, Full system 10-12 weeks

Related PRs: #695, #671, #697-699
Closes: #696

Great work!

manavgup added documentation Improvements or additions to documentation priority:high High priority - important for release architecture Architectural decisions and design choices labels Nov 27, 2025

manavgup merged commit 6701795 into main Nov 27, 2025
22 checks passed

manavgup deleted the docs/agentic-architecture-696-clean branch November 27, 2025 18:10
