RAG Modulo

A production-ready, modular Retrieval-Augmented Generation (RAG) platform with Chain of Thought reasoning, multi-LLM support, and enterprise-grade features

🚀 Quick Start • 📚 Documentation • 🛠️ Development • ✨ Features • 🤝 Contributing

🎯 What is RAG Modulo?

RAG Modulo is a production-ready Retrieval-Augmented Generation platform that provides enterprise-grade document processing, intelligent search, and AI-powered question answering with advanced Chain of Thought (CoT) reasoning. Built with modern technologies and designed for scalability, it supports multiple vector databases (Milvus, Elasticsearch, Pinecone, Weaviate, ChromaDB), LLM providers (WatsonX, OpenAI, Anthropic), and document formats including enhanced support via IBM Docling integration.

✨ Key Features

🧠 AI-Powered	🔍 Advanced Search	💬 Interactive UI	🚀 Production Ready
Chain of Thought reasoning Automatic pipeline resolution Multi-LLM provider support Token tracking & monitoring	Vector similarity search Hybrid search strategies Intelligent source attribution Auto-generated suggestions	Modern React interface Real-time document upload Podcast generation Voice preview features	Docker + GHCR images Multi-stage CI/CD Security scanning 947 automated tests

🎨 Frontend Features

Modern UI: React 18 with Tailwind CSS and Carbon Design System principles
Reusable Component Library: 8 accessible, type-safe components with consistent design patterns
Enhanced Search: Interactive chat interface with Chain of Thought reasoning visualization
Document Management: Real-time file upload with drag-and-drop support and validation
Smart Display: Document source attribution with chunk-level page references
Podcast Generation: AI-powered podcast creation with voice preview
Question Suggestions: Intelligent query recommendations based on collection content

🎉 Current Status: Production Ready

Component	Status	Progress
🏗️ Infrastructure	✅ Production Ready	Docker + GHCR + Cloud Deployment
🧪 Testing	✅ Comprehensive	947 tests (atomic, unit, integration, API)
🚀 Core Services	✅ Fully Operational	26+ services with DI pattern
📚 Documentation	✅ Extensive	API docs, guides, deployment
🔧 Development	✅ Optimized	Containerless local dev workflow
🔒 Security	✅ Hardened	Multi-layer scanning (Trivy, Bandit, Gitleaks)

🎉 Recent Major Improvements

Feature	Description	Benefit
🧠 Chain of Thought	Automatic question decomposition with step-by-step reasoning	40%+ better answer quality on complex queries
⚡ Auto Pipeline Resolution	Zero-config search - backend handles pipeline selection	Simplified API, reduced client complexity
🎨 Reusable UI Components	8 accessible, type-safe components (Button, Input, Modal, etc.)	44% code reduction, consistent design system
🔒 Security Hardening	Multi-layer scanning (Trivy, Bandit, Gitleaks, Semgrep)	Production-grade security posture
🚀 Containerless Dev	Local development without containers	10x faster iteration, instant hot-reload
📄 IBM Docling	Enhanced document processing for complex formats	Better PDF/DOCX/XLSX handling
🎙️ Podcast Generation	AI-powered podcast creation with voice preview	Interactive content from documents
💡 Smart Suggestions	Auto-generated relevant questions	Improved user experience and discovery
📦 GHCR Images	Pre-built production images	Faster deployments, consistent environments

🚀 Quick Start

Prerequisites

Tool	Version	Installation	Purpose
Python	3.12+	`brew install python@3.12` (macOS) `sudo add-apt-repository ppa:deadsnakes/ppa && sudo apt update && sudo apt install python3.12 python3.12-venv` (Ubuntu 22.04) `apt install python3.12` (Ubuntu 24.04+)	Backend runtime
Poetry	Latest	`make venv` (auto-installs) or `curl -sSL https://install.python-poetry.org \| python3 -`	Python dependency management
Node.js	18+	`brew install node` (macOS) `nvm install 18` (Linux/macOS)	Frontend development
Docker	Latest	Docker Desktop	Infrastructure containers
Docker Compose	V2	Included with Docker Desktop `apt install docker-compose-plugin` (Linux)	Service orchestration

📋 Verify Installation: Run make check-docker to verify Docker requirements

Option 1: Local Development (⚡ Fastest - Recommended)

Best for: Daily development, feature work, rapid iteration

🎯 Step-by-Step Setup

# 1. Clone repository
git clone https://github.com/manavgup/rag-modulo.git
cd rag-modulo

# 2. Set up environment variables
cp env.example .env
# Edit .env with your API keys (WatsonX, OpenAI, Anthropic)

# 3. Install all dependencies (one command!)
make local-dev-setup
# This creates Python venv + installs backend (Poetry) + frontend (npm)

# 4. Start infrastructure only (Postgres, Milvus, MinIO, MLFlow)
make local-dev-infra

# 5a. Option A: Start backend and frontend in separate terminals
# Terminal 1:
make local-dev-backend

# Terminal 2:
make local-dev-frontend

# 5b. Option B: Start everything in background (easier!)
make local-dev-all

🔍 What Gets Installed

Backend (Python/Poetry):

FastAPI and dependencies
LLM providers (WatsonX, OpenAI, Anthropic)
Vector DB clients (Milvus, Elasticsearch, etc.)
Testing frameworks (pytest, coverage)
Code quality tools (ruff, mypy, bandit)

Frontend (npm):

React 18 + Vite
Tailwind CSS + Carbon Design
TypeScript dependencies
Testing libraries

Infrastructure (Docker):

PostgreSQL (metadata)
Milvus (vector storage)
MinIO (object storage)
MLFlow (model tracking)

Access Points:

🌐 Frontend: http://localhost:3000
🔧 Backend API: http://localhost:8000/docs (Swagger UI)
📊 MLFlow: http://localhost:5001
💾 MinIO Console: http://localhost:9001

Benefits:

⚡ Instant reload - Python/React changes reflected immediately (no container rebuilds)
🐛 Native debugging - Use PyCharm, VS Code debugger with breakpoints
📦 Local caching - Poetry/npm caches work natively for faster dependency installs
🔥 Fastest iteration - Pre-commit hooks optimized (fast on commit, comprehensive on push)

When to use:

✅ Daily development work
✅ Feature development and bug fixes
✅ Rapid iteration and testing
✅ Debugging with breakpoints

Option 2: Production Mode (🐳 Docker)

Best for: Production-like testing, deployment validation

# Clone repository
git clone https://github.com/manavgup/rag-modulo.git
cd rag-modulo

# Set up environment
cp env.example .env
# Edit .env with your API keys

# Start with pre-built images from GHCR
make run-ghcr

# OR build and run locally
make build-all-local
docker compose up -d

When to use:

✅ Testing production configurations
✅ Validating Docker builds
✅ Deployment rehearsal
✅ Performance benchmarking

Option 3: GitHub Codespaces (☁️ Cloud)

Best for: Quick experimentation, onboarding, cloud development

Go to repository → "Code" → "Codespaces"
Click "Create codespace" on your branch
Start coding in browser-based VS Code
Run: make venv && make run-infra

When to use:

✅ No local setup required
✅ Consistent development environment
✅ Work from any device
✅ Team onboarding

🏗️ Architecture Overview

RAG Modulo follows a modern, service-based architecture with clear separation of concerns:

graph TB
    subgraph "Frontend Layer"
        UI[React Web UI]
        CLI[Command Line Interface]
    end

    subgraph "API Layer"
        API[FastAPI Backend]
        AUTH[OIDC Authentication]
    end

    subgraph "Service Layer"
        SEARCH[Search Service]
        CONV[Conversation Service]
        TOKEN[Token Tracking]
        COT[Chain of Thought]
    end

    subgraph "Data Layer"
        VDB[(Vector Database)]
        PG[(PostgreSQL)]
        MINIO[(MinIO Storage)]
    end

    subgraph "External Services"
        LLM[LLM Providers]
        EMB[Embedding Models]
    end

    UI --> API
    CLI --> API
    API --> SEARCH
    API --> CONV
    API --> TOKEN
    API --> COT
    SEARCH --> VDB
    SEARCH --> PG
    CONV --> LLM
    TOKEN --> PG
    COT --> LLM
    API --> MINIO

🛠️ Development Workflow

🎯 Recommended Daily Workflow

Philosophy: Develop locally without containers for maximum speed, use containers only for infrastructure and production.

💻 Daily Development Routine

# ============================================================
# ONE-TIME SETUP (First Day Only)
# ============================================================
make local-dev-setup    # Creates venv + installs all dependencies

# ============================================================
# DAILY ROUTINE (Every Day)
# ============================================================

# Option A: Manual Start (Recommended for debugging)
# ---------------------------------------------------
make local-dev-infra              # Start infrastructure

# Terminal 1: Backend
make local-dev-backend            # Auto-reload enabled

# Terminal 2: Frontend
make local-dev-frontend           # Hot Module Replacement (HMR)

# Option B: Background Start (Faster)
# ---------------------------------------------------
make local-dev-all                # Everything runs in background
make local-dev-status             # Check what's running

# ============================================================
# DEVELOPMENT CYCLE
# ============================================================
# 1. Make code changes in your editor
# 2. See changes instantly (auto-reload for both backend/frontend)
# 3. Test via http://localhost:3000
# 4. Run quality checks before commit
make quick-check

# ============================================================
# END OF DAY
# ============================================================
make local-dev-stop               # Stop all services

⚡ One-Command Quick Start

For the impatient developer:

# First time ever
make local-dev-setup && make local-dev-all

# Every day after
make local-dev-all

Done! Services running at:

🔧 Essential Commands Reference

Command	What It Does	When to Use
`make venv`	Creates Python virtual environment (auto-installs Poetry if missing)	First time setup or after clean
`make local-dev-setup`	Installs ALL dependencies (backend + frontend)	First time setup or dependency updates
`make local-dev-infra`	Starts infrastructure only (Postgres, Milvus, MinIO, MLFlow)	Every day before development
`make local-dev-all`	Starts full stack in background	Quick startup, background development
`make local-dev-status`	Shows what's running (infrastructure + backend + frontend)	Verify services are up
`make local-dev-stop`	Stops ALL services (infra + backend + frontend)	End of day cleanup
`make quick-check`	Fast lint + format validation	Before every commit

🧪 Testing & Quality

🏃 Fast Testing (No Containers)

# Atomic tests (fastest - no DB, no coverage)
make test-atomic

# Unit tests (mocked dependencies)
make test-unit-fast

# All local tests
make test-atomic && make test-unit-fast

🔗 Integration Testing (Requires Infrastructure)

# Ensure infrastructure is running
make local-dev-infra

# Run integration tests
make test-integration

# Full test suite
make test-all    # atomic + unit + integration

🎨 Code Quality Checks

make quick-check       # Fast: format + lint (use before commit)
make lint              # Full linting (ruff + mypy)
make format            # Auto-fix formatting issues
make security-check    # Security scans (bandit + safety)
make coverage          # Generate coverage report (60% threshold)

🐳 Production & Container Workflows

🏭 Production Deployment

Only for production-like testing or deployment validation:

# Build production images locally
make build-all

# Start production environment
make prod-start

# Check status
make prod-status

# View logs
make prod-logs

# Stop production
make prod-stop

📦 Pre-built Images (Fastest)

Use pre-built images from GitHub Container Registry:

# Pull latest images and start
docker compose -f docker-compose.production.yml pull
make prod-start

Available images:

ghcr.io/manavgup/rag_modulo/backend:latest
ghcr.io/manavgup/rag_modulo/frontend:latest

📊 Features & Capabilities

🧠 Advanced AI Features

Chain of Thought Reasoning: Automatic question decomposition with step-by-step reasoning, iterative context building, and transparent reasoning visualization
Automatic Pipeline Resolution: Zero-config search experience - backend automatically selects and creates pipelines based on user context
Token Tracking & Monitoring: Real-time usage tracking across all LLM interactions with detailed breakdowns
Multi-LLM Support: Seamless switching between WatsonX, OpenAI, and Anthropic with provider-specific optimizations
IBM Docling Integration: Enhanced document processing for complex formats (PDF, DOCX, XLSX)
Question Suggestions: AI-generated relevant questions based on document collection content

🔍 Search & Retrieval

Vector Databases: Pluggable support for Milvus (default), Elasticsearch, Pinecone, Weaviate, ChromaDB via common interface
Hybrid Search: Combines semantic vector similarity with keyword search strategies
Source Attribution: Granular document source tracking with chunk-level page references across reasoning steps
Advanced Chunking: Hierarchical chunking strategies with configurable size and overlap
Conversation History: Context-aware search with conversation memory for multi-turn interactions

🏗️ Architecture & Scalability

Service-Based Design: 26+ services with clean separation of concerns and dependency injection pattern
Repository Pattern: Data access abstraction layer for improved testability and maintainability
Asynchronous Operations: Async/await throughout for efficient concurrent request handling
Production Deployment: Docker + GHCR images, multi-stage builds, cloud-ready (AWS, Azure, GCP, IBM Cloud)
Modular Design: Pluggable components for vector DBs, LLM providers, embedding models

🧪 Testing & Quality Assurance

Comprehensive Test Suite: 947 automated tests across all layers (atomic, unit, integration, API, E2E)
Multi-Layer Testing Strategy:
- Atomic tests for schemas and data structures
- Unit tests for business logic and services
- Integration tests for service interactions
- API tests for endpoint validation
Security Scanning: Multi-layer security with Trivy (containers), Bandit (Python), Gitleaks (secrets), Semgrep (SAST)
Code Quality: Ruff linting, MyPy type checking, Pylint analysis, pre-commit hooks
CI/CD Pipeline: Multi-stage GitHub Actions with test isolation, builds, and comprehensive integration testing

📚 Documentation

📖 Complete Documentation

📚 Full Documentation - Comprehensive guides and API reference
🚀 Getting Started - Quick start guide
🛠️ Development Guide - Development workflow and best practices
🧪 Testing Guide - Testing strategies and execution
🚀 Deployment Guide - Production deployment instructions

🔧 Configuration & Tools

⚙️ Configuration Guide - Environment setup and configuration
🔌 API Reference - Complete API documentation
🖥️ CLI Documentation - Command-line interface guide
🔐 Secret Management - Comprehensive guide for safe secret handling

🛠️ Command-Line Interface (CLI)

RAG Modulo includes a powerful CLI for interacting with the system:

# After installation, use the CLI commands:
rag-cli --help          # Main CLI help
rag-search              # Search operations
rag-admin               # Administrative tasks

# Example: Search a collection
rag-cli search query <collection-id> "your question here"

# Create a collection
rag-cli collection create --name "My Docs"

# Upload documents
rag-cli collection upload <collection-id> path/to/documents/

🚀 Deployment & Packaging

Production Deployment

RAG Modulo supports multiple deployment strategies:

1. Docker Compose (Recommended)

# Start production environment (all containers)
make prod-start

# Check status
make prod-status

# View logs
make prod-logs

# Stop production environment
make prod-stop

2. Pre-built Images from GHCR

# Pull and run latest images from GitHub Container Registry
make run-ghcr

Available Images:

ghcr.io/manavgup/rag_modulo/backend:latest
ghcr.io/manavgup/rag_modulo/frontend:latest

3. Custom Docker Deployment

# Build local images
make build-all

# Start services
make run-app

Cloud Deployment Options

AWS Deployment

ECS (Elastic Container Service): Use docker-compose.production.yml
EKS (Kubernetes): Deploy with Kubernetes manifests
EC2: Docker Compose or standalone containers
Lambda: Serverless functions for specific services

Azure Deployment

Azure Container Instances: Quick container deployment
AKS (Azure Kubernetes Service): Production-grade orchestration
Azure Container Apps: Serverless container hosting

Google Cloud Deployment

Cloud Run: Fully managed serverless platform
GKE (Google Kubernetes Engine): Kubernetes orchestration
Compute Engine: VM-based deployment with Docker

IBM Cloud Deployment

Code Engine: Serverless container platform
IKS (IBM Kubernetes Service): Enterprise Kubernetes
Red Hat OpenShift: Advanced container platform

Kubernetes Deployment

# Apply Kubernetes manifests
kubectl apply -f deployment/k8s/

# Or deploy with Helm (if charts exist)
helm install rag-modulo ./charts/rag-modulo

🔄 CI/CD Pipeline

GitHub Actions Workflows

RAG Modulo uses a comprehensive CI/CD pipeline with multiple stages:

1. Code Quality & Testing (`.github/workflows/ci.yml`)

Triggers: Push to main, Pull Requests

Stages:

Lint and Unit Tests (No infrastructure)
- Ruff linting (120 char line length)
- MyPy type checking
- Unit tests with pytest
- Fast feedback (~5-10 minutes)
Build Docker Images
- Backend image build
- Frontend image build
- Push to GitHub Container Registry (GHCR)
- Tagged with: latest, sha-<commit>, branch name
Integration Tests
- Full stack deployment
- PostgreSQL, Milvus, MLFlow, MinIO
- API tests, integration tests
- End-to-end validation

Status Badges:

[![CI Pipeline](https://github.com/manavgup/rag_modulo/workflows/CI/badge.svg)](https://github.com/manavgup/rag_modulo/actions)

2. Security Scanning (`.github/workflows/02-security.yml`)

Triggers: Push to main, Pull Requests

Secret Detection (3-Layer Defense):

Pre-commit hooks: detect-secrets with baseline (< 1 sec)
Local testing: Gitleaks via make pre-commit-run (~1-2 sec)
CI/CD: Gitleaks + TruffleHog (~45 sec)

Scans:

Gitleaks: Pattern-based secret scanning with custom rules (.gitleaks.toml)
TruffleHog: Entropy-based + verified secret detection
Trivy: Container vulnerability scanning
Bandit: Python security linting
Safety: Python dependency vulnerabilities
Semgrep: SAST code analysis

⚠️ IMPORTANT: CI now fails on ANY secret detection (no continue-on-error). This ensures no secrets make their way to the repository.

Supported Secret Types:

Cloud: AWS, Azure, GCP keys
LLM: OpenAI, Anthropic, WatsonX, Gemini API keys
Infrastructure: PostgreSQL, MinIO, MLFlow, JWT secrets
Version Control: GitHub/GitLab tokens
Generic: High-entropy strings, private keys

See: docs/development/secret-management.md for comprehensive guide

3. Documentation (`.github/workflows/docs.yml`)

Triggers: Push to main, Pull Requests to docs/

Actions:

Build MkDocs site
Deploy to GitHub Pages
API documentation generation

Local CI Validation

Test CI pipeline locally before pushing:

# Run same checks as CI
make ci-local

# Validate CI workflows
make validate-ci

# Security checks
make security-check
make scan-secrets

Pre-commit Hooks

Optimized for developer velocity:

On Commit (fast, 5-10 sec):

Ruff formatting
Trailing whitespace
YAML syntax
File size limits

On Push (slow, 30-60 sec):

MyPy type checking
Pylint analysis
Security scans
Strangler pattern checks

In CI (comprehensive):

All checks run regardless
Ensures quality gates

Container Registry

GitHub Container Registry (GHCR):

Automatic image builds on push
Multi-architecture support (amd64, arm64)
Image signing and verification
Retention policies

Image Tags:

latest: Latest main branch build
sha-<commit>: Specific commit
<branch>: Branch-specific builds
v<version>: Release tags

🧪 Testing

Test Categories

Category	Count	Description	Command
⚡ Atomic Tests	100+	Schema validation, data structures	`pytest -m atomic`
🏃 Unit Tests	83+	Service logic, business rules	`make test-unit-fast`
🔗 Integration Tests	43+	Service interactions, DB integration	`make test-integration`
🔌 API Tests	21+	Endpoint validation, request/response	`pytest -m api`
🌐 E2E Tests	22+	Full workflow scenarios	`pytest -m e2e`
📊 Total	947	Complete test coverage	`make test-all`

Running Tests

# Fast local testing (no containers, recommended for development)
make test-unit-fast

# Specific test categories
make test-atomic       # Schema and data structure tests
make test-integration  # Service integration tests (requires infrastructure)
make test-api          # API endpoint tests

# Full test suite with coverage
make coverage

# Run specific test file
make test testfile=tests/unit/test_search_service.py

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

Development Guidelines

Service Layer Architecture - Follow service-based patterns
Code Quality - Use type hints, comprehensive docstrings, PEP 8
Testing - Write tests for all new features
Documentation - Update docs for any changes

Contribution Process

Fork and Clone the repository
Create Feature Branch from main
Make Changes following our guidelines
Run Tests and ensure they pass
Submit Pull Request with clear description

📈 Roadmap

✅ Phase 1: Foundation (Completed)

Service-based architecture with 26+ services
Comprehensive test infrastructure (947 tests)
Multi-LLM provider support (WatsonX, OpenAI, Anthropic)
Vector database abstraction layer
CI/CD pipeline with security scanning

✅ Phase 2: Advanced Features (Completed)

Chain of Thought (CoT) reasoning system
Automatic pipeline resolution
Token tracking and monitoring
IBM Docling integration
Podcast generation with voice preview
Question suggestion system
Containerless local development workflow

🔄 Phase 3: Production Enhancement (Current)

Production deployment with GHCR images
Multi-stage Docker builds
Security hardening (Trivy, Bandit, Gitleaks, Semgrep)
Enhanced monitoring and observability
Performance optimization and caching
Authentication system improvements (OIDC)

🚀 Phase 4: Enterprise Features (Next)

🔮 Phase 5: Innovation (Future)

Multi-modal support (image, audio)
Agentic AI workflows
Real-time collaborative features
Advanced reasoning strategies
Federated learning support

🆘 Troubleshooting

Quick Diagnostic Commands

Run these first to identify issues:

make check-docker          # Verify Docker installation
make local-dev-status      # Check what's running
make venv                  # Ensure Python environment is set up
docker compose -f docker-compose-infra.yml ps  # Check infrastructure

Common Issues & Solutions

🚨 Quick Fixes (Try These First)

Most Common Problems:

# Problem: "Command not found" errors
make venv                    # Creates Python venv (auto-installs Poetry)
source backend/.venv/bin/activate

# Problem: Services won't start
make local-dev-stop          # Stop everything cleanly
make check-docker            # Verify Docker requirements
make local-dev-infra         # Restart infrastructure only

# Problem: Port conflicts
make local-dev-stop          # Stops all services and containers
lsof -i :8000 && kill $(lsof -t -i:8000)  # Kill backend on port 8000
lsof -i :3000 && kill $(lsof -t -i:3000)  # Kill frontend on port 3000

# Problem: Weird behavior after updates
make clean-venv              # Remove Python venv
make local-dev-setup         # Fresh install
make clean-all               # Nuclear option: clean everything

🐍 Python & Virtual Environment Issues

Poetry Not Installed

# Makefile auto-installs Poetry (recommended)
make venv

# OR install manually
curl -sSL https://install.python-poetry.org | python3 -

Wrong Python Version

# Check Python version
python --version    # Should be 3.12+

# macOS: Install Python 3.12
brew install python@3.12

# Ubuntu/Debian: Install Python 3.12
sudo apt update
sudo apt install python3.12 python3.12-venv

Dependencies Not Installing

# Method 1: Use Makefile (recommended)
make clean-venv
make local-dev-setup

# Method 2: Manual Poetry setup
cd backend
poetry config virtualenvs.in-project true
poetry cache clear . --all
poetry install --with dev,test --sync
source .venv/bin/activate

Wrong Tool Versions (e.g., Ruff 0.5.7 instead of 0.14.0)

# Ensure you're in Poetry venv
cd backend
source .venv/bin/activate
which python    # Should show: backend/.venv/bin/python
which ruff      # Should show: backend/.venv/bin/ruff
ruff --version  # Should be 0.14.0+

# If wrong version, reinstall
poetry install --with dev,test --sync

Import Errors

# Verify PYTHONPATH and reinstall
cd backend
source .venv/bin/activate
poetry install --with dev,test --sync
python -c "import sys; print('\n'.join(sys.path))"

🐳 Docker & Container Issues

Docker Not Installed

# Check if Docker is installed
docker --version
docker compose version

# If missing, install:
# macOS: brew install --cask docker
# Or download Docker Desktop: https://www.docker.com/products/docker-desktop

Docker Compose V2 Missing

# Ubuntu/Debian
sudo apt update
sudo apt install docker-compose-plugin

# Verify
docker compose version

Infrastructure Services Fail to Start

# Clean restart
make local-dev-stop
docker system prune -f
make local-dev-infra

# Check what failed
docker compose -f docker-compose-infra.yml ps
docker compose -f docker-compose-infra.yml logs

# Common fixes
docker volume prune -f        # Clear old volumes
make clean-all                # Nuclear option

Port Already in Use

# Find what's using ports
lsof -i :8000   # Backend
lsof -i :3000   # Frontend
lsof -i :5432   # PostgreSQL
lsof -i :19530  # Milvus

# Stop conflicting services
make local-dev-stop

# Kill specific port (if needed)
kill $(lsof -t -i:8000)

Container Permission Issues

# Fix volume permissions
chmod -R 777 ./volumes
docker compose -f docker-compose-infra.yml down -v
make local-dev-infra

🧪 Testing Issues

Tests Fail to Run

# Ensure venv is activated
source backend/.venv/bin/activate

# Install test dependencies
cd backend
poetry install --with test --sync

# Run specific test with verbose output
pytest tests/unit/test_example.py -vv -s

Database Connection Errors in Tests

# Ensure infrastructure is running
make local-dev-infra
make local-dev-status

# Wait for services to be ready
sleep 10

# Run tests
make test-integration

Import Errors in Tests

# Check PYTHONPATH
cd backend
source .venv/bin/activate
pytest tests/ --collect-only  # See what tests are collected

# If imports fail, reinstall
poetry install --with dev,test --sync

🔐 Authentication & API Issues

Login Fails

Development Mode (bypass authentication):

# Add to .env
SKIP_AUTH=true
DEVELOPMENT_MODE=true
ENABLE_MOCK_AUTH=true

# Restart backend
make local-dev-stop
make local-dev-backend

Production Mode (fix OIDC):

Verify .env has correct OIDC settings
Check IBM Cloud credentials are valid
Ensure redirect URLs match your setup
Check logs: tail -f /tmp/rag-backend.log

API Returns 500 Errors

# Check backend logs
tail -f /tmp/rag-backend.log

# Restart backend with debug
cd backend
source .venv/bin/activate
LOG_LEVEL=DEBUG uvicorn main:app --reload --port 8000

📦 Frontend Issues

npm install Fails

# Clear cache and retry
cd frontend
rm -rf node_modules package-lock.json
npm cache clean --force
npm install

Frontend Won't Start

# Check Node version
node --version    # Should be 18+

# Update Node (macOS)
brew upgrade node

# Update Node (Linux - using nvm)
nvm install 18
nvm use 18

# Reinstall and start
cd frontend
rm -rf node_modules
npm install
npm run dev

Build Errors

# Clear build cache
cd frontend
rm -rf dist .vite
npm run dev

🔧 Performance Issues

Slow Backend Response

# Check if infrastructure is running locally
make local-dev-status

# Monitor resource usage
docker stats

# Check logs for bottlenecks
tail -f /tmp/rag-backend.log | grep -i "slow\|timeout\|error"

High Memory Usage

# Restart services to clear memory
make local-dev-stop
docker system prune -f
make local-dev-infra

🆘 Getting Help

Resource	When to Use	Link
📚 Full Documentation	Comprehensive guides and API reference	docs site
🐛 GitHub Issues	Report bugs or request features	issues
💬 Discussions	Ask questions, share ideas	discussions
📖 IMMEDIATE_FIX.md	Quick fixes for common development issues	See project root
🔧 CLAUDE.md	Development best practices and architecture	See project root

Still Stuck?

Check service status: make local-dev-status
View logs: tail -f /tmp/rag-backend.log or make logs
Try clean restart: make local-dev-stop && make clean && make local-dev-all
Nuclear option: make clean-all (removes everything - be careful!)
Ask for help: Open an issue with:
- Error message
- Output of make local-dev-status
- OS and Docker version
- Steps to reproduce

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

IBM Docling - Advanced document processing and understanding
IBM WatsonX - Enterprise AI foundation models
FastAPI - Modern, high-performance web framework
React - Powerful UI library for building interactive interfaces
Milvus - High-performance vector database
Docker - Containerization and deployment platform
All Contributors - Thank you for your contributions!

⬆ Back to Top

Made with ❤️ by the RAG Modulo Team

Name		Name	Last commit message	Last commit date
Latest commit History 1,068 Commits
.claude/agents		.claude/agents
.devcontainer		.devcontainer
.github		.github
.ralph		.ralph
ISSUES		ISSUES
backend		backend
deployment		deployment
docs		docs
experiments		experiments
frontend		frontend
scripts		scripts
specs		specs
tests		tests
.actrc		.actrc
.env.ci		.env.ci
.env.dev		.env.dev
.env.example		.env.example
.flake8		.flake8
.gitignore		.gitignore
.gitleaks.toml		.gitleaks.toml
.markdownlint.json		.markdownlint.json
.pre-commit-config.yaml		.pre-commit-config.yaml
.pylintrc		.pylintrc
.secrets.baseline		.secrets.baseline
.tool-versions		.tool-versions
.yamllint		.yamllint
AGENTS.md		AGENTS.md
AI_WORKFLOW_SETUP_COMPLETE.md		AI_WORKFLOW_SETUP_COMPLETE.md
ASYNC_TEST_FIX_GUIDE.md		ASYNC_TEST_FIX_GUIDE.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODE_QUALITY_GUIDE.md		CODE_QUALITY_GUIDE.md
CUSTOM_VOICE_IMPLEMENTATION_PROGRESS.md		CUSTOM_VOICE_IMPLEMENTATION_PROGRESS.md
DEFENSIVE_PROGRAMMING_AUDIT.md		DEFENSIVE_PROGRAMMING_AUDIT.md
DEPLOYMENT_TEST_RESULTS.md		DEPLOYMENT_TEST_RESULTS.md
DEVELOPMENT.md		DEVELOPMENT.md
Dockerfile.codeengine		Dockerfile.codeengine
FINAL_SESSION_SUMMARY.md		FINAL_SESSION_SUMMARY.md
FINAL_TEST_FIX_SUMMARY.md		FINAL_TEST_FIX_SUMMARY.md
IMPLEMENTATION_PLAN.md		IMPLEMENTATION_PLAN.md
INSTALLATION_TEST_RESULTS.md		INSTALLATION_TEST_RESULTS.md
LOCAL_CI.md		LOCAL_CI.md
Makefile		Makefile
PHASE1_PR1_SUMMARY.md		PHASE1_PR1_SUMMARY.md
PODCAST.md		PODCAST.md
PODCAST_FRONTEND_IMPLEMENTATION.md		PODCAST_FRONTEND_IMPLEMENTATION.md
PODCAST_IMPLEMENTATION_COMPLETE.md		PODCAST_IMPLEMENTATION_COMPLETE.md
PODCAST_IMPLEMENTATION_PLAN.md		PODCAST_IMPLEMENTATION_PLAN.md
PODCAST_PROMPT_FOR_TESTING.md		PODCAST_PROMPT_FOR_TESTING.md
PROMPT_FEATURE_DEVELOPMENT.md		PROMPT_FEATURE_DEVELOPMENT.md
PROMPT_ISSUES.md		PROMPT_ISSUES.md
PR_405_IMPLEMENTATION_SUMMARY.md		PR_405_IMPLEMENTATION_SUMMARY.md
PR_DETAILED_BREAKDOWN.md		PR_DETAILED_BREAKDOWN.md
PR_PLAN.md		PR_PLAN.md
PR_SUMMARY.md		PR_SUMMARY.md
README.md		README.md
SEAMLESS_INTEGRATION_EPIC_FOR_ISSUE_229.md		SEAMLESS_INTEGRATION_EPIC_FOR_ISSUE_229.md
SESSION_COMPLETE_OCT6.md		SESSION_COMPLETE_OCT6.md
SESSION_SUMMARY.md		SESSION_SUMMARY.md
TDD_GREEN_PHASE_PROGRESS.md		TDD_GREEN_PHASE_PROGRESS.md
TDD_GREEN_PHASE_PROGRESS_UPDATE.md		TDD_GREEN_PHASE_PROGRESS_UPDATE.md
TDD_RED_PHASE_SUMMARY.md		TDD_RED_PHASE_SUMMARY.md
UNIT_TEST_FIXES_SUMMARY.md		UNIT_TEST_FIXES_SUMMARY.md
US-1.2_IMPLEMENTATION_PLAN.md		US-1.2_IMPLEMENTATION_PLAN.md
add_reranking_template.py		add_reranking_template.py
api_tests.py		api_tests.py
ci_pipeline_issue.md		ci_pipeline_issue.md
docker-compose-ci.yml		docker-compose-ci.yml
docker-compose-e2e.yml		docker-compose-e2e.yml
docker-compose-infra.yml		docker-compose-infra.yml
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.hotreload.yml		docker-compose.hotreload.yml
docker-compose.production.yml		docker-compose.production.yml
docker-compose.test.yml		docker-compose.test.yml
docker-compose.yml		docker-compose.yml
env.example		env.example
generate_service_tests.py		generate_service_tests.py
http_ca.crt		http_ca.crt
issue_227_comprehensive_update.md		issue_227_comprehensive_update.md
mkdocs.yml		mkdocs.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
pytest-atomic.ini		pytest-atomic.ini
pytest.ini		pytest.ini
rag-cli		rag-cli
ralph-runner.sh		ralph-runner.sh
run_tests.sh		run_tests.sh
test_embeddings.py		test_embeddings.py
test_embeddings_simple.py		test_embeddings_simple.py
test_podcast_script_generation.py		test_podcast_script_generation.py

Uh oh!

Uh oh!

manavgup/rag_modulo

Folders and files

Latest commit

History

Repository files navigation

RAG Modulo

🎯 What is RAG Modulo?

✨ Key Features

🎨 Frontend Features

🎉 Current Status: Production Ready

🎉 Recent Major Improvements

🚀 Quick Start

Prerequisites

Option 1: Local Development (⚡ Fastest - Recommended)

Option 2: Production Mode (🐳 Docker)

Option 3: GitHub Codespaces (☁️ Cloud)

🏗️ Architecture Overview

🛠️ Development Workflow

🎯 Recommended Daily Workflow

🔧 Essential Commands Reference

🧪 Testing & Quality

🐳 Production & Container Workflows

📊 Features & Capabilities

🧠 Advanced AI Features

🔍 Search & Retrieval

🏗️ Architecture & Scalability

🧪 Testing & Quality Assurance

📚 Documentation

📖 Complete Documentation

🔧 Configuration & Tools

🛠️ Command-Line Interface (CLI)

🚀 Deployment & Packaging

Production Deployment

1. Docker Compose (Recommended)

2. Pre-built Images from GHCR

3. Custom Docker Deployment

Cloud Deployment Options

Kubernetes Deployment

🔄 CI/CD Pipeline

GitHub Actions Workflows

1. Code Quality & Testing (.github/workflows/ci.yml)

2. Security Scanning (.github/workflows/02-security.yml)

3. Documentation (.github/workflows/docs.yml)

Local CI Validation

Pre-commit Hooks

Container Registry

🧪 Testing

Test Categories

Running Tests

🤝 Contributing

Development Guidelines

Contribution Process

📈 Roadmap

✅ Phase 1: Foundation (Completed)

✅ Phase 2: Advanced Features (Completed)

🔄 Phase 3: Production Enhancement (Current)

🚀 Phase 4: Enterprise Features (Next)

🔮 Phase 5: Innovation (Future)

🆘 Troubleshooting

Quick Diagnostic Commands

Common Issues & Solutions

Poetry Not Installed

Wrong Python Version

Dependencies Not Installing

Wrong Tool Versions (e.g., Ruff 0.5.7 instead of 0.14.0)

Import Errors

Docker Not Installed

Docker Compose V2 Missing

Infrastructure Services Fail to Start

Port Already in Use

Container Permission Issues

Tests Fail to Run

Database Connection Errors in Tests

Import Errors in Tests

Login Fails

API Returns 500 Errors

npm install Fails

Frontend Won't Start

1. Code Quality & Testing (`.github/workflows/ci.yml`)

2. Security Scanning (`.github/workflows/02-security.yml`)

3. Documentation (`.github/workflows/docs.yml`)

Packages