diff --git a/DEVELOPMENT.md b/DEVELOPMENT.md new file mode 100644 index 00000000..f7d05ed7 --- /dev/null +++ b/DEVELOPMENT.md @@ -0,0 +1,63 @@ +# AutoCoder Development Roadmap + +This roadmap breaks work into clear phases so you can pick the next most valuable items quickly. + +## Phase 0 — Baseline (ship ASAP) +- **PR discipline:** Enforce branch protection requiring “PR Check” (already configured in workflows; ensure GitHub rule is on). +- **Secrets hygiene:** Move all deploy secrets into repo/environment secrets; prohibit `.env` commits via pre-commit hook. +- **Smoke tests:** Keep `/health` and `/readiness` endpoints green; add UI smoke (landing page loads) to CI. + +## Phase 1 — Reliability & Observability +- **Structured logging:** Add JSON logging for FastAPI (uvicorn access + app logs) with request IDs; forward to stdout for Docker/Traefik. +- **Error reporting:** Wire Sentry (or OpenTelemetry + OTLP) for backend exceptions and front-end errors. +- **Metrics:** Expose `/metrics` (Prometheus) for FastAPI; Traefik already exposes metrics option—enable when scraping is available. +- **Tracing:** Add OTEL middleware to FastAPI; propagate trace IDs through to Claude/Gemini calls when possible. + +## Phase 2 — Platform & DevX +- **Local dev parity:** Add `docker-compose.dev.yml` with hot-reload for FastAPI + Vite UI; document one-command setup. +- **Makefile/taskfile:** Common commands (`make dev`, `make test`, `make lint`, `make format`, `make seed`). +- **Pre-commit:** Ruff, mypy, black (if adopted), eslint/prettier for `ui/`. +- **Typed APIs:** Add mypy strict mode to `server/` and type `schemas.py` fully (Pydantic v2 ConfigDict). + +## Phase 3 — Product & Agent Quality +- **Model selection UI:** Let users choose assistant provider (Claude/Gemini) in settings; display active provider badge in chat. +- **Tooling guardrails:** For Gemini (chat-only), show “no tools” notice in UI and fallback logic to Claude when tools needed. 
+- **Conversation persistence:** Add pagination/search over assistant history; export conversation to file. +- **Feature board:** Surface feature stats/graph from MCP in the UI (read-only dashboard). + +## Phase 4 — Security & Compliance +- **AuthN/AuthZ:** Add optional login (JWT/OIDC) gate for UI/API; role for “admin” vs “viewer” at least. +- **Rate limiting:** Enable per-IP rate limits at Traefik and per-token limits in FastAPI. +- **Audit trails:** Log agent actions and feature state changes with user identity. +- **Headers/HTTPS:** HSTS via Traefik, content-security-policy header from FastAPI. + +## Phase 5 — Performance & Scale +- **Caching:** CDN/Traefik static cache for UI assets; server-side cache for model list/status endpoints. +- **Worker separation:** Optionally split agent runner from API via separate services and queues (e.g., Redis/RQ or Celery). +- **Background jobs:** Move long-running tasks to scheduler/worker with backoff and retries. + +## Phase 6 — Testing & Quality Gates +- **Backend tests:** Add pytest suite for key routers (`/api/setup/status`, assistant chat happy-path with mock Claude/Gemini). +- **Frontend tests:** Add Vitest + React Testing Library smoke tests for core pages (dashboard loads, settings save). +- **E2E:** Playwright happy-path (login optional, start agent, view logs). +- **Coverage:** Fail CI if coverage drops below threshold (start at 60–70%). + +## Phase 7 — Deployment & Ops +- **Blue/green deploy:** Add image tagging `:sha` + `:latest` (already for CI) with Traefik service labels to toggle. +- **Backups:** Snapshot `~/.autocoder` data volume; document restore. +- **Runbooks:** Add `RUNBOOK.md` for common ops (restart, rotate keys, renew certs, roll back). + +## Phase 8 — Documentation & Onboarding +- **Getting started:** Short path for “run locally in 5 minutes” (scripted). +- **Config matrix:** Document required/optional env vars (Claude, Gemini, DuckDNS, Traefik, TLS). 
+- **Architecture:** One-page diagram: UI ↔ FastAPI ↔ Agent subprocess ↔ Claude/Gemini; MCP servers; Traefik front. + +## Stretch Ideas +- **Telemetry-driven tuning:** Auto-select model/provider based on latency/cost SLA. +- **Cost controls:** Show per-run token/cost estimates; configurable budgets. +- **Offline/edge mode:** Ollama provider toggle with cached models. + +## How to use this roadmap +- Pick the next phase that unblocks your current goal (reliability → platform → product). +- Keep PRs small and scoped to one bullet. +- Update this document when a bullet ships or is reprioritized. diff --git a/README.md b/README.md index 9fe4588c..4543cd2b 100644 --- a/README.md +++ b/README.md @@ -35,6 +35,13 @@ You need one of the following: - **Claude Pro/Max Subscription** - Use `claude login` to authenticate (recommended) - **Anthropic API Key** - Pay-per-use from https://console.anthropic.com/ +### Optional: Gemini API (assistant chat only) +- `GEMINI_API_KEY` (required) +- `GEMINI_MODEL` (optional, default `gemini-1.5-flash`) +- `GEMINI_BASE_URL` (optional, default `https://generativelanguage.googleapis.com/v1beta/openai`) + +Notes: Gemini is used for assistant chat when configured; coding agents still run on Claude/Anthropic (tools are not available in Gemini mode). 
+ --- ## Quick Start diff --git a/docker-compose.traefik.yml b/docker-compose.traefik.yml new file mode 100644 index 00000000..29d79632 --- /dev/null +++ b/docker-compose.traefik.yml @@ -0,0 +1,40 @@ +version: "3.9" + +services: + traefik: + image: traefik:v3.1 + command: + - --providers.docker=true + - --providers.docker.exposedbydefault=false + - --entrypoints.web.address=:80 + - --entrypoints.websecure.address=:443 + - --certificatesresolvers.le.acme.httpchallenge=true + - --certificatesresolvers.le.acme.httpchallenge.entrypoint=web + - --certificatesresolvers.le.acme.email=${LETSENCRYPT_EMAIL} + - --certificatesresolvers.le.acme.storage=/letsencrypt/acme.json + ports: + - "80:80" + - "443:443" + volumes: + - /var/run/docker.sock:/var/run/docker.sock:ro + - ./letsencrypt:/letsencrypt + networks: + - traefik-proxy + + autocoder: + networks: + - traefik-proxy + labels: + - traefik.enable=true + - traefik.http.routers.autocoder.rule=Host(`${DOMAIN}`) + - traefik.http.routers.autocoder.entrypoints=websecure + - traefik.http.routers.autocoder.tls.certresolver=le + - traefik.http.services.autocoder.loadbalancer.server.port=${APP_PORT:-8888} + - traefik.http.routers.autocoder-web.rule=Host(`${DOMAIN}`) + - traefik.http.routers.autocoder-web.entrypoints=web + - traefik.http.routers.autocoder-web.middlewares=redirect-to-https + - traefik.http.middlewares.redirect-to-https.redirectscheme.scheme=https + +networks: + traefik-proxy: + external: true diff --git a/requirements.txt b/requirements.txt index 9cf420e0..51413362 100644 --- a/requirements.txt +++ b/requirements.txt @@ -10,6 +10,7 @@ aiofiles>=24.0.0 apscheduler>=3.10.0,<4.0.0 pywinpty>=2.0.0; sys_platform == "win32" pyyaml>=6.0.0 +openai>=1.52.0 # Dev dependencies ruff>=0.8.0 diff --git a/scripts/deploy.sh b/scripts/deploy.sh new file mode 100644 index 00000000..7315321a --- /dev/null +++ b/scripts/deploy.sh @@ -0,0 +1,133 @@ +#!/usr/bin/env bash + +# One-click Docker deploy for AutoCoder on a VPS with DuckDNS + 
Traefik + Let's Encrypt. +# Prompts for domain, DuckDNS token, email, repo, branch, and target install path. + +set -euo pipefail + +if [[ $EUID -ne 0 ]]; then + echo "Please run as root (sudo)." >&2 + exit 1 +fi + +prompt_required() { + local var_name="$1" prompt_msg="$2" + local value + while true; do + read -r -p "$prompt_msg: " value + if [[ -n "$value" ]]; then + printf -v "$var_name" '%s' "$value" + export "$var_name" + return + fi + echo "Value cannot be empty." + done +} + +echo "=== AutoCoder VPS Deploy (Docker + Traefik + DuckDNS + Let's Encrypt) ===" + +prompt_required DOMAIN "Enter your DuckDNS domain (e.g., myapp.duckdns.org)" +prompt_required DUCKDNS_TOKEN "Enter your DuckDNS token" +prompt_required LETSENCRYPT_EMAIL "Enter email for Let's Encrypt notifications" + +read -r -p "Git repo URL [https://github.com/heidi-dang/autocoder.git]: " REPO_URL +REPO_URL=${REPO_URL:-https://github.com/heidi-dang/autocoder.git} + +read -r -p "Git branch to deploy [main]: " DEPLOY_BRANCH +DEPLOY_BRANCH=${DEPLOY_BRANCH:-main} + +read -r -p "Install path [/opt/autocoder]: " APP_DIR +APP_DIR=${APP_DIR:-/opt/autocoder} + +read -r -p "App internal port (container) [8888]: " APP_PORT +APP_PORT=${APP_PORT:-8888} + +echo +echo "Domain: $DOMAIN" +echo "Repo: $REPO_URL" +echo "Branch: $DEPLOY_BRANCH" +echo "Path: $APP_DIR" +echo +read -r -p "Proceed? [y/N]: " CONFIRM +if [[ "${CONFIRM,,}" != "y" ]]; then + echo "Aborted." + exit 1 +fi + +ensure_packages() { + echo "Installing Docker & prerequisites..." + apt-get update -y + apt-get install -y ca-certificates curl git gnupg + install -m 0755 -d /etc/apt/keyrings + if [[ ! -f /etc/apt/keyrings/docker.gpg ]]; then + curl -fsSL https://download.docker.com/linux/ubuntu/gpg | gpg --dearmor -o /etc/apt/keyrings/docker.gpg + chmod a+r /etc/apt/keyrings/docker.gpg + echo \ + "deb [arch=$(dpkg --print-architecture) signed-by=/etc/apt/keyrings/docker.gpg] https://download.docker.com/linux/ubuntu \ + $(. 
/etc/os-release && echo "$VERSION_CODENAME") stable" > /etc/apt/sources.list.d/docker.list + apt-get update -y + fi + apt-get install -y docker-ce docker-ce-cli containerd.io docker-buildx-plugin docker-compose-plugin + systemctl enable --now docker +} + +configure_duckdns() { + echo "Configuring DuckDNS..." + local cron_file="/etc/cron.d/duckdns" + cat > "$cron_file" </var/log/duckdns.log 2>&1 +EOF + chmod 644 "$cron_file" + # Run once immediately + curl -fsS "https://www.duckdns.org/update?domains=$DOMAIN&token=$DUCKDNS_TOKEN&ip=" >/var/log/duckdns.log 2>&1 || true +} + +clone_repo() { + if [[ -d "$APP_DIR/.git" ]]; then + echo "Repo already exists, pulling latest..." + git -C "$APP_DIR" fetch --all + git -C "$APP_DIR" checkout "$DEPLOY_BRANCH" + git -C "$APP_DIR" pull --ff-only origin "$DEPLOY_BRANCH" + else + echo "Cloning repository..." + mkdir -p "$APP_DIR" + git clone --branch "$DEPLOY_BRANCH" "$REPO_URL" "$APP_DIR" + fi +} + +write_env() { + echo "Writing deploy env (.env.deploy)..." + cat > "$APP_DIR/.env.deploy" </dev/null 2>&1 || docker network create traefik-proxy + docker compose --env-file .env.deploy -f docker-compose.yml -f docker-compose.traefik.yml pull || true + docker compose --env-file .env.deploy -f docker-compose.yml -f docker-compose.traefik.yml up -d --build +} + +ensure_packages +configure_duckdns +clone_repo +write_env +prepare_ssl_storage +run_compose + +echo +echo "Deployment complete." +echo "Check: http://$DOMAIN (will redirect to https after cert is issued)." +echo "Logs: docker compose -f docker-compose.yml -f docker-compose.traefik.yml logs -f" +echo "To update: rerun this script; it will git pull and restart." diff --git a/server/gemini_client.py b/server/gemini_client.py new file mode 100644 index 00000000..c794dfc5 --- /dev/null +++ b/server/gemini_client.py @@ -0,0 +1,80 @@ +""" +Lightweight Gemini API client (OpenAI-compatible endpoint). 
+ +Uses Google's OpenAI-compatible Gemini endpoint: +https://generativelanguage.googleapis.com/v1beta/openai + +Environment variables: +- GEMINI_API_KEY (required) +- GEMINI_MODEL (optional, default: gemini-1.5-flash) +- GEMINI_BASE_URL (optional, default: official OpenAI-compatible endpoint) +""" + +import os +from typing import AsyncGenerator, Iterable, Optional + +from openai import AsyncOpenAI + +# Default OpenAI-compatible base URL for Gemini +DEFAULT_GEMINI_BASE_URL = "https://generativelanguage.googleapis.com/v1beta/openai" +DEFAULT_GEMINI_MODEL = os.getenv("GEMINI_MODEL", "gemini-1.5-flash") + + +def is_gemini_configured() -> bool: + """Return True if a Gemini API key is available.""" + return bool(os.getenv("GEMINI_API_KEY")) + + +def _build_client() -> AsyncOpenAI: + api_key = os.getenv("GEMINI_API_KEY") + if not api_key: + raise RuntimeError("GEMINI_API_KEY is not set") + + base_url = os.getenv("GEMINI_BASE_URL", DEFAULT_GEMINI_BASE_URL) + return AsyncOpenAI(api_key=api_key, base_url=base_url) + + +async def stream_chat( + user_message: str, + *, + system_prompt: Optional[str] = None, + model: Optional[str] = None, + extra_messages: Optional[Iterable[dict]] = None, +) -> AsyncGenerator[str, None]: + """ + Stream a chat completion from Gemini. + + Args: + user_message: Primary user input + system_prompt: Optional system prompt to prepend + model: Optional model name; defaults to GEMINI_MODEL env or fallback constant + extra_messages: Optional prior messages (list of {"role","content"}) + Yields: + Text chunks as they arrive. 
+ """ + client = _build_client() + messages = [] + + if system_prompt: + messages.append({"role": "system", "content": system_prompt}) + + if extra_messages: + messages.extend(extra_messages) + + messages.append({"role": "user", "content": user_message}) + + completion = await client.chat.completions.create( + model=model or DEFAULT_GEMINI_MODEL, + messages=messages, + stream=True, + ) + + async for chunk in completion: + for choice in chunk.choices: + delta = choice.delta + # OpenAI-compatible streaming deltas carry the text as a plain string in delta.content. + content = delta.content if delta else None + if content: + yield content diff --git a/server/main.py b/server/main.py index 0e091648..0600e3c3 100644 --- a/server/main.py +++ b/server/main.py @@ -204,7 +204,11 @@ async def setup_status(): # If GLM mode is configured via .env, we have alternative credentials glm_configured = bool(os.getenv("ANTHROPIC_BASE_URL") and os.getenv("ANTHROPIC_AUTH_TOKEN")) - credentials = has_claude_config or glm_configured + + # Gemini configuration (OpenAI-compatible Gemini API) + gemini_configured = bool(os.getenv("GEMINI_API_KEY")) + + credentials = has_claude_config or glm_configured or gemini_configured # Check for Node.js and npm node = shutil.which("node") is not None @@ -215,6 +219,7 @@ async def setup_status(): credentials=credentials, node=node, npm=npm, + gemini=gemini_configured, ) diff --git a/server/schemas.py b/server/schemas.py index 0a2807cc..b06cc9ec 100644 --- a/server/schemas.py +++ b/server/schemas.py @@ -227,6 +227,7 @@ class SetupStatus(BaseModel): credentials: bool node: bool npm: bool + gemini: bool = False # ============================================================================ diff --git a/server/services/assistant_chat_session.py b/server/services/assistant_chat_session.py index f15eee8a..190e8207 100755 --- a/server/services/assistant_chat_session.py +++ 
b/server/services/assistant_chat_session.py @@ -20,6 +20,7 @@ from claude_agent_sdk import ClaudeAgentOptions, ClaudeSDKClient from dotenv import load_dotenv +from ..gemini_client import is_gemini_configured, stream_chat from .assistant_database import ( add_message, create_conversation, @@ -182,6 +183,8 @@ def __init__(self, project_name: str, project_dir: Path, conversation_id: Option self._client_entered: bool = False self.created_at = datetime.now() self._history_loaded: bool = False # Track if we've loaded history for resumed conversations + self.provider: str = "gemini" if is_gemini_configured() else "claude" + self._system_prompt: str | None = None async def close(self) -> None: """Clean up resources and close the Claude client.""" @@ -249,6 +252,7 @@ async def start(self) -> AsyncGenerator[dict, None]: # Get system prompt with project context system_prompt = get_system_prompt(self.project_name, self.project_dir) + self._system_prompt = system_prompt # Write system prompt to CLAUDE.md file to avoid Windows command line length limit # The SDK will read this via setting_sources=["project"] @@ -257,42 +261,46 @@ async def start(self) -> AsyncGenerator[dict, None]: f.write(system_prompt) logger.info(f"Wrote assistant system prompt to {claude_md_path}") - # Use system Claude CLI - system_cli = shutil.which("claude") + if self.provider == "gemini": + logger.info("Assistant session using Gemini provider (no tools).") + self.client = None + else: + # Use system Claude CLI + system_cli = shutil.which("claude") - # Build environment overrides for API configuration - sdk_env = {var: os.getenv(var) for var in API_ENV_VARS if os.getenv(var)} + # Build environment overrides for API configuration + sdk_env = {var: os.getenv(var) for var in API_ENV_VARS if os.getenv(var)} - # Determine model from environment or use default - # This allows using alternative APIs (e.g., GLM via z.ai) that may not support Claude model names - model = os.getenv("ANTHROPIC_DEFAULT_OPUS_MODEL", 
"claude-opus-4-5-20251101") + # Determine model from environment or use default + # This allows using alternative APIs (e.g., GLM via z.ai) that may not support Claude model names + model = os.getenv("ANTHROPIC_DEFAULT_OPUS_MODEL", "claude-opus-4-5-20251101") - try: - logger.info("Creating ClaudeSDKClient...") - self.client = ClaudeSDKClient( - options=ClaudeAgentOptions( - model=model, - cli_path=system_cli, - # System prompt loaded from CLAUDE.md via setting_sources - # This avoids Windows command line length limit (~8191 chars) - setting_sources=["project"], - allowed_tools=[*READONLY_BUILTIN_TOOLS, *ASSISTANT_FEATURE_TOOLS], - mcp_servers=mcp_servers, - permission_mode="bypassPermissions", - max_turns=100, - cwd=str(self.project_dir.resolve()), - settings=str(settings_file.resolve()), - env=sdk_env, + try: + logger.info("Creating ClaudeSDKClient...") + self.client = ClaudeSDKClient( + options=ClaudeAgentOptions( + model=model, + cli_path=system_cli, + # System prompt loaded from CLAUDE.md via setting_sources + # This avoids Windows command line length limit (~8191 chars) + setting_sources=["project"], + allowed_tools=[*READONLY_BUILTIN_TOOLS, *ASSISTANT_FEATURE_TOOLS], + mcp_servers=mcp_servers, + permission_mode="bypassPermissions", + max_turns=100, + cwd=str(self.project_dir.resolve()), + settings=str(settings_file.resolve()), + env=sdk_env, + ) ) - ) - logger.info("Entering Claude client context...") - await self.client.__aenter__() - self._client_entered = True - logger.info("Claude client ready") - except Exception as e: - logger.exception("Failed to create Claude client") - yield {"type": "error", "content": f"Failed to initialize assistant: {str(e)}"} - return + logger.info("Entering Claude client context...") + await self.client.__aenter__() + self._client_entered = True + logger.info("Claude client ready") + except Exception as e: + logger.exception("Failed to create Claude client") + yield {"type": "error", "content": f"Failed to initialize assistant: 
{str(e)}"} + return # Send initial greeting only for NEW conversations # Resumed conversations already have history loaded from the database @@ -329,7 +337,7 @@ async def send_message(self, user_message: str) -> AsyncGenerator[dict, None]: - {"type": "response_done"} - {"type": "error", "content": str} """ - if not self.client: + if self.provider != "gemini" and not self.client: yield {"type": "error", "content": "Session not initialized. Call start() first."} return @@ -365,11 +373,15 @@ async def send_message(self, user_message: str) -> AsyncGenerator[dict, None]: logger.info(f"Loaded {len(history)} messages from conversation history") try: - async for chunk in self._query_claude(message_to_send): - yield chunk + if self.provider == "gemini": + async for chunk in self._query_gemini(message_to_send): + yield chunk + else: + async for chunk in self._query_claude(message_to_send): + yield chunk yield {"type": "response_done"} except Exception as e: - logger.exception("Error during Claude query") + logger.exception("Error during assistant query") yield {"type": "error", "content": f"Error: {str(e)}"} async def _query_claude(self, message: str) -> AsyncGenerator[dict, None]: @@ -413,6 +425,27 @@ async def _query_claude(self, message: str) -> AsyncGenerator[dict, None]: if full_response and self.conversation_id: add_message(self.project_dir, self.conversation_id, "assistant", full_response) + async def _query_gemini(self, message: str) -> AsyncGenerator[dict, None]: + """ + Query Gemini and stream plain-text responses (no tool calls). 
+ """ + full_response = "" + try: + async for text in stream_chat( + message, + system_prompt=self._system_prompt, + model=os.getenv("GEMINI_MODEL"), + ): + full_response += text + yield {"type": "text", "content": text} + except Exception as e: + logger.exception("Gemini query failed") + yield {"type": "error", "content": f"Gemini error: {e}"} + return + + if full_response and self.conversation_id: + add_message(self.project_dir, self.conversation_id, "assistant", full_response) + def get_conversation_id(self) -> Optional[int]: """Get the current conversation ID.""" return self.conversation_id diff --git a/ui/src/components/SetupWizard.tsx b/ui/src/components/SetupWizard.tsx index 79d009ee..95a11a3a 100644 --- a/ui/src/components/SetupWizard.tsx +++ b/ui/src/components/SetupWizard.tsx @@ -98,6 +98,24 @@ export function SetupWizard({ onComplete }: SetupWizardProps) { helpText="Install Node.js" optional /> + + {/* Gemini (chat-only) */} + {/* Continue Button */} diff --git a/ui/src/lib/types.ts b/ui/src/lib/types.ts index d883432f..4afe0d04 100644 --- a/ui/src/lib/types.ts +++ b/ui/src/lib/types.ts @@ -144,6 +144,7 @@ export interface SetupStatus { credentials: boolean node: boolean npm: boolean + gemini: boolean } // Dev Server types