Merge branch 'main' into claude/issue-11-20251026-2132

strawgate · web-flow · commit 6db1a86c3554 · 2025-11-02T08:31:38.000-06:00
diff --git a/.github/workflows/claude-on-test-failure.yml b/.github/workflows/claude-on-test-failure.yml
@@ -0,0 +1,163 @@
+name: Claude Test Failure Analysis
+
+on:
+  workflow_run:
+    workflows: ["Run Tests"]
+    types:
+      - completed
+
+concurrency:
+  group: claude-test-failure-${{ github.event.workflow_run.head_branch }}
+  cancel-in-progress: true
+
+jobs:
+  claude-analyze-failure:
+    # Only run if the test workflow failed
+    if: ${{ github.event.workflow_run.conclusion == 'failure' }}
+    runs-on: ubuntu-latest
+    permissions:
+      contents: read
+      pull-requests: write
+      issues: write
+      id-token: write
+      actions: read # Required for Claude to read CI results
+    steps:
+      - name: Checkout repository
+        uses: actions/checkout@v4
+        with:
+          fetch-depth: 1
+
+      - name: Set up Python 3.10
+        uses: actions/setup-python@v5
+        with:
+          python-version: '3.10'
+
+      # Install UV package manager
+      - name: Install UV
+        uses: astral-sh/setup-uv@v7
+
+      - name: Set analysis prompt
+        id: analysis-prompt
+        run: |
+          cat >> "$GITHUB_OUTPUT" << 'EOF'
+          PROMPT<<PROMPT_END
+          You're a test failure analysis assistant for Py-Key-Value, a Python framework for interacting with Key-Value stores.
+
+          # Your Task
+          A GitHub Actions workflow has failed. Your job is to:
+          1. Analyze the test failure(s) to understand what went wrong
+          2. Identify the root cause of the failure(s)
+          3. Suggest a clear, actionable solution to fix the failure(s)
+
+          # Getting Started
+          1. Call the generate_agents_md tool to get a high-level summary of the project
+          2. Get the pull request associated with this workflow run from the GitHub repository: ${{ github.repository }}
+             - The workflow run ID is: ${{ github.event.workflow_run.id }}
+             - The workflow run was triggered by: ${{ github.event.workflow_run.event }}
+             - Use GitHub MCP tools to get PR details and workflow run information
+          3. Use the GitHub MCP tools to fetch job logs and failure information:
+             - Use get_workflow_run to get details about the failed workflow
+             - Use list_workflow_jobs to see which jobs failed
+             - Use get_job_logs with failed_only=true to get logs for failed jobs
+             - Use summarize_run_log_failures to get an AI summary of what failed
+          4. Analyze the failures to understand the root cause
+          5. Search the codebase for relevant files, tests, and implementations
+
+          # Your Response
+          Post a comment on the pull request with your analysis. Your comment should include:
+
+          ## Test Failure Analysis
+
+          **Summary**: A brief 1-2 sentence summary of what failed.
+
+          **Root Cause**: A clear explanation of why the tests failed, based on your analysis of the logs and code.
+
+          **Suggested Solution**: Specific, actionable steps to fix the failure(s). Include:
+          - Which files need to be modified
+          - What changes are needed
+          - Why these changes will fix the issue
+
+          <details>
+          <summary>Detailed Analysis</summary>
+
+          Include here:
+          - Relevant log excerpts showing the failure
+          - Code snippets that are causing the issue
+          - Any related issues or PRs that might be relevant
+          </details>
+
+          <details>
+          <summary>Related Files</summary>
+
+          List files that are relevant to the failure with brief explanations of their relevance.
+          </details>
+
+          # Important Guidelines
+          - Be concise and actionable - developers want to quickly understand and fix the issue
+          - Focus on facts from the logs and code, not speculation
+          - If you can't determine the root cause, say so clearly
+          - Provide specific file names, line numbers, and code references when possible
+          - You can run make commands (e.g., `make lint`, `make typecheck`, `make sync`) to build, test, or lint the code
+          - You can also run git commands (e.g., `git status`, `git log`, `git diff`) to inspect the repository
+          - You can use WebSearch and WebFetch to research errors, stack traces, or related issues
+          - For bash commands, you are limited to make and git commands only
+
+          # CRITICAL: Loop Detection
+          **IMPORTANT**: Before posting your analysis, check the PR comments to detect if there's a loop where:
+          - CodeRabbit or another bot triggered this workflow
+          - Your previous analysis triggered CodeRabbit or another bot
+          - This created a repeating cycle of bot comments
+
+          If you detect such a loop (e.g., you see multiple similar bot comments or your own previous analysis comments):
+          1. **DO NOT** post another analysis comment
+          2. Instead, post a brief comment stating: "Loop detected: Multiple automated analysis comments found. Stopping to prevent further automated comments. Please review the existing analysis comments."
+          3. Exit immediately without further action
+
+          # Problems Encountered
+          If you encounter any problems during your analysis (e.g., unable to fetch logs, tools not working), document them clearly so the team knows what limitations you faced.
+          PROMPT_END
+          EOF
+
+      - name: Setup GitHub MCP Server
+        run: |
+          mkdir -p /tmp/mcp-config
+          cat > /tmp/mcp-config/mcp-servers.json << 'EOF'
+          {
+            "mcpServers": {
+              "repository-summary": {
+                "type": "http",
+                "url": "https://agents-md-generator.fastmcp.app/mcp"
+              },
+              "code-search": {
+                "type": "http",
+                "url": "https://public-code-search.fastmcp.app/mcp"
+              },
+              "github-research": {
+                "type": "stdio",
+                "command": "uvx",
+                "args": [
+                  "github-research-mcp"
+                ],
+                "env": {
+                  "DISABLE_SUMMARIES": "true",
+                  "GITHUB_PERSONAL_ACCESS_TOKEN": "${{ secrets.GITHUB_TOKEN }}"
+                }
+              }
+            }
+          }
+          EOF
+
+      - name: Run Claude Code
+        id: claude
+        uses: anthropics/claude-code-action@v1
+        with:
+          claude_code_oauth_token: ${{ secrets.CLAUDE_CODE_OAUTH_TOKEN }}
+
+          additional_permissions: |
+            actions: read
+
+          prompt: ${{ steps.analysis-prompt.outputs.PROMPT }}
+          track_progress: true
+          claude_args: |
+            --allowed-tools mcp__repository-summary,mcp__code-search,mcp__github-research,WebSearch,WebFetch,Bash(make:*),Bash(git:*)
+            --mcp-config /tmp/mcp-config/mcp-servers.json
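Note on the `Setup GitHub MCP Server` step: it writes `/tmp/mcp-config/mcp-servers.json` from a heredoc, and JSON has no comment syntax, so a `#` comment inside that heredoc would make the file unparseable for a strict JSON parser. Below is a minimal sketch of a pre-flight check, assuming the consumer parses the file as strict JSON. The helper `load_mcp_config` and its shape checks are hypothetical and based only on the fields visible in this diff (`type`, `url`, `command`); they are not part of the workflow or of any MCP tooling.

```python
import json
from typing import Any


def load_mcp_config(text: str) -> dict[str, Any]:
    """Parse an MCP server config and run basic shape checks (hypothetical helper)."""
    # json.loads is strict: comments and trailing commas raise json.JSONDecodeError.
    config = json.loads(text)

    servers = config.get("mcpServers")
    if not isinstance(servers, dict) or not servers:
        raise ValueError("expected a non-empty 'mcpServers' object")

    # Assumed shape, taken from the entries in this diff only:
    # "http" servers carry a "url", "stdio" servers carry a "command".
    for name, server in servers.items():
        kind = server.get("type")
        if kind == "http" and "url" not in server:
            raise ValueError(f"{name}: http server needs 'url'")
        if kind == "stdio" and "command" not in server:
            raise ValueError(f"{name}: stdio server needs 'command'")
    return config
```

Running a check like this against the generated file in a local dry run would surface a malformed config before the `Run Claude Code` step depends on it.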
diff --git a/.github/workflows/docs.yml b/.github/workflows/docs.yml
@@ -25,11 +25,11 @@ jobs:
         with:
           enable-cache: true
 
-      - name: Install documentation dependencies
-        run: uv pip install --system mkdocs mkdocs-material 'mkdocstrings[python]' mkdocstrings-python
+      - name: Install all packages and dependencies
+        run: uv sync --all-packages --group dev
 
       - name: Build documentation
-        run: mkdocs build
+        run: uv run --extra docs mkdocs build
 
       - name: Deploy to GitHub Pages
-        run: mkdocs gh-deploy --force
+        run: uv run --extra docs mkdocs gh-deploy --force
diff --git a/AGENTS.md b/AGENTS.md
@@ -112,6 +112,139 @@ to the async package. You will need to include the generated code in your pull
 request. Nobody will generate it for you. This also means pull requests will contain
 two copies of your changes, this is intentional!
 
+## Working with Code Review Feedback
+
+This project uses AI-assisted code review (CodeRabbit) and development (Claude).
+This section provides guidance for both AI agents and human developers on how
+to handle automated code review feedback effectively.
+
+### For AI Coding Agents (Claude)
+
+When responding to CodeRabbit feedback on pull requests:
+
+#### 1. Triage Before Acting
+
+Always categorize feedback before implementation:
+
+- **Critical**: Security issues, data corruption, resource leaks, production bugs
+- **Important**: Type safety, error handling, performance issues, test failures
+- **Optional**: Style preferences, nitpicks, suggestions that conflict with
+  existing patterns
+
+Document your triage in the response to the user.
+
+#### 2. Evaluate Against Existing Patterns
+
+Before accepting suggestions:
+
+1. Search the codebase for similar patterns
+2. Check if other stores/wrappers handle this differently
+3. Preserve consistency over isolated "best practices"
+4. If uncertain, ask the repository owner
+
+**Example**: Test patterns should match existing `ContextManagerStoreTestMixin`
+usage rather than one-off suggestions for individual test files.
+
+#### 3. Consider Context and Scope
+
+Not all code has the same requirements:
+
+- **Production stores**: Prioritize correctness, performance, security
+- **Debug/development tools**: Can defer async optimization, extensive validation
+- **Test code**: Clarity and coverage over production patterns
+- **Examples**: Simplicity and readability over comprehensive error handling
+
+#### 4. Verify Completion
+
+Before claiming work is "ready to merge" or "complete":
+
+- [ ] All critical issues addressed or documented as out-of-scope
+- [ ] All important issues addressed or explicitly deferred with rationale
+- [ ] No unrelated changes from bad merges
+- [ ] `make precommit` passes (lint, typecheck, codegen)
+- [ ] Tests pass
+
+Never claim completion with unresolved critical or important issues.
+
+#### 5. Document Deferrals
+
+If feedback is not implemented, explain why:
+
+- Conflicts with established pattern (cite similar code)
+- Out of scope for this component's purpose
+- Trade-off not worth the complexity
+- Requires design decision from repository owner
+
+### For AI Code Reviewers (CodeRabbit)
+
+When reviewing pull requests, please consider:
+
+#### Project-Specific Patterns
+
+- **Async-first architecture**: All changes originate in `key-value/key-value-aio/`
+  and are code-generated into `key-value/key-value-sync/`. PRs will naturally
+  contain duplicate changes - this is intentional.
+- **Test patterns**: The project uses `ContextManagerStoreTestMixin` for store
+  tests. Look for consistency with existing test implementations.
+- **ManagedEntry wrapper**: Values are never stored directly but are wrapped in
+  `ManagedEntry` objects. This is by design, not a mistake.
+
+#### Prioritization Guidance
+
+When providing feedback, please categorize suggestions by severity:
+
+- **Critical**: Issues that could cause data loss, security vulnerabilities,
+  resource leaks, or production failures
+- **Important**: Type safety issues, missing error handling, performance problems,
+  test coverage gaps
+- **Minor/Optional**: Style preferences, minor optimizations, suggestions that
+  may conflict with existing patterns
+
+This helps human developers and AI agents prioritize their work effectively.
+
+#### Context Awareness
+
+Consider the purpose and scope of the code being reviewed:
+
+- **Production stores** (DynamoDB, Redis, PostgreSQL, etc.): Apply strict
+  standards for correctness, performance, security, and resource management
+- **Debug/development tools** (FileTreeStore, LoggingWrapper): More lenient on
+  performance optimizations; prioritize clarity and simplicity
+- **Test code**: Focus on clarity, coverage, and maintainability over production
+  patterns
+- **Example code**: Prioritize readability and educational value over
+  comprehensive error handling
+
+#### Pattern Consistency
+
+Before suggesting changes:
+
+1. Check if similar patterns exist elsewhere in the codebase
+2. If the pattern exists in multiple stores/wrappers, it's likely intentional
+3. Suggest consistency improvements across all implementations rather than
+   one-off changes
+
+### Common Feedback Categories
+
+**Clock usage**: Prefer monotonic clocks (`time.monotonic()`) for intervals,
+especially in wrappers like rate limiters and circuit breakers. Wall-clock time
+(`time.time()`) is vulnerable to system clock changes.
+
+**Connection ownership**: Track whether stores own their client connections to
+avoid closing externally-provided resources. Use flags like `_owns_client` to
+distinguish between internally-created and externally-provided connections.
+
+**Async patterns**: Production stores should use actual async I/O (not
+`asyncio.sleep()` or `run_in_executor()`). Debug-only tools may use simpler
+patterns for clarity.
+
+**Test isolation**: Ensure tests clean up resources (connections, temp files,
+etc.) and don't interfere with each other. Use context managers and proper
+teardown.
+
+**Type safety**: This project uses strict type checking (Basedpyright). Address
+type annotation issues to maintain type safety guarantees.
+
 ## Make Commands Reference
 
 | Command | Purpose |
@@ -224,6 +357,8 @@ GitHub Actions workflows are in `.github/workflows/`:
 - `publish.yml` - Publish packages to PyPI
 - `claude-on-mention.yml` - Claude Code assistant (can make PRs)
 - `claude-on-open-label.yml` - Claude triage assistant (read-only analysis)
+- `claude-on-test-failure.yml` - Claude test failure analysis (automatically
+  analyzes failed tests and suggests solutions)
 
 ## Version Management
 
@@ -242,5 +377,15 @@ make bump-version-dry VERSION=1.2.3    # Dry run
 
 ## Radical Honesty
 
-Agents should be honest! Properly document any problems encountered, share
-feedback, and be transparent about your AI-assisted work.
+Agents should be honest! When working with code review feedback:
+
+- **Document unresolved items**: List any feedback that wasn't addressed and why
+- **Acknowledge uncertainty**: If unclear whether to implement a suggestion, ask
+- **Report problems**: Document issues encountered during implementation
+- **Share trade-offs**: Explain reasoning for rejecting or deferring feedback
+- **Admit limitations**: If unable to verify a fix works correctly, say so
+
+Never claim work is complete if you have doubts about correctness or completeness.
+
+Properly document any problems encountered, share feedback, and be transparent
+about your AI-assisted work.
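Note on the "Common Feedback Categories" added to AGENTS.md: the clock-usage and connection-ownership items can be sketched as follows. This is an illustration under stated assumptions, not code from the repository. `FixedWindowLimiter`, `FakeClient`, and `ExampleStore` are hypothetical names, and the sketch is synchronous for brevity even though the project is async-first. Only `time.monotonic()` and the `_owns_client` flag name come from the guidance itself.

```python
import time
from typing import Callable, Optional


class FixedWindowLimiter:
    """Hypothetical rate limiter that measures intervals with a monotonic clock."""

    def __init__(
        self,
        max_calls: int,
        window_seconds: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._max_calls = max_calls
        self._window = window_seconds
        # time.monotonic() never goes backwards, so system clock changes
        # cannot shorten or stretch the window. The clock is injectable for tests.
        self._clock = clock
        self._window_start = clock()
        self._calls = 0

    def allow(self) -> bool:
        now = self._clock()
        if now - self._window_start >= self._window:
            # The window has elapsed: start a new one.
            self._window_start = now
            self._calls = 0
        if self._calls < self._max_calls:
            self._calls += 1
            return True
        return False


class FakeClient:
    """Stand-in for a real database client (illustration only)."""

    def __init__(self) -> None:
        self.closed = False

    def close(self) -> None:
        self.closed = True


class ExampleStore:
    """Hypothetical store that closes only the client it created itself."""

    def __init__(self, client: Optional[FakeClient] = None) -> None:
        # Record ownership at construction time: a caller-supplied client
        # stays under the caller's control.
        self._owns_client = client is None
        self._client = client if client is not None else FakeClient()

    def close(self) -> None:
        if self._owns_client:
            self._client.close()
```

Injecting the clock keeps the limiter testable without sleeping, and recording ownership in the constructor means `close()` needs no knowledge of how the client was obtained.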
diff --git a/key-value/key-value-aio/src/key_value/aio/stores/__init__.py b/key-value/key-value-aio/src/key_value/aio/stores/__init__.py
diff --git a/key-value/key-value-aio/src/key_value/aio/stores/elasticsearch/store.py b/key-value/key-value-aio/src/key_value/aio/stores/elasticsearch/store.py
@@ -29,6 +29,7 @@
 
 try:
     from elasticsearch import AsyncElasticsearch
+
     from key_value.aio.stores.elasticsearch.utils import (
         get_aggregations_from_body,
         get_body_from_response,
diff --git a/key-value/key-value-aio/src/key_value/aio/stores/elasticsearch/utils.py b/key-value/key-value-aio/src/key_value/aio/stores/elasticsearch/utils.py
@@ -6,7 +6,6 @@
     ObjectApiResponse,
     SerializationError,
 )
-
 from elasticsearch import AsyncElasticsearch
 
 
diff --git a/key-value/key-value-sync/src/key_value/sync/code_gen/stores/__init__.py b/key-value/key-value-sync/src/key_value/sync/code_gen/stores/__init__.py
@@ -0,0 +1,4 @@
+# WARNING: this file is auto-generated by 'build_sync_library.py'
+# from the original file '__init__.py'
+# DO NOT CHANGE! Change the original file instead.
+
diff --git a/key-value/key-value-sync/src/key_value/sync/stores/__init__.py b/key-value/key-value-sync/src/key_value/sync/stores/__init__.py
@@ -0,0 +1,4 @@
+# WARNING: this file is auto-generated by 'build_sync_library.py'
+# from the original file '__init__.py'
+# DO NOT CHANGE! Change the original file instead.
+
