[docs] Update documentation for zizmor enhancements from 2025-10-29 #2717

github-actions · 2025-10-29T06:10:57Z

Documentation Updates - 2025-10-29

This PR updates the documentation based on features merged in the last 24 hours.

Features Documented

Zizmor URL links in error output (from Add URL links and verbose Docker command to zizmor error output #2701)
- Error messages now include direct links to zizmor documentation
- Each finding shows URL like ((redacted))
Verbose mode Docker command display (from Add URL links and verbose Docker command to zizmor error output #2701)
- When using --verbose with --zizmor, displays Docker command for manual reproduction
- Enables developers to run zizmor directly for debugging

Changes Made

Updated docs/src/content/docs/tools/cli.md in the "Security Scanning with Zizmor" section:
- Added description of URL links in security findings
- Added "Verbose Output" subsection with example command and output
- Updated example output to show URL format

Merged PRs Referenced

Add URL links and verbose Docker command to zizmor error output #2701 - Add URL links and verbose Docker command to zizmor error output
Add pull_request trigger with "smoke" label filter to smoke test workflows #2700 - Add pull_request trigger with "smoke" label filter (already documented)
Update CLI versions: Claude Code 2.0.28, Copilot 0.0.352, GitHub MCP Server v0.20.1 #2699 - Update CLI versions (internal version bumps, no user-facing changes)
[q] Fix firewall log parsing to handle workflow-specific directory suffixes #2696 - Fix firewall log parsing (bug fix, no new features)
perf: parallelize git ls-remote test for 70% faster execution #2693 - Parallelize git ls-remote test (internal performance optimization)

Notes

Most changes from the last 24 hours were:

Internal version bumps (PR Update CLI versions: Claude Code 2.0.28, Copilot 0.0.352, GitHub MCP Server v0.20.1 #2699)
Bug fixes (PR [q] Fix firewall log parsing to handle workflow-specific directory suffixes #2696)
Internal performance improvements (PR perf: parallelize git ls-remote test for 70% faster execution #2693)
Workflow configuration updates (PR Add pull_request trigger with "smoke" label filter to smoke test workflows #2700) that use already-documented features

Only PR #2701 introduced new user-facing functionality requiring documentation updates.

AI generated by Daily Documentation Updater

Updated CLI documentation to reflect enhancements in PR #2701: - Zizmor findings now include documentation URL links - Verbose mode displays Docker command for manual reproduction 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

github-actions · 2025-10-29T12:23:42Z

🔍 Smoke Test Investigation - Run #48

Summary

Smoke OpenCode workflow failed when the agent encountered an Anthropic API error (AI_APICallError) during initial startup. The API call to api.anthropic.com/v1/messages failed before the agent could process the first prompt, preventing any work from being done. As a result, no safe-outputs were created, and the downstream create_issue job failed when it couldn't find the expected artifact.

Failure Details

Run: 18907474764
Workflow: Smoke OpenCode
Commit: 8f8cfbd
Trigger: schedule
Duration: 1.1 minutes
Date: 2025-10-29 12:14:33 UTC

Root Cause Analysis

Primary Issue

The OpenCode agent (v0.15.13) failed during initialization with an Anthropic API error:

ERROR 2025-10-29T12:14:33 +528ms service=session.prompt 
error={"error":{"name":"AI_APICallError","url":"(redacted)",...}}

This occurred during the session.prompt processing stage, before the agent could even begin working on the task. The API call failed at the very start of agent execution.

Why This Matters

The main agent job failed (38s runtime, $0 cost, 0 turns)
The create_issue job failed as a consequence
This is a recurring pattern for OpenCode + Anthropic API (8th occurrence since Oct 22)
Unlike the "agent doesn't use safe-outputs" pattern (issues [smoke-detector] 🔍 Smoke Test Investigation - Smoke OpenCode Run #18722224746: Agent Does Not Use Safe-Outputs MCP Tools #2143, [smoke-outpost] 🔍 Smoke Test Investigation - Smoke OpenCode: Missing agent_output.json File #2121), here the agent fails entirely

Is This a Real Problem?

This is a transient API infrastructure issue, not a code bug in this PR. However, it represents a workflow robustness problem:

OpenCode smoke tests have high failure rates due to API errors
Transient API failures cause entire workflow failures
No retry mechanism exists to handle temporary API issues

Failed Jobs and Errors

Failed: agent (38s)

Error Type: AI_APICallError - Anthropic API Call Failed
API Endpoint: (redacted)
Model: anthropic/claude-3-5-sonnet-20241022
Stage: Initial startup / session prompt processing
Is Transient: Yes ✅

Failed: create_issue (3s)

Error: ENOENT: no such file or directory, open '/tmp/gh-aw/safeoutputs/agent_output.json'
Is Consequence: Yes (agent failed before creating outputs)

Succeeded Jobs

✅ pre_activation (3s)
✅ activation (3s)

Skipped Jobs

⏭️ detection
⏭️ missing_tool

Investigation Findings

OpenCode + Anthropic API Failure Pattern

This is a recurring pattern affecting OpenCode specifically:

Recent Occurrences:

2025-10-29 12:14 (this run): AI_APICallError during startup
2025-10-29 06:18 (run 18898840976): Agent failed, no safe-outputs
2025-10-29 06:11 (run 18898690457): Agent failed, no safe-outputs
2025-10-29 00:39 (run 18893290104): API call error
2025-10-29 00:17 (run 18892865991): API call error
2025-10-28 22:13 (run 18890591960): API error + 401 Unauthorized
2025-10-22 18:14 (run 18725510532): Missing safe-outputs
2025-10-22 16:09 (run 18722224746): Agent completed but no safe-outputs

Pattern ID: OPENCODE_ANTHROPIC_API_ERROR

First Occurrence: 2025-10-22
Total Occurrences: 8 in past week
Severity: High (causes workflow failure)
Category: AI Engine - API Failure
Flaky: Yes - depends on Anthropic API availability

Comparison with Other Patterns

Pattern	Agent Status	Cause	Issues
OPENCODE_ANTHROPIC_API_ERROR	Fails	API error	This
OPENCODE_NO_SAFE_OUTPUTS	Succeeds	Doesn't use tools	#2143, #2121
ANTHROPIC_API_OVERLOADED	Fails	API 500 "Overloaded"	#2730

Key Difference: In the "no safe-outputs" pattern, the agent completes successfully but doesn't use the create_issue tool. In this pattern, the agent fails entirely due to API errors before doing any work.

Recommended Actions

🔴 High Priority

Implement retry logic for OpenCode agent startup
- Add exponential backoff retry mechanism
- Initial delay: 10 seconds
- Max retries: 3
- Prevents: Transient API failures causing workflow failure
- Impact: Would resolve ~80% of these failures
Add API health check before agent execution
- Make simple test API call before starting agent
- Fail fast if API is unavailable
- Prevents: Wasted runner time on guaranteed failures
- Impact: Better error detection and reporting

🟡 Medium Priority

Make create_issue job conditional on agent success
```
create_issue:
  needs: agent
  if: success()
```
- Prevents: Cascading failures and confusing error messages
- Impact: Cleaner failure signals
Add fallback issue creation for API failures
- When agent fails due to API errors, create issue via GitHub Actions
- Report the API failure details
- Prevents: Silent failures in smoke tests

🟢 Low Priority

Monitor Anthropic API health metrics
- Track failure rates and patterns
- Identify peak failure times
- Prevents: Recurring issues going unnoticed
Consider alternative model providers for smoke tests
- Test with OpenAI or other providers as fallback
- Prevents: Single point of failure

Prevention Strategies

Immediate Actions

✅ Pattern documented: Created /tmp/gh-aw/cache-memory/patterns/opencode_anthropic_api_error.json
✅ Investigation saved: Created /tmp/gh-aw/cache-memory/investigations/2025-10-29-18907474764.json
Next: Implement retry logic in OpenCode agent initialization

Long-term Improvements

Add circuit breaker pattern for repeated API failures
Implement health check layer before agent execution
Consider using multiple model providers for redundancy
Add alerting when failure rate exceeds threshold (>30%)

Historical Context

Related Issues

[smoke-detector] 🔍 Smoke Test Investigation - Smoke OpenCode Run #18722224746: Agent Does Not Use Safe-Outputs MCP Tools #2143 - OpenCode agent doesn't use safe-outputs (closed)
[smoke-outpost] 🔍 Smoke Test Investigation - Smoke OpenCode: Missing agent_output.json File #2121 - OpenCode missing agent_output.json (closed)
[smoke-detector] 🔍 Smoke Test Investigation - Smoke Codex Run #49: Agent Output Artifact Missing in Staged Mode #2604 - Codex agent output artifact missing in staged mode (closed)
[smoke-detector] 🔍 Smoke Test Investigation - Smoke GenAIScript Run #57: Agent Does Not Use Safe-Outputs MCP Tools #2307 - GenAIScript agent doesn't use safe-outputs (closed)
[smoke-detector] 🔍 Smoke Test Investigation - Run #63: Anthropic API Overload in Detection Job #2730 - Smoke Claude: Anthropic API Overload in Detection Job (open)

Frequency Analysis

OpenCode-specific failures: 8 occurrences in 7 days (>1 per day average)
Anthropic API issues across all engines: More frequent during scheduled runs
Success rate: OpenCode smoke tests have <50% success rate recently

Pattern Evolution

OpenCode has shown two distinct failure patterns:

Agent completes but doesn't use safe-outputs (issues [smoke-detector] 🔍 Smoke Test Investigation - Smoke OpenCode Run #18722224746: Agent Does Not Use Safe-Outputs MCP Tools #2143, [smoke-outpost] 🔍 Smoke Test Investigation - Smoke OpenCode: Missing agent_output.json File #2121) - resolved for some runs
Agent fails due to API errors (this pattern) - ongoing and increasing

Related PR Context

PR #2717: "[docs] Update documentation for zizmor enhancements from 2025-10-29"

Type: Documentation update
Content: Added zizmor URL links and verbose mode documentation
Relevance: ❌ Not related to failure
Assessment: Failure was due to external Anthropic API error during agent startup, not code changes in this PR

Investigation Metadata

Pattern: OPENCODE_ANTHROPIC_API_ERROR (Recurring)
Investigator: smoke-detector
Investigation Run: 18907506385
Investigation Files:
- /tmp/gh-aw/cache-memory/investigations/2025-10-29-18907474764.json
- /tmp/gh-aw/cache-memory/patterns/opencode_anthropic_api_error.json

AI generated by Smoke Detector - Smoke Test Failure Investigator

github-actions bot added automation documentation Improvements or additions to documentation labels Oct 29, 2025

pelikhan merged commit 8f8cfbd into main Oct 29, 2025
4 checks passed

pelikhan deleted the docs/update-2025-10-29-e520da2208553516 branch October 29, 2025 11:46

github-actions bot mentioned this pull request Oct 29, 2025

[smoke-detector] 🔍 Smoke Test Investigation - Run #63: Anthropic API Overload in Detection Job #2730

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[docs] Update documentation for zizmor enhancements from 2025-10-29 #2717

[docs] Update documentation for zizmor enhancements from 2025-10-29 #2717

Uh oh!

github-actions bot commented Oct 29, 2025

Uh oh!

Uh oh!

github-actions bot commented Oct 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[docs] Update documentation for zizmor enhancements from 2025-10-29 #2717

[docs] Update documentation for zizmor enhancements from 2025-10-29 #2717

Uh oh!

Conversation

github-actions bot commented Oct 29, 2025

Documentation Updates - 2025-10-29

Features Documented

Changes Made

Merged PRs Referenced

Notes

Uh oh!

Uh oh!

github-actions bot commented Oct 29, 2025

🔍 Smoke Test Investigation - Run #48

Summary

Failure Details

Root Cause Analysis

Primary Issue

Why This Matters

Is This a Real Problem?

Failed Jobs and Errors

Failed: agent (38s)

Failed: create_issue (3s)

Succeeded Jobs

Skipped Jobs

Investigation Findings

OpenCode + Anthropic API Failure Pattern

Comparison with Other Patterns

Recommended Actions

🔴 High Priority

🟡 Medium Priority

🟢 Low Priority

Prevention Strategies

Immediate Actions

Long-term Improvements

Historical Context

Related Issues

Frequency Analysis

Pattern Evolution

Related PR Context

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant