Add automated Claude workflow for test failure analysis #200

Copilot · 2025-11-02T00:46:04Z

When PR tests fail, automatically trigger Claude to analyze failures and post diagnostic comments with suggested fixes.

Changes

New workflow .github/workflows/claude-on-test-failure.yml
- Triggers on workflow_run: completed for "Run Tests" workflow
- Runs only when conclusion == 'failure'
- Waits for all test jobs to complete before analyzing
- Uses GitHub MCP tools to fetch workflow logs and failure details
- Posts structured PR comment with root cause and actionable solution
- Concurrency controls: Only one analysis runs at a time per branch, with older jobs automatically cancelled
- Loop detection: Claude checks for bot comment loops (e.g., CodeRabbit ↔ Claude cycles) and exits with a warning if detected
Documentation updated in AGENTS.md

Workflow Behavior

When tests fail, Claude:

Fetches failed job logs via get_job_logs and summarize_run_log_failures
Analyzes failures against codebase context
Checks PR comments for bot loops before posting
Posts comment with:
- Failure summary
- Root cause analysis
- Specific file changes needed
- Detailed logs in collapsible sections

Follows existing Claude workflow patterns (same MCP servers, similar prompt structure, consistent tool allowlist).

Safety Features

Concurrency management: cancel-in-progress: true ensures efficient resource usage by canceling superseded analysis runs
Loop prevention: Explicit instructions for Claude to detect and break bot interaction loops

<issue_title>Add Claude Workflow for failed tests</issue_title>
><issue_description>When a PR check fails, we should trigger Claude automatically to assess the failure and post a comment with what it thinks the solution to the failure is.
>
> Can we have it wait until all checks on the PR report success or failure?
>
></issue_description>
>
> ## Comments on the Issue (you are @copilot in this section)
>
>
><comment_new>@strawgate
> @claude please make a pull request</comment_new>
>
>

Fixes Add Claude Workflow for failed tests #195

Original prompt

This section details on the original issue you should resolve

<issue_title>Add Claude Workflow for failed tests</issue_title>
<issue_description>When a PR check fails, we should trigger Claude automatically to assess the failure and post a comment with what it thinks the solution to the failure is.

Can we have it wait until all checks on the PR report success or failure?

</issue_description>

Comments on the Issue (you are @copilot in this section)
@strawgate @claude please make a pull request

Fixes Add Claude Workflow for failed tests #195

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

coderabbitai · 2025-11-02T00:46:12Z

Important

Review skipped

Bot user detected.

To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch copilot/add-claude-workflow-for-tests

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Co-authored-by: strawgate <6384545+strawgate@users.noreply.github.com>

sonarqubecloud · 2025-11-02T00:50:08Z

Please retry analysis of this Pull-Request directly on SonarQube Cloud

Co-authored-by: strawgate <6384545+strawgate@users.noreply.github.com>

strawgate · 2025-11-02T00:54:43Z

@copilot i think we want to set concurrency controls here so that older jobs get cancelled, only one runs at a time, etc. We also should make it very clear in the prompt that if it detects a loop (code rabbit triggers claude triggers code rabbit triggers claude sorta thing) it must exist.

Co-authored-by: strawgate <6384545+strawgate@users.noreply.github.com>

Copilot · 2025-11-02T00:58:11Z

@copilot i think we want to set concurrency controls here so that older jobs get cancelled, only one runs at a time, etc. We also should make it very clear in the prompt that if it detects a loop (cod...

Added in commit 914a4de:

Concurrency controls: Added concurrency group with cancel-in-progress: true to ensure only one analysis runs at a time per branch, canceling older jobs automatically.
Loop detection: Added a critical section to the prompt instructing Claude to:
- Check PR comments for bot comment loops before posting
- Detect if CodeRabbit/other bots triggered a cycle
- Exit immediately with a brief warning comment if a loop is detected
- Not post another full analysis if loops are detected

sonarqubecloud · 2025-11-02T02:19:17Z

Quality Gate failed

Failed conditions
3 Security Hotspots

See analysis details on SonarQube Cloud

Copilot AI self-assigned this Nov 2, 2025

Initial plan

c941367

Copilot AI assigned strawgate Nov 2, 2025

Copilot started work on behalf of strawgate November 2, 2025 00:47 View session

Add Claude workflow for analyzing test failures

2cbe27b

Co-authored-by: strawgate <6384545+strawgate@users.noreply.github.com>

Copilot AI and others added 2 commits November 2, 2025 00:51

Document Claude test failure workflow in AGENTS.md

261c929

Co-authored-by: strawgate <6384545+strawgate@users.noreply.github.com>

Address code review feedback: clarify tool usage and add comment

a167d7f

Co-authored-by: strawgate <6384545+strawgate@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Add Claude workflow for failed tests~~ Add automated Claude workflow for test failure analysis Nov 2, 2025

Copilot AI requested a review from strawgate November 2, 2025 00:54

Copilot finished work on behalf of strawgate November 2, 2025 00:54

strawgate marked this pull request as ready for review November 2, 2025 00:55

Copilot started work on behalf of strawgate November 2, 2025 00:55 View session

Add concurrency controls and loop detection to prevent bot loops

914a4de

Co-authored-by: strawgate <6384545+strawgate@users.noreply.github.com>

Copilot finished work on behalf of strawgate November 2, 2025 00:59

Merge branch 'main' into copilot/add-claude-workflow-for-tests

092220c

strawgate merged commit cacb180 into main Nov 2, 2025
1 of 2 checks passed

strawgate deleted the copilot/add-claude-workflow-for-tests branch November 2, 2025 02:19

claude bot mentioned this pull request Nov 7, 2025

Claude Analyze Failure not working #214

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add automated Claude workflow for test failure analysis #200

Add automated Claude workflow for test failure analysis #200

Uh oh!

Copilot AI commented Nov 2, 2025 •

edited

Loading

Uh oh!

coderabbitai bot commented Nov 2, 2025

Review skipped

Uh oh!

sonarqubecloud bot commented Nov 2, 2025

Uh oh!

strawgate commented Nov 2, 2025

Uh oh!

Copilot AI commented Nov 2, 2025

Uh oh!

sonarqubecloud bot commented Nov 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add automated Claude workflow for test failure analysis #200

Add automated Claude workflow for test failure analysis #200

Uh oh!

Conversation

Copilot AI commented Nov 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Workflow Behavior

Safety Features

Comments on the Issue (you are @copilot in this section)

Uh oh!

coderabbitai bot commented Nov 2, 2025

Review skipped

Uh oh!

sonarqubecloud bot commented Nov 2, 2025

Uh oh!

strawgate commented Nov 2, 2025

Uh oh!

Copilot AI commented Nov 2, 2025

Uh oh!

sonarqubecloud bot commented Nov 2, 2025

Quality Gate failed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Nov 2, 2025 •

edited

Loading