Fix smoke-claude: handle tool failures gracefully and guarantee safe output by Copilot · Pull Request #14171 · github/gh-aw

Copilot · 2026-02-06T16:10:38Z

The smoke-claude workflow failed when the agentic-workflows tool returned errors, and didn't always create diagnostic issues on test failures. Timeout also exceeded the 10-minute target.

Changes

Timeout: Reduced from 15 to 10 minutes
agentic-workflows tool handling: Added explicit error handling with fallback behavior
- If tool fails or returns no results, mark test as ❌ but continue execution
- Clearer extraction instructions: total runs, success/failure counts, last run timestamp
Safe output guarantee: Made issue creation mandatory regardless of test results
- Added CRITICAL directive requiring issue creation before any other operations
- Include error details in issue body when tests fail
- Fallback for discussion comment step if discussion number extraction fails

Example

Before:

10. **Agentic Workflows MCP Testing**: 
   - Use the `agentic-workflows` MCP tool with the `status` method to query the status of the "smoke-claude" workflow
   - Extract key information: total runs, recent success/failure status, last run time

After:

10. **Agentic Workflows MCP Testing**: 
   - Call the `agentic-workflows` MCP tool using the `status` method with workflow name `smoke-claude`
   - If the tool returns an error or no results, mark this test as ❌ and note "Tool unavailable or workflow not found" but continue to the Output section
   - If the tool succeeds, extract key information from the response: total runs, success/failure counts, last run timestamp

Output section now explicitly requires issue creation:

**CRITICAL: You MUST create an issue regardless of test results - this is a required safe output.**

Original prompt

Fix smoke-claude to handle the agentic-workflows tool response and make sure it always create safe outputs. Increase timeout to 10 minutes.

See https://github.com/github/gh-aw/actions/runs/21757032307

💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

… outputs, reduce timeout to 10min Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan · 2026-02-06T16:21:42Z

.github/workflows/smoke-claude.md

      run-success: "🎬 **THE END** — [{workflow_name}]({run_url}) **MISSION: ACCOMPLISHED!** The hero saves the day! ✨"
      run-failure: "💫 **TO BE CONTINUED...** [{workflow_name}]({run_url}) {status}! Our hero faces unexpected challenges..."
-timeout-minutes: 15
+timeout-minutes: 10


@copilot keep it 15

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot

Pull request overview

Updates the Smoke Claude agentic workflow to better handle agentic-workflows MCP tool responses (including error cases), reduce runtime timeout, and strengthen guidance to always produce required safe outputs.

Changes:

Reduce smoke-claude workflow timeout from 15 to 10 minutes (and recompile lockfile).
Improve agentic-workflows MCP tool testing instructions to handle errors gracefully and still proceed to safe outputs.
Update output instructions to make issue creation mandatory regardless of test results.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File	Description
`docs/src/content/docs/agent-factory-status.mdx`	Removes a malformed/extra workflow row from the status table.
`.github/workflows/smoke-claude.md`	Updates timeout and strengthens instructions for MCP tool error handling + mandatory safe output issue creation.
`.github/workflows/smoke-claude.lock.yml`	Recompiled workflow YAML reflecting the updated timeout/frontmatter hash.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-06T16:27:11Z

.github/workflows/smoke-claude.md

   - ✅ or ❌ for each test result
   - Overall status: PASS or FAIL

 3. Use the `add_comment` tool with `item_number` set to the discussion number you extracted in step 9 to add a **fun comic-book style comment** to that discussion - be playful and use comic-book language like "💥 WHOOSH!"


In Output step 3, the instructions say to call add_comment with item_number set to the extracted discussion number. Elsewhere in this workflow (step 9) and in other smoke workflows, discussions are targeted via discussion_number, and using item_number risks posting to an issue/PR with the same number instead of the discussion. Consider switching this to discussion_number: <extracted_number> for consistency and correctness.

Suggested change

3. Use the `add_comment` tool with `item_number` set to the discussion number you extracted in step 9 to add a **fun comic-book style comment** to that discussion - be playful and use comic-book language like "💥 WHOOSH!"

3. Use the `add_comment` tool with `discussion_number` set to the discussion number you extracted in step 9 to add a **fun comic-book style comment** to that discussion - be playful and use comic-book language like "💥 WHOOSH!"

Initial plan

36b59a8

Copilot AI assigned Copilot and pelikhan Feb 6, 2026

Copilot started work on behalf of pelikhan February 6, 2026 16:10 View session

Initial plan for smoke-claude fix

fead8c0

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

github-actions bot mentioned this pull request Feb 6, 2026

📰 Repository Chronicle - Team Leverages AI Tools in 24-Hour Development Blitz #14172

Open

Copilot AI and others added 2 commits February 6, 2026 16:18

Fix smoke-claude: handle agentic-workflows tool response, ensure safe…

0545788

… outputs, reduce timeout to 10min Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Merge branch 'main' into copilot/fix-smoke-claude-tool-response

08a50b2

pelikhan reviewed Feb 6, 2026

View reviewed changes

pelikhan marked this pull request as ready for review February 6, 2026 16:23

Copilot AI review requested due to automatic review settings February 6, 2026 16:23

Final validation complete

f913515

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot started reviewing on behalf of pelikhan February 6, 2026 16:23 View session

Copilot AI changed the title ~~[WIP] Fix smoke-claude to handle agentic-workflows tool response~~ Fix smoke-claude: handle tool failures gracefully and guarantee safe output Feb 6, 2026

Copilot AI requested a review from pelikhan February 6, 2026 16:24

Copilot finished work on behalf of pelikhan February 6, 2026 16:24

Copilot started work on behalf of pelikhan February 6, 2026 16:26 View session

pelikhan merged commit dd39552 into main Feb 6, 2026
120 of 121 checks passed

pelikhan deleted the copilot/fix-smoke-claude-tool-response branch February 6, 2026 16:26

Copilot AI reviewed Feb 6, 2026

View reviewed changes

Copilot stopped work on behalf of pelikhan due to an error February 6, 2026 16:27
The session was cancelled by the user.

github-actions bot mentioned this pull request Feb 6, 2026

Smoke Test: Claude - 21758499044 #14188

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix smoke-claude: handle tool failures gracefully and guarantee safe output#14171

Fix smoke-claude: handle tool failures gracefully and guarantee safe output#14171
pelikhan merged 5 commits intomainfrom
copilot/fix-smoke-claude-tool-response

Copilot AI commented Feb 6, 2026 •

edited

Loading

Uh oh!

pelikhan Feb 6, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	3. Use the `add_comment` tool with `item_number` set to the discussion number you extracted in step 9 to add a fun comic-book style comment to that discussion - be playful and use comic-book language like "💥 WHOOSH!"
	3. Use the `add_comment` tool with `discussion_number` set to the discussion number you extracted in step 9 to add a fun comic-book style comment to that discussion - be playful and use comic-book language like "💥 WHOOSH!"

Conversation

Copilot AI commented Feb 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Example

Uh oh!

pelikhan Feb 6, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 6, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Feb 6, 2026 •

edited

Loading