[WIP] Fix critical recurring failure in GenAIScript model by Copilot · Pull Request #2210 · github/gh-aw

Copilot · 2025-10-23T13:30:14Z

Thanks for assigning this issue to me. I'm starting to work on it and will keep this PR's description up to date as I form a plan and make progress.

Original prompt

This section details the original issue you should resolve

<issue_title>[smoke-detector] 🚨 CRITICAL RECURRING: GenAIScript Invalid Model (gpt-4.1) - 3rd Occurrence</issue_title>
<issue_description># 🚨 CRITICAL RECURRING FAILURE - 3rd Occurrence

Summary

The Smoke GenAIScript workflow has FAILED AGAIN with the same root cause that was previously reported in #2157. This is the 3rd occurrence of this critical issue in less than 24 hours. Issue #2157 was closed as "not_planned" but the underlying configuration problem was never fixed, resulting in continued failures.

Failure Details

Run: #18739169072
Commit: 6b2c9e7 - "Remove 'defaults' section from main JSON schema (#2200)"
Trigger: schedule (automated smoke test)
Duration: 3.1 minutes
Failed Job: detection (55 seconds)
Workflow: Smoke GenAIScript

Recurrence Timeline

Occurrence	Run ID	Date	Status
1st	18727962258	2025-10-22 19:45 UTC	Issue #2157 created
2nd	18733557489	2025-10-23 00:19 UTC	#2157 closed as "not_planned"
3rd	18739169072 (current)	2025-10-23 06:07 UTC	This investigation
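The spacing between the three failures can be checked directly from the table. The sketch below is only an illustration of that arithmetic: it uses the timestamps listed above, and the helper names (`parse_utc`, `gaps_in_minutes`) are made up for this example.

```python
from datetime import datetime, timezone

# Timestamps copied from the Recurrence Timeline table (all UTC).
occurrences = [
    ("1st", "18727962258", "2025-10-22 19:45"),
    ("2nd", "18733557489", "2025-10-23 00:19"),
    ("3rd", "18739169072", "2025-10-23 06:07"),
]


def parse_utc(stamp: str) -> datetime:
    """Parse a 'YYYY-MM-DD HH:MM' string as a UTC datetime."""
    return datetime.strptime(stamp, "%Y-%m-%d %H:%M").replace(tzinfo=timezone.utc)


def gaps_in_minutes(rows):
    """Return the minutes elapsed between each pair of consecutive rows."""
    times = [parse_utc(stamp) for _, _, stamp in rows]
    return [int((b - a).total_seconds() // 60) for a, b in zip(times, times[1:])]


gaps = gaps_in_minutes(occurrences)
total = sum(gaps)
print(gaps, total)  # [274, 348] 622
```

That is 4 h 34 min between the first and second run, 5 h 48 min between the second and third, and 10 h 22 min overall, which matches the "~10 hours" figure used later in the report.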

Root Cause (UNCHANGED)

The problem remains exactly the same as documented in #2157:

The GenAIScript configuration uses an invalid OpenAI model name:

# .github/workflows/shared/genaiscript.md:6
GH_AW_AGENT_MODEL_VERSION: "openai:gpt-4.1"

gpt-4.1 does not exist in OpenAI's model catalog. Valid models include:

gpt-4o (recommended)
gpt-4-turbo
gpt-4
gpt-3.5-turbo

Error Chain

GenAIScript attempts to use model openai:gpt-4.1
OpenAI API rejects the invalid model name
GenAIScript receives undefined/null response
GenAIScript crashes: TypeError: Cannot read properties of undefined (reading 'text')
Detection job fails with exit code 255

Stack Trace

2025-10-23T06:09:52.672Z genaiscript:error {
  name: 'TypeError',
  message: "Cannot read properties of undefined (reading 'text')",
  stack: "TypeError: Cannot read properties of undefined (reading 'text')\n" +
    '    at githubActionSetOutputs ((redacted))\n' +
    '    at async Command.runScriptWithExitCode ((redacted))'
}
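The last two steps of the error chain amount to reading a field from a response that was never returned. As a rough illustration of the kind of guard that would turn this into a readable failure, here is a Python analogue. It is not GenAIScript code: `ModelResponse`, `MissingResponseError`, and `extract_output_text` are hypothetical names invented for this sketch.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ModelResponse:
    """Hypothetical stand-in for a completion result; not a GenAIScript type."""
    text: str


class MissingResponseError(RuntimeError):
    """Raised when the provider returned no usable response."""


def extract_output_text(response: Optional[ModelResponse], model: str) -> str:
    # Check for the missing response first, so the failure names the model
    # instead of surfacing as an attribute error on None.
    if response is None:
        raise MissingResponseError(
            f"no response received for model {model!r}; "
            "check the provider error and the configured model name"
        )
    return response.text
```

With a guard like this, the detection job would still fail, but the log would point at the model configuration rather than at an internal property access.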

Failed Jobs and Errors

Job Execution Summary

✅ activation - succeeded (2s)
✅ agent - succeeded (1.4m) - Agent completed successfully
❌ detection - FAILED (55s) - Threat detection crashed
✅ create_issue - succeeded (7s)
⏭️ missing_tool - skipped

Impact Assessment

Severity: 🔴 CRITICAL

All scheduled smoke tests for GenAIScript are failing
Threat detection is NOT running (security implications)
False confidence in system health due to skipped validations
3 consecutive failures in ~10 hours

Urgency: 🔴 IMMEDIATE

Simple one-line configuration fix
Blocking critical security and quality checks
Issue will continue to recur daily via scheduled runs

Scope:

Affects: All workflows using shared/genaiscript.md
Frequency: Every scheduled smoke test run (multiple times per day)
Duration: Ongoing since 2025-10-22 19:45 UTC (>10 hours)

Why This Needs Immediate Attention

Security Risk: Threat detection is disabled due to these failures
False Negatives: Team may assume smoke tests are passing when they're actually failing
Resource Waste: Every scheduled run consumes CI minutes while producing no value
Pattern Established: Without intervention, this will fail indefinitely
Simple Fix: One-line configuration change to resolve

Recommended Actions

🔴 IMMEDIATE - Fix Configuration (5 minutes)

Update .github/workflows/shared/genaiscript.md line 6:

- GH_AW_AGENT_MODEL_VERSION: "openai:gpt-4.1"
+ GH_AW_AGENT_MODEL_VERSION: "openai:gpt-4o"

🟡 SHORT-TERM - Prevent Recurrence

Add Model Validation - Create a pre-flight check that validates model names before execution
Update Documentation - Document valid model names in configuration files
Monitor Pattern - Track if this issue pattern appears in other workflows
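A pre-flight check along the lines of the first item could look roughly like the sketch below. It makes several assumptions: the allowlist is simply the set of names this issue lists as valid (not a list checked against the provider's live catalog), the setting is assumed to appear on one line in the form shown in the Root Cause section, and `find_invalid_models` is a hypothetical helper name.

```python
import re

# Model names the issue lists as valid. This is an assumption carried over
# from the report, not a list checked against the provider's live catalog.
ALLOWED_MODELS = {"gpt-4o", "gpt-4-turbo", "gpt-4", "gpt-3.5-turbo"}

# Matches a line such as: GH_AW_AGENT_MODEL_VERSION: "openai:gpt-4.1"
MODEL_LINE = re.compile(
    r'^\s*GH_AW_AGENT_MODEL_VERSION:\s*"?(?P<provider>[\w-]+):(?P<model>[\w.\-]+)"?\s*$'
)


def find_invalid_models(config_text, allowed=ALLOWED_MODELS):
    """Return (line_number, model) pairs for models not in the allowlist."""
    problems = []
    for number, line in enumerate(config_text.splitlines(), start=1):
        match = MODEL_LINE.match(line)
        if match and match.group("model") not in allowed:
            problems.append((number, match.group("model")))
    return problems


sample = 'GH_AW_AGENT_MODEL_VERSION: "openai:gpt-4.1"'
print(find_invalid_models(sample))  # [(1, 'gpt-4.1')]
```

In a workflow, a step could run this over the shared configuration file and exit non-zero when the returned list is not empty, so the job stops before any model call is made. A static allowlist goes stale as providers add or retire models, so it would need to be maintained alongside the configuration.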

🟢 LONG-TERM - Systemic Improvements

Schema Validation - Add JSON schema validation for workflow configurations
Better Error Messages - Work with GenAIScript team to improve error handling
Automated Alerts - Configure alerts for recurring failure patterns
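For the alerting item, one simple way to define a "recurring failure pattern" is the same failure signature appearing a set number of times within a time window. The sketch below assumes that definition. The 24-hour window, the threshold of 3, the signature format, and the function name `recurring_failures` are illustrative choices, not anything the repository defines.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def recurring_failures(failures, window=timedelta(hours=24), threshold=3):
    """Return signatures that failed at least `threshold` times within `window`.

    `failures` is an iterable of (signature, datetime) pairs.
    """
    by_signature = defaultdict(list)
    for signature, when in failures:
        by_signature[signature].append(when)

    flagged = []
    for signature, times in by_signature.items():
        times.sort()
        # Slide over each run of `threshold` consecutive failures.
        for start in range(len(times) - threshold + 1):
            if times[start + threshold - 1] - times[start] <= window:
                flagged.append(signature)
                break
    return flagged


# The three occurrences from the Recurrence Timeline, keyed by a
# (workflow, error message) signature chosen for this example.
signature = (
    "Smoke GenAIScript",
    "TypeError: Cannot read properties of undefined (reading 'text')",
)
history = [
    (signature, datetime(2025, 10, 22, 19, 45)),
    (signature, datetime(2025, 10, 23, 0, 19)),
    (signature, datetime(2025, 10, 23, 6, 7)),
]
flagged = recurring_failures(history)
```

Applied to the runs in this report, the signature is flagged on the third occurrence, which is the point at which this issue was escalated.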

Investigation Findings

Configuration Location

File: .github/workflows/shared/genaiscript.md
Line: 6
Variable: GH_AW_AGENT_MODEL_VERSION
Current Value:...

Fixes #2204


Initial plan

b0c39ef

Copilot AI assigned Copilot and pelikhan Oct 23, 2025

Copilot started work on behalf of pelikhan October 23, 2025 13:30

pelikhan closed this Oct 23, 2025

pelikhan deleted the copilot/fix-genaiscript-invalid-model branch October 23, 2025 13:30

Copilot stopped work on behalf of pelikhan due to an error October 23, 2025 13:31
Copilot has encountered an error. See logs for additional details.

Copilot AI requested a review from pelikhan October 23, 2025 13:31

Copilot wants to merge 1 commit into main from copilot/fix-genaiscript-invalid-model
