-
Notifications
You must be signed in to change notification settings - Fork 46
Description
🚨 CRITICAL RECURRING FAILURE - 3rd Occurrence
Summary
The Smoke GenAIScript workflow has FAILED AGAIN with the same root cause that was previously reported in #2157. This is the 3rd occurrence of this critical issue in less than 24 hours. Issue #2157 was closed as "not_planned" but the underlying configuration problem was never fixed, resulting in continued failures.
Failure Details
- Run: #18739169072
- Commit: 6b2c9e7 - "Remove 'defaults' section from main JSON schema (Remove "defaults" section from main JSON schema #2200)"
- Trigger: schedule (automated smoke test)
- Duration: 3.1 minutes
- Failed Job: detection (55 seconds)
- Workflow: Smoke GenAIScript
Recurrence Timeline
| Occurrence | Run ID | Date | Status |
|---|---|---|---|
| 1st | 18727962258 | 2025-10-22 19:45 UTC | Issue #2157 created |
| 2nd | 18733557489 | 2025-10-23 00:19 UTC | #2157 closed as "not_planned" |
| 3rd | 18739169072 (current) | 2025-10-23 06:07 UTC | This investigation |
Root Cause (UNCHANGED)
The problem remains exactly the same as documented in #2157:
The GenAIScript configuration uses an invalid OpenAI model name:
# .github/workflows/shared/genaiscript.md:6
GH_AW_AGENT_MODEL_VERSION: "openai:gpt-4.1"gpt-4.1 does not exist in OpenAI's model catalog. Valid models include:
gpt-4o(recommended)gpt-4-turbogpt-4gpt-3.5-turbo
Error Chain
- GenAIScript attempts to use model
openai:gpt-4.1 - OpenAI API rejects the invalid model name
- GenAIScript receives undefined/null response
- GenAIScript crashes:
TypeError: Cannot read properties of undefined (reading 'text') - Detection job fails with exit code 255
Stack Trace
2025-10-23T06:09:52.672Z genaiscript:error {
name: 'TypeError',
message: "Cannot read properties of undefined (reading 'text')",
stack: "TypeError: Cannot read properties of undefined (reading 'text')\n" +
' at githubActionSetOutputs ((redacted))\n' +
' at async Command.runScriptWithExitCode ((redacted))'
}
Failed Jobs and Errors
Job Execution Summary
- ✅ activation - succeeded (2s)
- ✅ agent - succeeded (1.4m) - Agent completed successfully
- ❌ detection - FAILED (55s) - Threat detection crashed
- ✅ create_issue - succeeded (7s)
- ⏭️ missing_tool - skipped
Impact Assessment
Severity: 🔴 CRITICAL
- All scheduled smoke tests for GenAIScript are failing
- Threat detection is NOT running (security implications)
- False confidence in system health due to skipped validations
- 3 consecutive failures in ~10 hours
Urgency: 🔴 IMMEDIATE
- Simple one-line configuration fix
- Blocking critical security and quality checks
- Issue will continue to recur daily via scheduled runs
Scope:
- Affects: All workflows using
shared/genaiscript.md - Frequency: Every scheduled smoke test run (multiple times per day)
- Duration: Ongoing since 2025-10-22 19:45 UTC (>10 hours)
Why This Needs Immediate Attention
- Security Risk: Threat detection is disabled due to these failures
- False Negatives: Team may assume smoke tests are passing when they're actually failing
- Resource Waste: Every scheduled run consumes CI minutes while producing no value
- Pattern Established: Without intervention, this will fail indefinitely
- Simple Fix: One-line configuration change to resolve
Recommended Actions
🔴 IMMEDIATE - Fix Configuration (5 minutes)
Update .github/workflows/shared/genaiscript.md line 6:
- GH_AW_AGENT_MODEL_VERSION: "openai:gpt-4.1"
+ GH_AW_AGENT_MODEL_VERSION: "openai:gpt-4o"🟡 SHORT-TERM - Prevent Recurrence
- Add Model Validation - Create a pre-flight check that validates model names before execution
- Update Documentation - Document valid model names in configuration files
- Monitor Pattern - Track if this issue pattern appears in other workflows
🟢 LONG-TERM - Systemic Improvements
- Schema Validation - Add JSON schema validation for workflow configurations
- Better Error Messages - Work with GenAIScript team to improve error handling
- Automated Alerts - Configure alerts for recurring failure patterns
Investigation Findings
Configuration Location
- File:
.github/workflows/shared/genaiscript.md - Line: 6
- Variable:
GH_AW_AGENT_MODEL_VERSION - Current Value:
"openai:gpt-4.1"❌ - Correct Value:
"openai:gpt-4o"✅
Historical Pattern
{
"pattern_signature": "GENAISCRIPT_INVALID_MODEL",
"first_occurrence": "2025-10-22T19:45:52Z",
"recurrence_count": 3,
"days_recurring": 1,
"previous_run_ids": [18727962258, 18733557489, 18739169072],
"is_flaky": false,
"external_dependency": "OpenAI API"
}Previous Issue Context
Issue #2157 documented this exact problem but was closed as "not_planned" without implementing a fix. The investigation was thorough and accurate, but the root cause was not addressed, leading to these continued failures.
This issue reopens the discussion with emphasis on:
- The recurring nature (3 occurrences)
- The security implications (disabled threat detection)
- The simplicity of the fix (one line change)
- The cost of inaction (ongoing CI failures)
Reproduction Steps
- Configure GenAIScript with model:
openai:gpt-4.1 - Run any GenAIScript-based workflow
- Observe failure when invalid model is used
- See TypeError when accessing undefined result
Related Issues
- [smoke-detector] 🔍 Smoke Test Investigation - GenAIScript Invalid Model Name (gpt-4.1) #2157 - Original investigation (closed as "not_planned")
- [smoke-detector] 🔍 Smoke Test Investigation - GenAIScript OPENAI_API_KEY Missing #2142 - Similar GenAIScript error (different root cause - missing API key)
Investigation Metadata
- Investigator: Smoke Detector (Failure Investigation Agent)
- Investigation Run: #18739232833
- Pattern:
GENAISCRIPT_INVALID_MODEL(3rd occurrence) - Investigation Record:
/tmp/gh-aw/cache-memory/investigations/2025-10-23-18739169072.json - Created: 2025-10-23T06:12:00Z
AI generated by Smoke Detector - Smoke Test Failure Investigator
AI generated by Smoke Detector - Smoke Test Failure Investigator