You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Comprehensive statistical analysis of 145 agentic workflow lock files in the github/gh-aw repository, revealing usage patterns, popular triggers, structural characteristics, and configuration trends.
Pure Event-Driven (issues, issue_comment, etc.): Various counts
React to specific GitHub events
Used for triage, moderation, and automated responses
Schedule Patterns
View Detailed Schedule Distribution
Schedule (Cron)
Count
Description
0 13 * * 1-5
4
Daily at 1:00 PM UTC (weekdays)
0 14 * * 1-5
4
Daily at 2:00 PM UTC (weekdays)
0 11 * * 1-5
4
Daily at 11:00 AM UTC (weekdays)
0 10 * * 1-5
2
Daily at 10:00 AM UTC (weekdays)
0 9 * * 1-5
2
Daily at 9:00 AM UTC (weekdays)
0 15 * * 1-5
2
Daily at 3:00 PM UTC (weekdays)
0 16 * * 1-5
2
Daily at 4:00 PM UTC (weekdays)
0 7 * * 1-5
2
Daily at 7:00 AM UTC (weekdays)
5 12 * * *
1
Daily at 12:05 PM UTC (all days)
31 */12 * * *
1
Every 12 hours at :31 minutes
Various others
80+
Scattered throughout the day
Pattern Insight: Schedules are deliberately staggered throughout business hours (7 AM - 4 PM UTC) to distribute load and avoid resource contention. Most workflows run on weekdays only.
Safe Outputs Analysis
Safe outputs enable workflows to create GitHub resources (discussions, issues, comments) in a controlled manner.
Safe Output Type
Usage Count
Description
noop
1,243
No-operation transparency logging
missing_tool
1,108
Report missing tool/capability
missing_data
414
Report missing data/information
add-comment
90
Add comments to issues/PRs
Key Findings:
Transparency First: The noop safe output is the most common (1,243 uses), indicating workflows frequently log completion status even when no changes are needed. This ensures visibility into workflow execution.
Limitations Reporting: missing_tool (1,108) and missing_data (414) are heavily used to report capability gaps and data unavailability, providing valuable feedback about workflow constraints.
Controlled Interactions: Only 90 uses of add-comment shows disciplined use of GitHub API mutations, preventing spam and maintaining clean issue/PR threads.
Discussion Category: When creating discussions, workflows primarily target the "audits" category for reports and analysis results.
Structural Characteristics
Job Complexity
Average Jobs per Workflow: 6.0
Maximum Jobs: 9 (in firewall-escape.lock.yml)
Minimum Jobs: 2 (typical for simple workflows)
Typical Job Structure:
Activation Job: Checks workflow file timestamps and prerequisites
Agent Job: Main agentic execution with Claude/Copilot
Collect Output Jobs: Gather and process safe outputs
Output processing: 10-20 (collect, validate, format results)
Action execution: 5-20 (create GitHub resources)
Cleanup/notification: 5-10 (finalization steps)
Permission Patterns
Most Common Permissions
Permission
Read Count
Write Count
Total
contents
652
74
726
issues
131
314
445
pull-requests
129
240
369
discussions
N/A
270
270
Permission Distribution Insights:
Read-Heavy Contents Access: 652 read vs. 74 write for contents permission indicates workflows primarily analyze code rather than modify it. Write access is reserved for specific workflows that need to commit changes.
Issue Management: 314 write permissions for issues (vs. 131 read) shows active issue creation and management, likely for reporting, triage, and automation.
Pull Request Engagement: 240 write permissions for pull-requests indicates workflows actively comment on, review, or create PRs.
Discussion Creation: 270 write permissions for discussions aligns with the repository's emphasis on using discussions for audit reports and analysis results.
Minimal Permissions: All workflows follow the principle of least privilege, requesting only necessary permissions for their specific job steps.
MCP Server Usage
MCP (Model Context Protocol) servers provide specialized capabilities to agentic workflows.
MCP Server
Usage Count
Description
github
35
GitHub API integration (repos, issues, PRs, commits)
playwright
5
Browser automation and web scraping
arxiv
1
Academic paper search and retrieval
deepwiki
1
Deep wiki content exploration
Findings:
GitHub-Centric: The github MCP server dominates with 35 uses, reflecting workflows' primary focus on repository analysis, code review, and GitHub resource management.
Web Automation: Playwright MCP server (5 uses) enables workflows to interact with web UIs, test web applications, or gather data from web sources.
Specialized Research: Arxiv and deepwiki MCP servers show experimental use of specialized knowledge sources, potentially for research-oriented workflows.
Interesting Findings
High Manual Trigger Adoption (88.3%)
Nearly all workflows support workflow_dispatch, enabling on-demand execution
This provides flexibility for testing, debugging, and ad-hoc analysis
Reflects a design philosophy of "automate, but keep manual control"
Scheduled Workflow Dominance (71.7%)
Over 70% of workflows run on schedules, indicating strong automation culture
Schedules are deliberately staggered to prevent resource contention
Most run on weekdays only, respecting business hour patterns
Consistent File Size (69% in 50-70 KB range)
Remarkable consistency in workflow complexity across the repository
Suggests standardized workflow patterns and templates
Average 61 KB file size indicates substantial but not excessive configuration
High Step Count (avg 71.6 steps)
Complex multi-stage workflows with detailed orchestration
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Executive Summary
Comprehensive statistical analysis of 145 agentic workflow lock files in the
github/gh-awrepository, revealing usage patterns, popular triggers, structural characteristics, and configuration trends.Key Metrics:
File Size Distribution
Size Extremes:
codex-github-remote-mcp-test.lock.yml(22 KB)smoke-claude.lock.yml(106 KB)Key Insight: The vast majority (69%) of lock files fall into the 50-70 KB range, indicating consistent workflow complexity across the repository.
Trigger Analysis
Most Popular Triggers
Common Trigger Combinations
Schedule + Manual (
schedule,workflow_dispatch): 95 workflows (65.5%)Manual Only (
workflow_dispatch): 19 workflows (13.1%)Multi-Source (
pull_request,schedule,workflow_dispatch): 6 workflows (4.1%)Interactive Multi-Trigger (all event types): 3 workflows (2.1%)
Pure Event-Driven (
issues,issue_comment, etc.): Various countsSchedule Patterns
View Detailed Schedule Distribution
0 13 * * 1-50 14 * * 1-50 11 * * 1-50 10 * * 1-50 9 * * 1-50 15 * * 1-50 16 * * 1-50 7 * * 1-55 12 * * *31 */12 * * *Pattern Insight: Schedules are deliberately staggered throughout business hours (7 AM - 4 PM UTC) to distribute load and avoid resource contention. Most workflows run on weekdays only.
Safe Outputs Analysis
Safe outputs enable workflows to create GitHub resources (discussions, issues, comments) in a controlled manner.
noopmissing_toolmissing_dataadd-commentKey Findings:
Transparency First: The
noopsafe output is the most common (1,243 uses), indicating workflows frequently log completion status even when no changes are needed. This ensures visibility into workflow execution.Limitations Reporting:
missing_tool(1,108) andmissing_data(414) are heavily used to report capability gaps and data unavailability, providing valuable feedback about workflow constraints.Controlled Interactions: Only 90 uses of
add-commentshows disciplined use of GitHub API mutations, preventing spam and maintaining clean issue/PR threads.Discussion Category: When creating discussions, workflows primarily target the "audits" category for reports and analysis results.
Structural Characteristics
Job Complexity
firewall-escape.lock.yml)Typical Job Structure:
Step Complexity
daily-copilot-token-report.lock.yml)Step Distribution Pattern:
Permission Patterns
Most Common Permissions
Permission Distribution Insights:
Read-Heavy Contents Access: 652 read vs. 74 write for
contentspermission indicates workflows primarily analyze code rather than modify it. Write access is reserved for specific workflows that need to commit changes.Issue Management: 314 write permissions for
issues(vs. 131 read) shows active issue creation and management, likely for reporting, triage, and automation.Pull Request Engagement: 240 write permissions for
pull-requestsindicates workflows actively comment on, review, or create PRs.Discussion Creation: 270 write permissions for
discussionsaligns with the repository's emphasis on using discussions for audit reports and analysis results.Minimal Permissions: All workflows follow the principle of least privilege, requesting only necessary permissions for their specific job steps.
MCP Server Usage
MCP (Model Context Protocol) servers provide specialized capabilities to agentic workflows.
Findings:
GitHub-Centric: The
githubMCP server dominates with 35 uses, reflecting workflows' primary focus on repository analysis, code review, and GitHub resource management.Web Automation: Playwright MCP server (5 uses) enables workflows to interact with web UIs, test web applications, or gather data from web sources.
Specialized Research: Arxiv and deepwiki MCP servers show experimental use of specialized knowledge sources, potentially for research-oriented workflows.
Interesting Findings
High Manual Trigger Adoption (88.3%)
workflow_dispatch, enabling on-demand executionScheduled Workflow Dominance (71.7%)
Consistent File Size (69% in 50-70 KB range)
High Step Count (avg 71.6 steps)
Safe Output Discipline
noop(1,243) demonstrates commitment to transparencymissing_tool/missing_datacounts (1,522 combined) show workflows gracefully handle limitationsadd-commentcount (90) prevents notification spamMulti-Job Architecture (avg 6 jobs)
Weekday-Only Schedules
1-5in cron)Minimal Write Permissions
contents: 652 vs 74)Statistical Profile: The "Typical" Agentic Workflow
Based on median and average values, a typical
.lock.ymlfile in this repository has:schedule+workflow_dispatch(65.5% use this combo)contents: readissues: writepull-requests: readdiscussions: writenoopfor transparency, occasionaladd-commentRecommendations
Based on the analysis, here are actionable recommendations:
Template Standardization
Schedule Distribution
Safe Output Expansion
create-pull-requestandcreate-issuesafe outputs if not already availableadd-commentcount suggests conservative use - maintain this disciplineMCP Server Adoption
Permission Optimization
contents) is idealDocumentation
Monitoring
Historical Analysis
Methodology
Analysis Tools: Bash scripts and Python 3 with regex-based YAML parsing (no external dependencies)
Lock Files Analyzed: 145
Cache Memory: Used
/tmp/gh-aw/cache-memory/for script persistence and historical data trackingData Sources: All
.lock.ymlfiles in.github/workflows/directoryAnalysis Scripts:
/tmp/gh-aw/cache-memory/scripts/analyze_lockfiles.sh- Bash-based extraction/tmp/gh-aw/cache-memory/scripts/comprehensive_analysis_v2.py- Python statistical aggregationVerification: Cross-referenced multiple data extraction methods to ensure accuracy
References:
Beta Was this translation helpful? Give feedback.
All reactions