Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
63 changes: 48 additions & 15 deletions .github/workflows/agent-performance-analyzer.lock.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

51 changes: 42 additions & 9 deletions .github/workflows/agent-performance-analyzer.md
Original file line number Diff line number Diff line change
Expand Up @@ -66,14 +66,15 @@ As a meta-orchestrator for agent performance, you assess how well AI agents are
### 2. Agent Effectiveness Measurement

**Task completion rates:**
- Track how often agents complete their intended tasks
- Track how often agents complete their intended tasks using historical metrics
- Measure:
- Issues resolved vs. created
- PRs merged vs. created
- Issues resolved vs. created (from metrics data)
- PRs merged vs. created (use pr_merge_rate from quality_indicators)
- Campaign goals achieved
- User satisfaction indicators (reactions, comments)
- User satisfaction indicators (reactions, comments from engagement metrics)
- Calculate effectiveness scores (0-100)
- Identify agents consistently failing to complete tasks
- Compare current rates to historical averages (7-day and 30-day trends)

**Decision quality:**
- Review strategic decisions made by orchestrator agents
Expand Down Expand Up @@ -167,8 +168,31 @@ Execute these phases each run:

This workflow shares memory with other meta-orchestrators (Campaign Manager and Workflow Health Manager) to coordinate insights and avoid duplicate work.

**Shared Metrics Infrastructure:**

The Metrics Collector workflow runs daily and stores performance metrics in a structured JSON format:

1. **Latest Metrics**: `/tmp/gh-aw/repo-memory-default/memory/meta-orchestrators/metrics/latest.json`
- Most recent daily metrics snapshot
- Quick access without date calculations
- Contains all workflow metrics, engagement data, and quality indicators

2. **Historical Metrics**: `/tmp/gh-aw/repo-memory-default/memory/meta-orchestrators/metrics/daily/YYYY-MM-DD.json`
- Daily metrics for the last 30 days
- Enables trend analysis and historical comparisons
- Calculate week-over-week and month-over-month changes

**Use metrics data to:**
- Avoid redundant API queries (metrics already collected)
- Compare current performance to historical baselines
- Identify trends (improving, declining, stable)
- Calculate moving averages and detect anomalies
- Benchmark individual workflows against ecosystem averages

**Read from shared memory:**
1. Check for existing files in the memory directory:
- `metrics/latest.json` - Latest performance metrics (NEW - use this first!)
- `metrics/daily/*.json` - Historical daily metrics for trend analysis (NEW)
- `agent-performance-latest.md` - Your last run's summary
- `campaign-manager-latest.md` - Latest campaign health insights
- `workflow-health-latest.md` - Latest workflow health insights
Expand Down Expand Up @@ -201,7 +225,16 @@ This workflow shares memory with other meta-orchestrators (Campaign Manager and

### Phase 1: Data Collection (10 minutes)

1. **Gather agent outputs:**
1. **Load historical metrics from shared storage:**
- Read latest metrics from: `/tmp/gh-aw/repo-memory-default/memory/meta-orchestrators/metrics/latest.json`
- Load daily metrics for trend analysis from: `/tmp/gh-aw/repo-memory-default/memory/meta-orchestrators/metrics/daily/`
- Extract per-workflow metrics:
- Safe output counts (issues, PRs, comments, discussions)
- Workflow run statistics (total, successful, failed, success_rate)
- Engagement metrics (reactions, comments, replies)
- Quality indicators (merge rates, close times)

2. **Gather agent outputs:**
- Query recent issues/PRs/comments with agent attribution
- For each workflow, collect:
- Safe output operations from recent runs
Expand All @@ -210,17 +243,17 @@ This workflow shares memory with other meta-orchestrators (Campaign Manager and
- Project board updates
- Collect metadata: creation date, author workflow, status

2. **Analyze workflow runs:**
3. **Analyze workflow runs:**
- Get recent workflow run logs
- Extract agent decisions and actions
- Capture error messages and warnings
- Record resource usage metrics

3. **Build agent profiles:**
4. **Build agent profiles:**
- For each agent, compile:
- Total outputs created
- Total outputs created (use metrics data for efficiency)
- Output types (issues, PRs, comments, etc.)
- Success/failure patterns
- Success/failure patterns (from metrics)
- Resource consumption
- Active time periods

Expand Down
32 changes: 29 additions & 3 deletions .github/workflows/campaign-manager.lock.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

32 changes: 29 additions & 3 deletions .github/workflows/campaign-manager.md
Original file line number Diff line number Diff line change
Expand Up @@ -72,16 +72,25 @@ As a meta-orchestrator, you coordinate between multiple campaigns, analyze their
### 3. Performance Monitoring

**Aggregate metrics across campaigns:**
- Collect metrics from each campaign's project board
- Load shared metrics from: `/tmp/gh-aw/repo-memory-default/memory/meta-orchestrators/metrics/latest.json`
- Use workflow metrics for campaigns to assess:
- Workflow success rates for campaign workflows
- Safe output volume (issues, PRs created by campaign workflows)
- Engagement levels (reactions, comments on campaign outputs)
- Quality indicators (PR merge rates, issue close times)
- Collect additional metrics from each campaign's project board
- Track velocity, completion rates, and blockers
- Compare actual progress vs. expected timelines
- Identify campaigns that are ahead, on track, or behind schedule

**Trend analysis:**
- Compare current metrics with historical data
- Identify improving or degrading trends
- Load historical daily metrics from: `/tmp/gh-aw/repo-memory-default/memory/meta-orchestrators/metrics/daily/`
- Compare current metrics with historical data (7-day, 30-day trends)
- Identify improving or degrading trends in workflow performance
- Calculate velocity trends from safe output volume over time
- Predict completion dates based on velocity
- Flag campaigns at risk of missing deadlines
- Detect anomalies (sudden drops in success rate, output volume)

### 4. Strategic Decision Making

Expand Down Expand Up @@ -127,8 +136,25 @@ Execute these phases each time you run:

This workflow shares memory with other meta-orchestrators (Workflow Health Manager and Agent Performance Analyzer) to coordinate insights and avoid duplicate work.

**Shared Metrics Infrastructure:**

The Metrics Collector workflow runs daily and stores performance metrics in a structured JSON format:

1. **Latest Metrics**: `/tmp/gh-aw/repo-memory-default/memory/meta-orchestrators/metrics/latest.json`
- Most recent daily metrics snapshot
- Contains workflow success rates, safe output volumes, engagement data
- Use to assess campaign health without redundant API queries

2. **Historical Metrics**: `/tmp/gh-aw/repo-memory-default/memory/meta-orchestrators/metrics/daily/YYYY-MM-DD.json`
- Daily metrics for the last 30 days
- Calculate campaign velocity trends
- Identify performance degradation early
- Compare current vs. historical performance

**Read from shared memory:**
1. Check for existing files in the memory directory:
- `metrics/latest.json` - Latest performance metrics (NEW - use this first!)
- `metrics/daily/*.json` - Historical daily metrics for trend analysis (NEW)
- `campaign-manager-latest.md` - Your last run's summary
- `workflow-health-latest.md` - Latest workflow health insights
- `agent-performance-latest.md` - Latest agent quality insights
Expand Down
Loading