Skip to content

Fix Issue Arborist workflow: use correct search filter to exclude sub-issues#10524

Merged
pelikhan merged 4 commits intomainfrom
copilot/review-fix-arborist-workflow
Jan 18, 2026
Merged

Fix Issue Arborist workflow: use correct search filter to exclude sub-issues#10524
pelikhan merged 4 commits intomainfrom
copilot/review-fix-arborist-workflow

Conversation

Copy link
Contributor

Copilot AI commented Jan 18, 2026

The Issue Arborist workflow was analyzing only 10 issues instead of 100, producing no parent issues or links. Root cause: invalid GitHub search qualifier --search "no:parent-issue" in the gh issue list command.

Changes

Fixed invalid search filter in .github/workflows/issue-arborist.md:

- # Fetch the last 100 open issues that don't have a parent issue
- # Using search filter to exclude issues that are already sub-issues
+ # Fetch the last 100 open issues that don't have a parent issue
+ # Using search filter to exclude issues that are already sub-issues
  gh issue list --repo ${{ github.repository }} \
-   --search "no:parent-issue" \
+   --search "-parent-issue:*" \
    --state open \
    --json number,title,author,createdAt,state,url,body,labels,updatedAt,closedAt,milestone,assignees \
    --limit 100 \
    > /tmp/gh-aw/issues-data/issues.json

The correct GitHub search syntax -parent-issue:* excludes issues that have a parent (i.e., excludes sub-issues), ensuring only top-level issues are analyzed.

Updated documentation to accurately reflect that only top-level issues are fetched (excluding sub-issues).

Fixed unrelated linter warnings in pkg/workflow/create_issue_group_test.go (replaced assert.True(strings.Contains(...)) with assert.Contains(...)).

Impact

Workflow will now analyze ~100 top-level issues per run instead of 10, substantially improving its ability to identify related issues and create meaningful parent-child relationships while avoiding analyzing issues that are already sub-issues.

Original prompt

This section details on the original issue you should resolve

<issue_title>[plan] Review and fix Issue Arborist workflow effectiveness</issue_title>
<issue_description>## Objective

Investigate why the Issue Arborist workflow produced no outputs (0 parent issues, 0 links created) and fix or tune the workflow to produce meaningful results.

Context

The Issue Arborist Daily Report (#10466) shows:

  • Issues Analyzed: 10 (only open issues)
  • Parent Issues Created: 0 (reported as "None")
  • Links Created: 0 (reported as "None")

This suggests the workflow either:

  1. Found no issues meeting criteria for parent creation
  2. Has a bug preventing output generation
  3. Needs dataset expansion or tuning

Root Cause (Hypotheses)

  1. Small dataset: Only 10 open issues analyzed may be too small
  2. Overly strict criteria: Thresholds for creating parent issues may be too high
  3. Logic bug: Workflow may have an error preventing output
  4. Configuration issue: Input parameters may need adjustment

Approach

1. Review Workflow Implementation

Examine the Issue Arborist workflow:

# Find the workflow file
find .github/workflows -name "*issue*arborist*" -o -name "*arborist*"

Check:

  • Input parameters (how many issues to analyze)
  • Criteria for creating parent issues
  • Logic for detecting related issues
  • Output generation code

2. Run Diagnostic Test

Test with different parameters:

# If workflow supports manual trigger with inputs
gh workflow run issue-arborist.lock.yml -f issue_count=50
# Or test locally with workflow file

Try:

  • Increasing issue count (e.g., 50 or 100 instead of 10)
  • Lowering thresholds for parent creation
  • Testing with specific known issue sets

3. Analyze Issue Dataset

Check if the 10 issues analyzed are suitable:

# Get recent open issues
gh issue list --limit 20 --json number,title,labels,createdAt

Questions:

  • Are they related enough to warrant grouping?
  • Do they have common labels/themes?
  • Are they all new or diverse in age?

4. Fix or Tune

Based on findings:

Option A: Expand Dataset

  • Increase default issue count to 50 or 100
  • Include recently closed issues (not just open)
  • Expand time window for analysis

Option B: Adjust Criteria

  • Lower threshold for parent issue creation
  • Adjust similarity/clustering parameters
  • Make criteria more inclusive

Option C: Fix Bugs

  • Fix any logic errors discovered
  • Add error handling for edge cases
  • Improve logging for debugging

Option D: Enhance Logic

  • Improve issue relationship detection
  • Add more sophisticated grouping algorithms
  • Consider label-based or keyword-based clustering

5. Validate Fix

After changes:

  • Trigger workflow manually or wait for next scheduled run
  • Verify report shows non-zero outputs
  • Confirm parent issues and links are meaningful and useful

Files to Modify

  • .github/workflows/issue-arborist.md (workflow definition)
  • Potentially related scripts or actions used by the workflow
  • Run make recompile after modifying .md files

Acceptance Criteria

  • Root cause identified and documented
  • Fix or tuning implemented
  • Workflow produces meaningful outputs (≥1 parent issue or links)
  • Report shows valid analysis results
  • Documentation updated explaining expected behavior

Testing

# Manual test
gh workflow run issue-arborist.lock.yml
# Wait for completion, then check discussion

# Or test locally if workflow supports it
./gh-aw run issue-arborist.md

Success Metrics

AI generated by Plan Command for discussion #10496

Comments on the Issue (you are @copilot in this section)


💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.

Copilot AI and others added 2 commits January 18, 2026 02:50
Co-authored-by: mnkiefer <8320933+mnkiefer@users.noreply.github.com>
Co-authored-by: mnkiefer <8320933+mnkiefer@users.noreply.github.com>
Copilot AI changed the title [WIP] Investigate and resolve Issue Arborist workflow effectiveness Fix Issue Arborist workflow: remove invalid search filter Jan 18, 2026
Copilot AI requested a review from mnkiefer January 18, 2026 03:02
@pelikhan
Copy link
Contributor

@copilot fix search filter to only include sub issues

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>
Copy link
Contributor Author

Copilot AI commented Jan 18, 2026

@copilot fix search filter to only include sub issues

Fixed in commit abaa36c. Changed the search filter from the invalid no:parent-issue to the correct GitHub search syntax -parent-issue:* which excludes issues that have a parent (i.e., excludes sub-issues). The workflow will now fetch only top-level issues for analysis.

Copilot AI changed the title Fix Issue Arborist workflow: remove invalid search filter Fix Issue Arborist workflow: use correct search filter to exclude sub-issues Jan 18, 2026
Copilot AI requested a review from pelikhan January 18, 2026 05:49
@pelikhan pelikhan marked this pull request as ready for review January 18, 2026 05:51
@pelikhan pelikhan merged commit fab9ddf into main Jan 18, 2026
@pelikhan pelikhan deleted the copilot/review-fix-arborist-workflow branch January 18, 2026 05:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[plan] Review and fix Issue Arborist workflow effectiveness

3 participants