Skip to content

Conversation

@jeffhandley
Copy link
Member

These workflows replace the issue-labeler implementation that runs as a GitHub app, allowing training and prediction for the repository's labels without the need for an external service.

References:

Once this is merged, we'll need to run the 'Labeler: Training' workflow to train new prediction models, then 'Labeler: Promotion' workflow to promote the models into active use, and then I can decommission the integration of the legacy issue-labeler app for this repo.

The legacy issue-labeler was also applying the untriaged Issues and PRs which have not yet been triaged by a lead label to new issues and pull requests, but there's also a resourceManagement policy in this repo for managing the untriaged label. With this migration to the GitHub Action based issue-labeler, the issue-labeler will no longer apply that label, so only the resource management policy will apply for that.

@jeffhandley jeffhandley requested a review from jaredpar May 3, 2025 02:40
@jeffhandley jeffhandley self-assigned this May 3, 2025
@jeffhandley jeffhandley requested a review from a team as a code owner May 3, 2025 02:40
@ghost ghost added Area-Infrastructure untriaged Issues and PRs which have not yet been triaged by a lead labels May 3, 2025
fail-on-cache-miss: ${{ env.ALLOW_FAILURE }}
quiet: true

- name: "Predict issue labels"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fascinating!

does this use the previous training from the old labeler by any chance or do we start over?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You have to re-train but now it's all self-service within each repository, and no services to run, storage to manage, or anything else outside GitHub Actions. But the logic for building the new model is the same as the old implementation (but muuuuuuuccch faster and easier). When you run the 'Labeler: Training' job, the job summary will show the new model's accuracy based on test results where it re-predicts existing issue/pulls in the repository and compares the new prediction against the existing labels.

@arunchndr arunchndr merged commit 2edc01d into dotnet:main May 8, 2025
4 of 5 checks passed
@dotnet-policy-service dotnet-policy-service bot added this to the Next milestone May 8, 2025
@RikkiGibson RikkiGibson modified the milestones: Next, 18.0 P1 Aug 19, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Area-Infrastructure untriaged Issues and PRs which have not yet been triaged by a lead

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants