Refactor TestRig to support configurable approval modes for behavioral evals #17168

@jerop

Description

Replace the deprecated yolo boolean flag in TestRig with an approvalMode property. This aligns the test infrastructure with the core ApprovalMode logic, so tests can explicitly target a mode such as plan or default. Update evals/test-helper.ts and the existing integration tests to use the new property.

The TestRig should continue to default to YOLO approval mode.
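
A minimal sketch of the intended shape, assuming a string-valued mode and an options object passed at setup. The names RigOptions, resolveApprovalMode, and buildArgs are hypothetical, and the --approval-mode flag name is an assumption; none of this is the actual TestRig code.

```typescript
// Hypothetical sketch: these names and shapes are illustrative assumptions,
// not the real TestRig or core ApprovalMode definitions.
type ApprovalMode = 'default' | 'plan' | 'yolo';

interface RigOptions {
  // Replaces the deprecated boolean `yolo` flag.
  approvalMode?: ApprovalMode;
}

// Resolve the effective mode. When the caller omits approvalMode,
// fall back to 'yolo' so existing tests keep their current behavior.
function resolveApprovalMode(options: RigOptions = {}): ApprovalMode {
  return options.approvalMode ?? 'yolo';
}

// Translate the resolved mode into CLI arguments.
// The flag name here is an assumption made for illustration.
function buildArgs(options: RigOptions = {}): string[] {
  return ['--approval-mode', resolveApprovalMode(options)];
}
```

With this shape, a test that needs plan mode would pass { approvalMode: 'plan' }, while tests that pass nothing still run in YOLO mode.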

Metadata

Labels

area/core: Issues related to User Interface, OS Support, Core Functionality
status/need-triage: Issues that need to be triaged by the triage automation.
workstream-rollup: Label used to tag epics and features that are associated with one of the three primary workstreams
🔒 maintainer only: ⛔ Do not contribute. Internal roadmap item.

Status

Closed
