replace operator agent with base of new agent #1014

tkattkat · 2025-08-25T20:52:50Z

Why

Replace operator agent with new agent handler

The operator agent was an older implementation that did not use tool calling and used a single model for both high-level reasoning and low-level action execution.

What Changed

Removed operator agent (StagehandOperatorHandler)
Added new agent handler (StagehandAgentHandler)
- Leverages AI SDK for proper tool call handling
- New executionModel option for dual-model architecture
- Better error handling and retry mechanisms
- Structured tool system with Zod schema validation
ExecutionModel feature:
- Use a powerful model (like claude 4 sonnet) for reasoning and planning
- Use a faster model (like gemini 2.0 flash) for Stagehand operations like act() and extract()
- Enables cost and performance optimization

Test Plan

Tested locally with various agent tasks
Verified backward compatibility
Tested dual-model execution with different model combinations
Installed package from branch, for additional local testing to catch any additional edge cases

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

changeset-bot · 2025-08-25T20:52:54Z

🦋 Changeset detected

Latest commit: ed42209

The changes in this PR will be included in the next version bump.

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

greptile-apps

Greptile Summary

This PR implements a major architectural refactor of the agent system, replacing the entire "operator agent" implementation with a new AI SDK-based agent architecture. The changes include:

Core Architecture Changes:

Removed StagehandOperatorHandler and types/operator.ts entirely
Introduced StagehandAgentHandler as the new default agent implementation
Renamed the existing agent handler to cuaAgentHandler (Computer Use Agent) for provider-specific execution
Updated the main library exports to use the new handlers while maintaining API compatibility

New Tool System:
The PR introduces a complete tool ecosystem under lib/agent/tools/ with 11 standardized tools that wrap existing Stagehand functionality:

act.ts - Web element interaction with observe-then-act pattern
ariaTree.ts - Accessibility tree extraction for page context
close.ts - Task completion signaling
extract.ts - Data extraction from pages
fillform.ts - Optimized multi-field form filling
goto.ts - URL navigation
navback.ts - Browser history navigation
screenshot.ts - JPEG screenshot capture with compression
scroll.ts - Page scrolling functionality
wait.ts - Time-based delays
index.ts - Centralized tool factory function

Implementation Details:

All tools use AI SDK's tool() function with Zod schema validation
The new StagehandAgentHandler leverages AI SDK's generateText with built-in tool calling
Added message processing utilities in messageProcessing.ts for context compression
Updated LLM client interface with getLanguageModel() getter for AI SDK integration
Fixed minor issues like grammar corrections in evaluation tasks

The refactor maintains backward compatibility through the same public API while completely overhauling the internal agent execution model from custom schema-based responses to standardized AI SDK tool calling patterns.

Confidence score: 2/5

This PR introduces significant architectural changes that could destabilize the agent system due to the complete replacement of core functionality
Score reflects the massive scope of changes, removal of entire systems, and potential integration issues with the new AI SDK dependency requirements
Pay close attention to lib/handlers/stagehandAgentHandler.ts, lib/agent/tools/act.ts, lib/agent/tools/fillform.ts, and lib/handlers/cuaAgentHandler.ts

_{20 files reviewed, 12 comments}

_{Edit Code Review Bot Settings | Greptile}

lib/agent/utils/messageProcessing.ts

lib/agent/tools/navback.ts

lib/agent/utils/messageProcessing.ts

lib/agent/tools/goto.ts

lib/agent/tools/extract.ts

lib/handlers/cuaAgentHandler.ts

lib/agent/tools/wait.ts

lib/handlers/stagehandAgentHandler.ts

greptile-apps

Greptile Summary

This review covers only the changes made since the last review (commit 80cb25b), not the entire PR.

The latest changes implement the final pieces of the agent architecture refactor, completing the replacement of the operator agent with a new dual-agent system. The key additions include:

Agent Tool Interface Standardization: The createAgentTools function now accepts an optional AgentToolOptions interface with an executionModel parameter, providing a unified way to configure tool behavior across the agent system.
Execution Model Support: Multiple tool files (act.ts, extract.ts, fillform.ts) now support an optional executionModel parameter that allows different models to be used for tool execution versus agent reasoning. When provided, this model is passed to page.observe() and page.extract() operations.
Type System Enhancement: The AgentConfig interface in types/stagehand.ts now includes an optional executionModel field with clear documentation about its format ("provider/model") and purpose for tool execution optimization.
Agent Handler Architecture: Two new handler classes have been introduced:
- StagehandAgentHandler: A new AISDK-based agent handler that serves as the default agent implementation with comprehensive error handling and step tracking
- CuaAgentHandler: A Computer Use Agent handler for advanced visual browser automation with providers like OpenAI and Anthropic
Main Library Integration: The lib/index.ts file has been updated to use class-based agent handlers instead of function-based ones, with the new StagehandAgentHandler becoming the default while maintaining CuaAgentHandler for advanced use cases.

This refactor enables more flexible model selection where users can specify different models for high-level reasoning versus tool execution, potentially optimizing for cost and performance by using faster models for routine operations while reserving powerful models for complex tasks.

Confidence score: 3/5

This PR introduces significant architectural changes that require careful testing to ensure compatibility
Score reflects the complexity of the agent system refactor and potential for integration issues
Pay close attention to the dynamic schema evaluation in extract.ts and error handling patterns across tool files

Context used:

Rule - Use camelCase naming convention for TypeScript code and snake_case naming convention for Python code in documentation examples. (link)
Context - We enforce linting and prettier at the CI level, so no code style comments that aren't obvious. (link)

_{10 files reviewed, no comments}

_{Edit Code Review Bot Settings | Greptile}

lib/agent/tools/screenshot.ts

… into agent-revamp

lib/agent/tools/extract.ts

lib/handlers/stagehandAgentHandler.ts

@tkattkat

This PR was opened by the [Changesets release](https://github.com/changesets/action) GitHub action. When you're ready to do a release, you can merge this and the packages will be published to npm automatically. If you're not ready to do a release yet, that's fine, whenever you add more changesets to main, this PR will be updated. # Releases ## @browserbasehq/stagehand@2.5.1 ### Patch Changes - [#1082](#1082) [`8c0fd01`](8c0fd01) Thanks [@tkattkat](https://github.com/tkattkat)! - Pass stagehand object to agent instead of stagehand page - [#1104](#1104) [`a1ad06c`](a1ad06c) Thanks [@miguelg719](https://github.com/miguelg719)! - Fix logging for stagehand agent - [#1066](#1066) [`9daa584`](9daa584) Thanks [@tkattkat](https://github.com/tkattkat)! - Add playwright arguments to agent execute response - [#1077](#1077) [`7f38b3a`](7f38b3a) Thanks [@tkattkat](https://github.com/tkattkat)! - adds support for stagehand agent in the api - [#1032](#1032) [`bf2d0e7`](bf2d0e7) Thanks [@miguelg719](https://github.com/miguelg719)! - Fix for zod peer dependency support - [#1014](#1014) [`6966201`](6966201) Thanks [@tkattkat](https://github.com/tkattkat)! - Replace operator handler with base of new agent - [#1089](#1089) [`536f366`](536f366) Thanks [@miguelg719](https://github.com/miguelg719)! - Fixed info logs on api session create - [#1103](#1103) [`889cb6c`](889cb6c) Thanks [@tkattkat](https://github.com/tkattkat)! - patch custom tool support in anthropic cua client - [#1056](#1056) [`6a002b2`](6a002b2) Thanks [@chrisreadsf](https://github.com/chrisreadsf)! - remove need for duplicate project id if already passed to Stagehand - [#1090](#1090) [`8ff5c5a`](8ff5c5a) Thanks [@miguelg719](https://github.com/miguelg719)! - Improve failed act error logs - [#1014](#1014) [`6966201`](6966201) Thanks [@tkattkat](https://github.com/tkattkat)! - replace operator agent with scaffold for new stagehand agent - [#1107](#1107) [`3ccf335`](3ccf335) Thanks [@seanmcguire12](https://github.com/seanmcguire12)! - fix: url extraction not working inside an array - [#1102](#1102) [`a99aa48`](a99aa48) Thanks [@miguelg719](https://github.com/miguelg719)! - Add current page and date context to agent - [#1110](#1110) [`dda52f1`](dda52f1) Thanks [@miguelg719](https://github.com/miguelg719)! - Add support for new Gemini Computer Use models ## @browserbasehq/stagehand-evals@1.1.0 ### Minor Changes - [#1057](#1057) [`b7be89e`](b7be89e) Thanks [@filip-michalsky](https://github.com/filip-michalsky)! - added web voyager ground truth (optional), added web bench, and subset of OSWorld evals which run on a browser ### Patch Changes - [#1072](#1072) [`dc2d420`](dc2d420) Thanks [@filip-michalsky](https://github.com/filip-michalsky)! - improve evals screenshot service - add img hashing diff to add screenshots and change to screenshot intercepts from the agent - Updated dependencies \[[`8c0fd01`](8c0fd01), [`a1ad06c`](a1ad06c), [`9daa584`](9daa584), [`7f38b3a`](7f38b3a), [`bf2d0e7`](bf2d0e7), [`6966201`](6966201), [`536f366`](536f366), [`889cb6c`](889cb6c), [`6a002b2`](6a002b2), [`8ff5c5a`](8ff5c5a), [`6966201`](6966201), [`3ccf335`](3ccf335), [`a99aa48`](a99aa48), [`dda52f1`](dda52f1)]: - @browserbasehq/stagehand@2.5.1 ## @browserbasehq/stagehand-examples@1.0.10 ### Patch Changes - Updated dependencies \[[`8c0fd01`](8c0fd01), [`a1ad06c`](a1ad06c), [`9daa584`](9daa584), [`7f38b3a`](7f38b3a), [`bf2d0e7`](bf2d0e7), [`6966201`](6966201), [`536f366`](536f366), [`889cb6c`](889cb6c), [`6a002b2`](6a002b2), [`8ff5c5a`](8ff5c5a), [`6966201`](6966201), [`3ccf335`](3ccf335), [`a99aa48`](a99aa48), [`dda52f1`](dda52f1)]: - @browserbasehq/stagehand@2.5.1 Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

sameelarif and others added 10 commits August 8, 2025 13:12

restore agent integrations

bcbe2e4

working build

3719911

Merge branch 'main' into sarif/stg-519-mcp-and-tools-support

8efdd36

deps

e75a850

ignore inference summary

e3c7697

better tool calling in operator and oai

088c51f

example integrations

e16dfd0

handle malformed args from LLM

c32ebbf

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

fix "none" tool choice handling

52348c6

merge

ab7742c

greptile-apps bot reviewed Aug 25, 2025

View reviewed changes

tkattkat marked this pull request as draft August 25, 2025 22:04

miguelg719 requested review from miguelg719 and sameelarif August 25, 2025 22:26

tkattkat marked this pull request as ready for review August 25, 2025 23:05

greptile-apps bot reviewed Aug 25, 2025

View reviewed changes

sameelarif and others added 13 commits August 27, 2025 10:46

mcp docs

b581063

Merge branch 'main' into sarif/stg-519-mcp-and-tools-support

0382bae

basic implementation

5e3a7f3

move tools to own folder + add screenshot filterning

74fb597

add accessability tool + context handling for it

36dbb00

add fill form tool + agent to eval runner

b3a0138

remove operator handler

db4e31e

update naming of the computer use agent handler

664e90b

add type guard

3fe546c

update system

96c0508

add scroll tool

7b7adb9

update act tool

803642c

remove comments

14196a6

miguelg719 reviewed Sep 4, 2025

View reviewed changes

lib/agent/tools/screenshot.ts Outdated Show resolved Hide resolved

tkattkat and others added 12 commits September 3, 2025 18:42

use stagehandpage instead of page

3158586

remove screenshot console logs & use logger for extract

7927f39

add back warning when not using provider/model format

9c7f393

add docs for agent

6e2e3ec

Merge branch 'main' into agent-revamp

a08ac8d

update to use act instead of observe

7f0f11d

update copy on variable

786b139

Merge branch 'agent-revamp' of https://github.com/browserbase/stagehand…

220b37f

… into agent-revamp

remove closing page from close tool

f00222b

Merge branch 'main' into agent-revamp

001cc4f

update init stagehand and sf library card eval

77961b1

add new model to task config

6008fc7

seanmcguire12 reviewed Sep 9, 2025

View reviewed changes

lib/agent/tools/extract.ts Show resolved Hide resolved

miguelg719 approved these changes Sep 9, 2025

View reviewed changes

lib/handlers/stagehandAgentHandler.ts Show resolved Hide resolved

tkattkat added 2 commits September 9, 2025 14:00

update extract prompt

07211cc

add changeset

f86955c

seanmcguire12 approved these changes Sep 9, 2025

View reviewed changes

add url note, and remove optional from examples

ed42209

tkattkat merged commit 6966201 into main Sep 9, 2025
15 checks passed

github-actions bot mentioned this pull request Aug 13, 2025

Version Packages aaag1980/stagehand#1

Open

github-actions bot mentioned this pull request Oct 29, 2025

Version Packages #1126

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

replace operator agent with base of new agent #1014

replace operator agent with base of new agent #1014

Uh oh!

tkattkat commented Aug 25, 2025 •

edited

Loading

Uh oh!

changeset-bot bot commented Aug 25, 2025 •

edited

Loading

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

replace operator agent with base of new agent #1014

replace operator agent with base of new agent #1014

Uh oh!

Conversation

tkattkat commented Aug 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why

What Changed

Test Plan

Uh oh!

changeset-bot bot commented Aug 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Greptile Summary

Confidence score: 2/5

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Greptile Summary

Confidence score: 3/5

Context used:

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

tkattkat commented Aug 25, 2025 •

edited

Loading

changeset-bot bot commented Aug 25, 2025 •

edited

Loading