Port over BashSession and tests by xingyaoww · Pull Request #7 · OpenHands/software-agent-sdk

xingyaoww · 2025-08-24T20:36:02Z

No description provided.

- Created comprehensive test suite for execute_bash functionality: * test_bash_session.py: Tests for BashSession class with 12 test cases * test_bash_parsing.py: Tests for bash command parsing utilities with 30 test cases * test_bash_ps1_metadata.py: Tests for PS1 metadata parsing with 12 test cases * conftest.py: Shared test utilities to reduce code duplication - Fixed all import statements to use correct module paths: * Updated imports from old openhands_logger to openhands.core.logger.get_logger * Fixed imports to use openhands.core.runtime.tools.execute_bash modules * Added proper __init__.py files for test directory structure - Enhanced BashSession class with bug fixes and improvements: * Added no_change_timeout_seconds parameter to constructor for configurable timeouts * Fixed critical timeout logic bug: changed 'if is_blocking' to 'if not is_blocking' * Fixed initialization order: set _initialized=True before calling _clear_screen() * Added proper session cleanup handling in close() method * Fixed destructor to initialize _closed=False in constructor - Enhanced ExecuteBashObservation class: * Added command_id property to get PID from metadata * Added message property for formatted command execution description * Fixed field name from 'content' to 'output' throughout codebase - Updated all ExecuteBashAction usage: * Added required security_risk parameter to all instances * Fixed constructor calls to use proper keyword arguments * Replaced deprecated CmdRunAction references with ExecuteBashAction - All 54 tests now pass successfully - Code passes pre-commit checks (formatting, linting, type checking) Co-authored-by: openhands <openhands@all-hands.dev>

This commit addresses the critical issues raised in the code review: ## Critical Issue #1: Data Structure Design Flaw - Refactored AgentState to AgentStateRegistry using dict[str, Any] pattern - Provides loosely-coupled storage for agent-specific state - Agents store state using string keys instead of typed fields - Maintains backward compatibility with AgentState alias ## Critical Issue #2: Hidden State Mutation - Fixed _check_iterative_refinement to only increment iteration counter when refinement will actually continue - State mutation now happens AFTER validation checks pass - Function name now accurately reflects its behavior ## Critical Issue #3: Missing Tests - Added comprehensive tests for AgentStateRegistry (14 tests) - Added tests for IterativeRefinementConfig validation (4 tests) - Added tests for _check_iterative_refinement logic (12 tests) - Tests cover edge cases: max iterations, threshold boundaries, etc. ## Suggestion #5: Unnecessary Forward Reference - Removed quotes from LLM type hint in example ## Suggestion #7: Unused Method - Removed reset_iterative_refinement() as part of registry refactor ## Note on Issue #4 (FinishTool Special Case) The suggestion to move iterative refinement to the conversation layer is a valid architectural improvement but requires more significant changes. This is deferred to a future PR to keep this change focused. Co-authored-by: openhands <openhands@all-hands.dev>

xingyaoww and others added 7 commits August 24, 2025 11:16

add how we perform tests

d63e596

port over initial implementation of tmux session

9c51064

move bash session

abd8085

use relative import for conf test

1340fcc

add .agent_observation field to convert it to agent obs

fc7c6b7

remove logging, add self.error to agent_observation

6d02f1f

xingyaoww merged commit 5c54fd5 into main Aug 24, 2025
2 checks passed

xingyaoww deleted the dev branch August 24, 2025 21:02

openhands-ai bot mentioned this pull request Dec 18, 2025

Add iterative refinement example for COBOL to Java refactoring #1415

Merged

2 tasks

jpshackelford mentioned this pull request Jan 15, 2026

feat(plugin): Add Plugin.fetch() for remote plugin fetching and caching #1647

Merged

9 tasks

openhands-ai bot mentioned this pull request Feb 9, 2026

feat(examples): update critic example to demonstrate iterative refinement #1879

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Port over BashSession and tests#7

Port over BashSession and tests#7
xingyaoww merged 7 commits intomainfrom
dev

xingyaoww commented Aug 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

xingyaoww commented Aug 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant