Agent Analysis

A comprehensive toolkit for analyzing Software Engineering (SWE) agents, focusing on performance evaluation, feature analysis, and benchmarking.

Overview

This project provides tools and utilities for:

Analyzing agent performance on software engineering tasks
Computing various metrics (code, dependency, error, instance, patch, type)
Evaluating performance gaps between different agent implementations
Processing and analyzing data from OpenHands and SWE-bench

Project Structure

analysis/: Core analysis modules
- features/metrics/: Various metric implementations for agent analysis
- models/: Data models for OpenHands and SWE-bench
- performance_gap.py: Performance gap analysis utilities
- usage.py: Usage analysis tools
notebooks/: Jupyter notebooks for analysis and visualization
- condenser_results.ipynb: Analysis of condenser results
- localization_metrics.ipynb: Metrics for code localization
- performance_gap.ipynb: Performance gap analysis

Requirements

Python ≥ 3.12
Dependencies are managed through Poetry

Installation

Ensure you have Poetry installed
Clone this repository
Run poetry install to install dependencies

Usage

The toolkit can be used either through its Python modules or via the provided Jupyter notebooks for interactive analysis.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
analysis		analysis
notebooks		notebooks
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent Analysis

Overview

Project Structure

Requirements

Installation

Usage

License

About

Releases

Packages

Contributors 3

Languages

License

All-Hands-AI/agent-analysis

Folders and files

Latest commit

History

Repository files navigation

Agent Analysis

Overview

Project Structure

Requirements

Installation

Usage

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages