This repo is currently deprecated. For the updated benchmark view the main repo at https://github.com/Significant-Gravitas/AutoGPT in the `benchmark` folder.

Built for the purpose of benchmarking the performance of agents regardless of how they work.

Objectively know how well your agent is performing in categories like code, retrieval, memory, and safety.

Save time and money while doing it through smart dependencies. The best part? It's all automated.

Scores:

More agents coming soon !

Name		Name	Last commit message	Last commit date
Latest commit History 1,297 Commits
.github		.github
.vscode		.vscode
agbenchmark		agbenchmark
agent		agent
backend		backend
frontend @ c6a9572		frontend @ c6a9572
notebooks		notebooks
paper		paper
reports		reports
.env.example		.env.example
.flake8		.flake8
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
json_to_base_64.py		json_to_base_64.py
mypy.ini		mypy.ini
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
run.sh		run.sh
send_to_googledrive.py		send_to_googledrive.py