Change the repository type filter
All
Repositories list
25 repositories
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
inspect_k8s_sandbox
Publicautonomy-evals-guide
PublicSWE-bench-fork
Publicviv-task-dev
Publicinspect_ai
Publicagent-fork-of-eval-analysis-public
Public archivellm-foundry
Publicmetr-task-boilerplate
Publictask-protected-scoring
Publictask-artifacts
Publicpublic-tasks
Publicworktest-sw-eng-deps
Publictask-assets
Public.github
PublicnanoGPT
Publictask-legacy-verifier
Publictask-aux-vm-helpers
Publicpyhooks
Public archivevivaria-mentat
Public archivetask-template
Public template