METR is a research nonprofit that works on assessing whether cutting-edge AI systems could pose catastrophic risks to society.
We build the science of accurately assessing risks, so that humanity is informed before developing transformative AI systems.
Read more about our work here.
- Vivaria
- Public Task Suite
- RE-Bench Task Suite
- Some of our open-source agents can be found at github.com/poking-agents