Stars
Building a comprehensive and handy list of papers for GUI agents
AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.
🌎💪 BrowserGym, a Gym environment for web task automation
Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"
Evaluating the Ripple Effects of Knowledge Editing in Language Models
Manual for reproducing the results of the NSM article and adapting custom datasets to it, plus relevant scripts
The official code for the EMNLP 2022 paper "SCROLLS: Standardized CompaRison Over Long Language Sequences".
Code for EMNLP'21 paper "Mitigating False-Negative Contexts in Multi-Document Question Answering with Retrieval Marginalization"