Empowering LLMs: Tool Learning with Real-World Interactions

This is the official repo for the SIGIR 2024 tutorial: Empowering LLMs: Tool Learning with Real-World Interactions. More details can be found at https://rulegreen.github.io/services/tools-meet-llm/

We record recent progress in LLM-based tool learning. Works are listed following the structure of the tutorial, and the list is updated continually. Feel free to raise an issue to add new works!

0 Survey

1 Definition and Scope of Tools

definition and scope of tools

0 Cognitive Tools
1 Physical Tools

1.1 Cognitive Tools

relevant cognitive tools

TPE: Towards Better Compositional Reasoning over Conceptual Tools with Multi-persona Collaboration 🔥🔥🔥
Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogues 🔥🔥🔥
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
Meta Reasoning for Large Language Models
Meta-Reasoning: Monitoring and Control of Thinking and Reasoning 🔥🔥🔥🔥🔥 personally like this

1.2 Physical Tools

relevant physical tools

API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs 🔥🔥🔥 important work, also for dialogues
Toolformer: Language Models Can Teach Themselves to Use Tools
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

2 Components and Architecture of Tool Learning

2.1 Tool Set

see above

2.2 Controller / Planner

2.3 Environments / Benchmarks

ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
CToolEval: A Chinese Benchmark for LLM-Powered Agent Evaluation in Real-World API Interactions Chinese benchmark
TOOLTALK: EVALUATING TOOL USAGE IN A CONVERSATIONAL SETTING
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use

2.4 Perceiver

Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
CRITIC: LARGE LANGUAGE MODELS CAN SELF-CORRECT WITH TOOL-INTERACTIVE CRITIQUING first to use external feedback from tools to critique/refine LLM outputs? [code]
Reflexion: Language Agents with Verbal Reinforcement Learning
Chat with the Environment: Interactive Multimodal Perception Using Large Language Models

3 Tool Learning based on LLMs

3.1 Tool-oriented Learning

A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis

3.2 Tool-augmented Learning

3.3 Learning of Tool Learning

4 Application of Tool Learning

4.1 Tool Creation, Selection and Utilization

Tool Creation

Tool Selection and Utilization

EASYTOOL: ENHANCING LLM-BASED AGENTS WITH CONCISE TOOL INSTRUCTION optimizes tool documentation
TOOLVERIFIER: Generalization to New Tools via Self-Verification fine-tunes a model to select tools based on their descriptions, then poses questions to self-refine its decisions
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems ICLR 2024
CRUXEval: A Benchmark for Code Reasoning Understanding and Execution
ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval
Empowering Large Language Model Agents through Action Learning

4.2 Tool Learning in IR

UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems 🔥🔥🔥
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection 🔥🔥🔥🔥🔥
UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval
Active Retrieval Augmented Generation EMNLP 2023 🔥🔥🔥 interesting and useful -> might also be usable in dialogues
I^3 Retriever: Incorporating Implicit Interaction in Pre-trained Language Models for Passage Retrieval
Learning Retrieval Augmentation for Personalized Dialogue Generation EMNLP 2023
PK-ICR: Persona-Knowledge Interactive Multi-Context Retrieval for Grounded Dialogue EMNLP 2023
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions ACL 2023
Self-Knowledge Guided Retrieval Augmentation for Large Language Models EMNLP 2023

4.3 Tool Learning in Embodied Environment

4.4 Tool Learning for All

5 Advanced Topic and Future Directions

definition and scope of tools

0 Multi-modal and Multi-agent Tool Learning
1 Safe, Trustworthy and Personalized Tool Learning
2 Emerging Trends and Future Opportunities

5.1 Multi-modal and Multi-agent Tool Learning

AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction 🔥🔥🔥🔥🔥
Learning to Use Tools via Cooperative and Interactive Agents with Large Language Models 🔥🔥
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Scaling Large-Language-Model-based Multi-Agent Collaboration multi-agent
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration
MOBILE-AGENT: AUTONOMOUS MULTI-MODAL MOBILE DEVICE AGENT WITH VISUAL PERCEPTION multi-modal
WEBARENA: A REALISTIC WEB ENVIRONMENT FOR BUILDING AUTONOMOUS AGENTS multi-modal

5.2 Safe, Trustworthy and Personalized Tool Learning

5.3 Emerging Trends and Future Opportunities

Self-DC: When to retrieve and When to generate? Self Divide-and-Conquer for Compositional Unknown Questions 🔥🔥
Knowledge Conflicts for LLMs: A Survey
Metacognitive Retrieval-Augmented Large Language Models tools conflicts
TORA: A TOOL-INTEGRATED REASONING AGENT FOR MATHEMATICAL PROBLEM SOLVING 🔥🔥🔥 [code]

@inproceedings{toolmeetllm,
        author = {Wang, Hongru and Qin, Yujia and Lin, Yankai and Pan, Jeff Z. and Wong, Kam-Fai},
        title = {Empowering Large Language Models: Tool Learning for Real-World Interaction},
        year = {2024},
        isbn = {9798400704314},
        publisher = {Association for Computing Machinery},
        address = {New York, NY, USA},
        url = {https://doi.org/10.1145/3626772.3661381},
        doi = {10.1145/3626772.3661381},
        abstract = {Since the advent of large language models (LLMs), the field of tool learning has remained very active in solving various tasks in practice, including but not limited to information retrieval. This half-day tutorial provides basic concepts of this field and an overview of recent advancements with several applications. In specific, we start with some foundational components and architecture of tool learning (i.e., cognitive tool and physical tool), and then we categorize existing studies in this field into tool-augmented learning and tool-oriented learning, and introduce various learning methods to empower LLMs this kind of capability. Furthermore, we provide several cases about when, what, and how to use tools in different applications. We end with some open challenges and several potential research directions for future studies. We believe this tutorial is suited for both researchers at different stages (introductory, intermediate, and advanced) and industry practitioners who are interested in LLMs and tool learning.},
        booktitle = {Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval},
        pages = {2983–2986},
        numpages = {4},
        keywords = {language agents, large language models, tool learning},
        location = {Washington DC, USA},
        series = {SIGIR '24}
}
