ToolCoder: Enhancing Tool Learning through Code-Centric Task Planning and Execution

Overview of ToolCoder 🚀

ToolCoder is an innovative framework that redefines tool learning for large language models (LLMs) by transforming it into a code completion task. Unlike conventional approaches that struggle with cross-task planning inconsistencies, multi-step structured execution, and error handling, ToolCoder introduces a code-centric paradigm to enhance accuracy and reliability.

🔹 Key Features & Innovations:
✅ Code-Driven Task Execution: Converts natural language queries into structured Python function scaffolds with clear input-output specifications.
✅ Structured Multi-Step Planning: Decomposes complex tasks into smaller, executable subtasks, embedding them as code comments for better task structuring.
✅ Enhanced Tool Utilization: Generates implementation code by leveraging validated function repositories and available tools, ensuring correctness.
✅ Iterative Error Diagnosis: Implements a code-based traceback mechanism to detect, analyze, and refine errors dynamically, improving execution reliability.
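The scaffold-then-repair idea in the features above can be sketched roughly as follows. This is a minimal illustration, not the repository's implementation: `search_movie`, `get_director`, `run_with_traceback`, and the `repair` callback are hypothetical names, and in a real run the repair step would presumably be an LLM call that receives the traceback.

```python
import traceback

# Hypothetical stand-in for a real tool (an assumption for illustration only).
def search_movie(title):
    catalog = {"Inception": {"director": "Christopher Nolan"}}
    return catalog[title]

# A function scaffold in the style described above: a typed signature,
# an input/output docstring, and subtasks embedded as comments.
SCAFFOLD = '''
def get_director(title: str) -> str:
    """Input: a movie title. Output: the director's name."""
    # Subtask 1: look up the movie with the search tool
    movie = search_movie(title)
    # Subtask 2: extract the director from the result
    return movie["director"]
'''

def run_with_traceback(code, func_name, args, tools, repair, max_attempts=3):
    """Execute generated code; on failure, hand the traceback to `repair` and retry."""
    for _ in range(max_attempts):
        namespace = dict(tools)  # fresh namespace exposing only the given tools
        try:
            exec(code, namespace)
            return namespace[func_name](*args)
        except Exception:
            # The full traceback text is the diagnostic passed to the repair step.
            error_report = traceback.format_exc()
            code = repair(code, error_report)
    raise RuntimeError("no working implementation after retries")
```

Passing the traceback text, rather than only the exception message, gives the repair step the failing line as well as the error type.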

🔬 Why ToolCoder?
Experiments show that ToolCoder significantly outperforms existing methods in task completion accuracy and execution robustness, demonstrating the power of code-centric approaches in tool learning. By structuring problem-solving as Python function generation, ToolCoder enables LLMs to tackle real-world tasks with greater consistency, interpretability, and efficiency.

Datasets

We have uploaded all the datasets used in our experiments, so you don’t need to download them yourself.

Requirements

openai==0.27.1
sentence_transformers
googletrans
langchain

Step-by-Step Instructions on How to Run

RestBench-TMDB

Register for a TMDB API key at the TMDB Developer Center and copy the API key to TMDB_API_KEY in the run_tmdb.py file.
Write your OpenAI API key and your Python environment path in the run_tmdb.py file.
Run our script. We use the gpt-4o-mini model by default; you can modify the code to use other models.

python run_tmdb.py
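If you would rather not hard-code keys in the script, one option is to read them from environment variables and fail early when one is missing. This is only a sketch under that assumption: the repository's scripts expect the keys to be written into run_tmdb.py, and `load_keys` and the environment variable names here are hypothetical.

```python
import os

# Assumed environment variable names; the repository itself stores
# these values directly in run_tmdb.py.
REQUIRED_KEYS = ("OPENAI_API_KEY", "TMDB_API_KEY")

def load_keys(env=None):
    """Return the required keys from `env`, raising if any is unset or empty."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_KEYS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing keys: " + ", ".join(missing))
    return {name: env[name] for name in REQUIRED_KEYS}
```

The returned values could then be assigned to the corresponding variables in the run script before it starts issuing requests.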

RestBench-Spotify

Make sure you have a Spotify account, then get your Spotify key from the Spotify Web API and fill it in in config.yaml.
Write your OpenAI API key and your Python environment path in the run_spotify.py file.
Before running the following script, make sure your Spotify device is online (for example, log in on a mobile or desktop client and play a random song), because some queries in this dataset need to query the device you are using.

python run_spotify.py

WARNING: this will remove all your data from Spotify! We recommend registering a new Spotify account to test on this dataset.
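To confirm that a device is online before running the script, you could query the Spotify Web API devices endpoint and look for an active device. The sketch below is not part of the repository: `fetch_devices` and `active_device_names` are hypothetical helpers, and the access token is assumed to carry the playback-state read scope.

```python
import json
import urllib.request

DEVICES_URL = "https://api.spotify.com/v1/me/player/devices"

def fetch_devices(access_token):
    """Request the current user's available devices (needs a valid token)."""
    request = urllib.request.Request(
        DEVICES_URL,
        headers={"Authorization": f"Bearer {access_token}"},
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)

def active_device_names(payload):
    """Return the names of devices marked active in a devices response."""
    return [
        device["name"]
        for device in payload.get("devices", [])
        if device.get("is_active")
    ]
```

An empty list from `active_device_names(fetch_devices(token))` suggests that no device is currently playing, in which case device-dependent queries are likely to fail.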

API-Bank

Write your OpenAI API key and your Python environment path in the toolcoder.py file.
To choose between testing on level-1-given-desc (lv1) and level-2-toolsearcher (lv2), manually modify data_dir on line 184 of the toolcoder.py file.

python toolcoder.py
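Instead of editing data_dir by hand each time, the level could be mapped to its directory name with a small helper. This is a sketch only: `resolve_data_dir` and the `root` default are hypothetical, and the actual directory layout should be checked against the apibank folder.

```python
import os

# Directory names taken from the instructions above; the root path is assumed.
LEVEL_DIRS = {
    "lv1": "level-1-given-desc",
    "lv2": "level-2-toolsearcher",
}

def resolve_data_dir(level, root="data"):
    """Map a short level name ("lv1" or "lv2") to its dataset directory."""
    if level not in LEVEL_DIRS:
        raise ValueError(f"unknown level {level!r}; expected one of {sorted(LEVEL_DIRS)}")
    return os.path.join(root, LEVEL_DIRS[level])
```

The result could then be assigned to data_dir in toolcoder.py, for example from a command-line argument.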

Repository: dhx20150812/ToolCoder
