simulated-trial-and-error - Improving LLMs tool using skills through simulated trial and error. #750
Labels
Algorithms
Sorting, Learning or Classifying. All algorithms go here.
Git-Repo
Source code repository like gitlab or gh
llm
Large Language Models
llm-experiments
experiments with large language models
New-Label
Choose this option if the existing labels are insufficient to describe the content accurately
Research
personal research notes for a topic
Software2.0
Software development driven by AI and neural networks.
simulated-trial-and-error/README.md at main · microsoft/simulated-trial-and-error
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
File Structure
Environment Setup
Put your OpenAI API key in
api_key.txt
in the parent directory.STE/
, install ToolBench, BMTools and acquire the associated API keys following their respective instructions, and thenllama-recipes/
, set up the environment following https://github.com/facebookresearch/llama-recipes.Exploration w/ STE
Custom tool
For STE with custom APIs, simply append the API names and descriptions to
API_list.json
andAPI_descriptions.json
intool_metadata/
, and change therun_tool
function inmain.py
to enable the execution of newly-added tools.Exploitation w/ STE
Data preparation
Fine-tuning & Inference
ICL
First run
demo_retrieve.ipynb
to prepare retrieved demonstration examples.Continual Learning with Rehearsal
For round {0|1|2|3},
Evaluation
STE/evaluation.ipynb
includes the evaluation scripts and cached evaluation results for all predictions files inSTE/saved_results/
Citation
Suggested labels
{'label-name': 'tool-learning', 'label-description': 'Focuses on learning through simulated trial and error using tools for large language models.', 'confidence': 75.1}
The text was updated successfully, but these errors were encountered: