Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up
RoT improves reasoning accuracy and efficiency while minimizing computational costs, leveraging Preference-Guided Reverse Reasoning and a Cognitive Preference Manager to optimally explore LLM reasoning with cognitive preferences.
πππ reversal_demo.py
from utils.llm_utils import *
from utils.prompt import *
pipeline=Pipeline(model_id=model_id, base_url=base_url, api_key=api_key, prob=True)
demos = "Input:... Output:..." #Suggest 2-shot Demos
llm_taste=rot_pipeline( pipeline, reversal_of_thought, demos=demos, warmup=5)
-
Enhance LLM-Preferred Prompts for Task Solutions
Refines prompts to align with LLM-preferred strategies, optimizing task-solving efficiency. -
Potential for Creating Diverse QA Datasets
Generates varied question-answer pairs to improve dataset diversity.
If you find our work useful for your research, please kindly cite our paper as follows:
@article{yuan2024reversal,
title={Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up},
author={Yuan, Jiahao and Du, Dehui and Zhang, Hao and Di, Zixiang and Naseem, Usman},
journal={arXiv preprint arXiv:2410.12323},
year={2024}
}