Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector #29

Open
Prat011 opened this issue Jul 19, 2024 · 0 comments

Comments

@Prat011
Copy link

Prat011 commented Jul 19, 2024

Summary

Hallucination detection is a challenging task for large language models (LLMs), and existing studies heavily rely on powerful closed-source LLMs such as GPT-4. In this paper, we propose an autonomous LLM-based agent framework, called HaluAgent, which enables relatively smaller LLMs (e.g. Baichuan2-Chat 7B) to actively select suitable tools for detecting multiple hallucination types such as text, code, and mathematical expression.

Key Findings

  • HaluAgent integrates the LLM, multi-functional toolbox, and a fine-grained three-stage detection framework along with a memory mechanism.
  • HaluAgent leverages existing Chinese and English datasets to synthesize detection trajectories for fine-tuning, enabling bilingual hallucination detection.
  • Extensive experiments demonstrate that HaluAgent can perform hallucination detection on various types of tasks and datasets, achieving performance comparable to or even higher than GPT-4 without tool enhancements.

Implementation Guidance

  1. Develop a multi-functional toolbox for HaluAgent to detect various types of hallucinations.
  2. Design a fine-grained three-stage detection framework and integrate it with a memory mechanism.
  3. Fine-tune HaluAgent using existing Chinese and English datasets to enhance its bilingual detection capabilities.

Reference

Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector

Tags

  • LLM
  • Hallucination Detection
  • HaluAgent
  • Bilingual Detection

Assignee

@sawradip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant