Large Language Models (LLMs) have revolutionized open-domain dialogue agents but encounter challenges in multi-character role-playing (MCRP) scenarios. To address these challenges, we present Neeko, a framework designed for efficient multi-character imitation. Unlike existing methods, Neeko employs a dynamic low-rank adapter (LoRA) strategy, enabling it to adapt seamlessly to diverse characters. The framework breaks the role-playing process down into agent pre-training, multi-character playing, and character incremental learning, handling both seen and unseen roles. This dynamic approach, coupled with a distinct LoRA block for each character, enhances Neeko's adaptability to each character's unique attributes, personality, and speaking pattern. As a result, Neeko outperforms most existing methods on MCRP, offering more engaging and versatile user interactions.
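At its core, Neeko routes each request through character-specific LoRA blocks, with a gate over role embeddings deciding how the blocks are applied. Below is a minimal PyTorch sketch of that idea, not the released implementation: the class name `DynamicLoRALinear`, the softmax gate, and all dimensions are illustrative assumptions; see the paper for the exact mechanism.

```python
import torch
import torch.nn as nn

class DynamicLoRALinear(nn.Module):
    """A frozen linear layer augmented with one LoRA block per character.

    A gate over the role embedding mixes the per-character low-rank
    updates, so switching characters only means switching adapters.
    """

    def __init__(self, base: nn.Linear, num_roles: int, rank: int = 8, role_dim: int = 384):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # only the adapters are trained
        in_f, out_f = base.in_features, base.out_features
        # One (A, B) low-rank pair per character; B starts at zero so the
        # initial update is a no-op, as in standard LoRA initialization.
        self.A = nn.Parameter(torch.randn(num_roles, rank, in_f) * 0.01)
        self.B = nn.Parameter(torch.zeros(num_roles, out_f, rank))
        # Gate maps a role embedding to mixing weights over the LoRA blocks.
        self.gate = nn.Linear(role_dim, num_roles)

    def forward(self, x: torch.Tensor, role_embd: torch.Tensor) -> torch.Tensor:
        w = torch.softmax(self.gate(role_embd), dim=-1)          # (num_roles,)
        # Weighted sum of per-role updates: delta = sum_r w_r * B_r @ A_r
        delta = torch.einsum("r,rok,rki->oi", w, self.B, self.A)  # (out_f, in_f)
        return self.base(x) + x @ delta.T
```

Because the base weights stay frozen, supporting a new character amounts to training one additional low-rank pair, which is what keeps the character incremental learning stage cheap.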
Clone the repository:

```bash
git clone https://github.com/weiyifan1023/Neeko.git
cd Neeko
```
Then merge and shuffle the prompted character data:

```bash
python shuffle_data.py \
    --data_dir /path/to/your/character-llm-data/ \
    --out_path /path/to/your/character-llm-data/prompted/shuffle.jsonl
```
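Conceptually, this step concatenates the per-character `.jsonl` files and shuffles the result. The sketch below shows what `shuffle_data.py` is expected to do; the `prompted_*.jsonl` file layout is an assumption borrowed from the character-llm-data release, so adjust the glob pattern to your actual file names.

```python
import glob
import json
import os
import random

def shuffle_prompted_data(data_dir: str, out_path: str, seed: int = 42) -> None:
    """Merge the per-character prompted .jsonl files and shuffle the result."""
    records = []
    for path in sorted(glob.glob(os.path.join(data_dir, "prompted", "prompted_*.jsonl"))):
        with open(path, encoding="utf-8") as f:
            records.extend(json.loads(line) for line in f if line.strip())
    random.Random(seed).shuffle(records)  # fixed seed for reproducibility
    with open(out_path, "w", encoding="utf-8") as f:
        for rec in records:
            f.write(json.dumps(rec, ensure_ascii=False) + "\n")

shuffle_prompted_data("/path/to/your/character-llm-data/",
                      "/path/to/your/character-llm-data/prompted/shuffle.jsonl")
```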
Next, encode each character profile into a role embedding. Here we take S-BERT as the example encoder:

```bash
python embd_roles.py \
    --encoder_path /path/to/your/s-bert \
    --seed_data_path /path/to/your/seed_data \
    --save_path /path/to/save/your/role_embds
```
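If you want to adapt this step to another encoder, the sketch below shows the general shape using the `sentence-transformers` package. The seed-data schema (a JSON list of roles with `name` and `profile` fields) and the `torch.save` output format are assumptions; adapt them to your actual data.

```python
import json

import torch
from sentence_transformers import SentenceTransformer

def embed_roles(encoder_path: str, seed_data_path: str, save_path: str) -> None:
    """Encode one profile text per character into a fixed role embedding."""
    encoder = SentenceTransformer(encoder_path)  # e.g. an S-BERT checkpoint
    with open(seed_data_path, encoding="utf-8") as f:
        roles = json.load(f)  # assumed: [{"name": ..., "profile": ...}, ...]
    role_embds = {
        role["name"]: torch.as_tensor(encoder.encode(role["profile"]))
        for role in roles
    }
    torch.save(role_embds, save_path)  # consumed later when gating LoRA blocks
```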
Finally, train the agent. We take Llama-2-7b as the example base model; replace the relevant paths in neeko.sh with your own (e.g., the base model, shuffled data, and role embeddings from the steps above), and then execute:

```bash
bash neeko.sh
```
If you find our paper inspiring and use it in your work, please cite it:
```bibtex
@article{yu2024neeko,
  title={Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent},
  author={Yu, Xiaoyan and Luo, Tongxu and Wei, Yifan and Lei, Fangyu and Huang, Yiming and Peng, Hao and Zhu, Liehuang},
  journal={arXiv preprint arXiv:2402.13717},
  year={2024}
}
```
Contact: xiaoyan.yu@bit.edu.cn (Xiaoyan Yu), 2748113810@qq.com (Tongxu Luo), and weiyifan2021@ia.ac.cn (Yifan Wei, preferred).