This is the code for our paper:
**Evaluating Large Language Models as Generative User Simulators for Conversational Recommendation**
Se-eun Yoon, Zhankui He, Jessica Maria Echterhoff, Julian McAuley
Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024)

Link to arXiv: https://arxiv.org/abs/2403.09738
Download the datasets into the following directories (see the expected layout after this list):

- `data/common/ml-25m` should contain the csv files downloaded from this ML-25M repository.
- `data/common/reddit` should contain the `{test, train, valid}.csv` and `id2name.json` files downloaded from this Hugging Face repository.
- `data/redial` should contain the `{test_data, train_data}.jsonl` and `movies_with_mentions.csv` files downloaded from this ReDial repository.
- `data/demographic` and `data/imdb` already have their contents in this repository.
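Based on the files listed above, the expected layout is:

```
data/
├── common/
│   ├── ml-25m/          # csv files from the ML-25M repository
│   └── reddit/          # test.csv, train.csv, valid.csv, id2name.json
├── redial/              # test_data.jsonl, train_data.jsonl, movies_with_mentions.csv
├── demographic/         # contents already included in this repository
└── imdb/                # contents already included in this repository
```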
Next, preprocess the data with the notebooks in `preprocess_notebooks`. The order in which you run them does not matter, except that `preprocess_rest.ipynb` should be run last.
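If you prefer to execute the notebooks from the command line, here is a minimal sketch using `jupyter nbconvert`; it assumes all preprocessing notebooks live directly in `preprocess_notebooks` and need no interactive input:

```bash
cd preprocess_notebooks
# Execute every preprocessing notebook in place, leaving preprocess_rest.ipynb for last.
for nb in $(ls *.ipynb | grep -v preprocess_rest.ipynb); do
    jupyter nbconvert --to notebook --execute --inplace "$nb"
done
jupyter nbconvert --to notebook --execute --inplace preprocess_rest.ipynb
```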
Copy and paste your OpenAI API key into `openai_key.txt`. We used `openai` 0.28.1 in our experiments.
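For example, to pin the library version and create the key file (replace the placeholder with your actual key):

```bash
pip install openai==0.28.1
echo "YOUR_OPENAI_API_KEY" > openai_key.txt
```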
The code for running each task is in `generate.py` within each task's folder:

- Task 1 (ItemsTalk): `t1_items`
- Task 2 (BinPref): `t2_bin_preference`
- Task 3 (OpenPref): `t3_open_preference`
- Task 4 (RecRequest): `t4_requests`
- Task 5 (Feedback): `t5_feedback`
Examples of running this script are in the `run.sh` file inside each folder.
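For example, to run Task 1 (assuming the datasets are preprocessed and `openai_key.txt` is in place):

```bash
cd t1_items
bash run.sh
```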