Skip to content

armandstrickernlp/Task-Chitchat-Entropy

Repository files navigation

Enhancing Task-Oriented Dialogues with Chitchat: a Comparative Study Based on Lexical Diversity and Divergence

This project compares the effect of enhancing task-oriented dialogues with different types of chitchat strategies. Metrics used are entropy-based and measure the lexical diversity and divergence brought on by the chitchat enhancements. The project relies on the following datasets: Accentor, KETOD and FusedChat. These datasets are all open-source and can be freely downloaded.

Accepted @ ASRU 2023. Publication to come ! (arxiv + video)

Requirements:

This project uses Python 3.9+

Create and activate a virtual environment:

conda create -n task_chitchat_ent python=3.9

Install the requirements:

git clone git@github.com:armandstrickernlp/Task-Chitchat-Entropy.git
cd Task-Chitchat-Entropy
pip install -r requirements.txt

Compare lexical diversity and divergence

The serialized extracted utterances from each dataset are made available in the utt_data repository and are directly loaded in compare_diversity_divergence.py.

To reproduce comparison plots and results from the paper, simply run the following command:

python compare_diversity_divergence.py

This will generate the plots in a plots directory.

Code for computing metrics is in metric_utils.py and code for extracting utterances is in load_utts.py. Note: In Accentor, several chitchat candidates are proposed for most utterances. Prior to extracting the Accentor utterances for analysis, we randomly pick a chitchat candidate when possible, using generate_accentor_seeds.py.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages