Towards Filling the Gap in Conversational Search: From Passage Retrieval to Conversational Response Generation
This repository provides resources developed within the following article [PDF]:
W. Łajewska and K. Balog. Towards Filling the Gap in Conversational Search: From Passage Retrieval to Conversational Response Generation. In: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (CIKM '23). ACM. Birmingham, United Kingdom. October 2023. 10.1145/3583780.3615132
An extended version of this paper is available on arXiv.
Research on conversational search has so far mostly focused on query rewriting and multi-stage passage retrieval. However, synthesizing the top retrieved passages into a complete, relevant, and concise response is still an open challenge. Having snippet-level annotations of relevant passages would enable both (1) the training of response generation models that are able to ground answers in actual statements and (2) the automatic evaluation of the generated responses in terms of completeness. In this paper, we address the problem of collecting high-quality snippet-level answer annotations for two of the TREC Conversational Assistance track datasets. To ensure quality, we first perform a preliminary annotation study, employing different task designs, crowdsourcing platforms, and workers with different qualifications. Based on the outcomes of this study, we refine our annotation protocol before proceeding with the full-scale data collection. Overall, we gather annotations for 1.8k question-paragraph pairs, each annotated by three independent crowd workers. The process of collecting data at this magnitude also led to multiple insights about the problem that can inform the design of future response-generation methods.
Snippet annotations collected for TREC CAsT'20 and '22 are available under data/large_scale/all. They are divided into two subfolders containing paragraph-based annotations for CAsT'20 (under data/large_scale/all/2020) and CAsT'22 (under data/large_scale/all/2022). Additionally, the two sample topics used in our preliminary study, which were annotated once more in the large-scale data annotation process, can be found under data/large_scale/topics_1-2, together with a copy of the expert annotations. The filenames encode the batch and the group of crowd workers assigned to it. Each file contains annotations by three crowd workers for 1-2 topics. All annotations aggregated in a single file can be found here, along with the train-validation-test split (1,855 query-passage pairs in total). The data is split across topics rather than queries to avoid information leakage: 5 topics in the test partition (~12%), 5 topics in the validation partition (~10%), and 32 topics in the train partition (~78%).
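For programmatic access, a minimal loading sketch in Python follows. The filename and record fields (e.g., topic_id) are hypothetical and used for illustration only; adapt them to the actual layout of the files under data/large_scale/all.

```python
import json
from collections import defaultdict

# Hypothetical filename and field names, for illustration only; adapt
# them to the actual aggregated annotations file in this repository.
with open("cast_snippets_aggregated.json") as f:
    records = json.load(f)

# Group query-passage pairs by topic: any train/validation/test split
# should keep whole topics together, mirroring the topic-level split
# described above that avoids information leakage across partitions.
by_topic = defaultdict(list)
for record in records:
    by_topic[record["topic_id"]].append(record)

print(f"{len(records)} query-passage pairs across {len(by_topic)} topics")
```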
Dataset | #queries | #passages per query | #annotators per passage | #query-passage pairs | Avg. snippet length (tokens) | #snippets per annotation |
---|---|---|---|---|---|---|
CAsT-snippets | 371 | 5 | 3 | 1,855 | 39.6 | 2.3 |
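The last two columns of the table can be reproduced from the raw annotations along the following lines. This is a sketch under assumptions: each record is taken to store one snippet list per annotator under a hypothetical "annotations" field, and tokens are approximated by whitespace splitting.

```python
def dataset_stats(records):
    """Average snippet length (tokens) and average #snippets per annotation.

    Assumes each record has an "annotations" field holding one list of
    snippet strings per crowd worker (a hypothetical structure); tokens
    are approximated by whitespace splitting.
    """
    snippet_lengths, snippets_per_annotation = [], []
    for record in records:
        for worker_snippets in record["annotations"]:
            snippets_per_annotation.append(len(worker_snippets))
            snippet_lengths.extend(len(s.split()) for s in worker_snippets)
    avg_length = sum(snippet_lengths) / len(snippet_lengths)
    avg_count = sum(snippets_per_annotation) / len(snippets_per_annotation)
    return avg_length, avg_count
```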
The annotated data for every task configuration considered in the paper is covered in detail here.
The crowdsourcing task designs and the automatic quality control mechanisms are covered in detail here.
The implementation of the similarity measures, both for inter-annotator agreement and for agreement between crowd workers and expert annotators, can be found here. To generate the result tables presented in the paper, run the following command:
```
python -m snippet_annotation.create_result_tables
```
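As an illustration of the kind of measure involved, below is a token-level F1 overlap between two annotators' snippet selections for the same passage. This is a simplified sketch, not necessarily the exact measure implemented in snippet_annotation.

```python
from collections import Counter

def token_f1(snippets_a, snippets_b):
    """Token-level F1 between two annotators' snippet selections.

    Each argument is a list of snippet strings selected from the same
    passage; snippets are merged into bags of lowercased tokens and the
    multiset overlap is scored. Illustrative only, not the repository's
    exact implementation.
    """
    tokens_a = Counter(" ".join(snippets_a).lower().split())
    tokens_b = Counter(" ".join(snippets_b).lower().split())
    overlap = sum((tokens_a & tokens_b).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(tokens_a.values())
    recall = overlap / sum(tokens_b.values())
    return 2 * precision * recall / (precision + recall)
```

For example, token_f1(["causes of climate change"], ["climate change causes"]) scores roughly 0.86: the measure ignores token order, so only the unmatched token "of" lowers the score.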
If you use the resources presented in this repository, please cite:
```
@inproceedings{Lajewska:2023:CIKM,
  author = {Weronika Łajewska and Krisztian Balog},
  title = {Towards Filling the Gap in Conversational Search: From Passage Retrieval to Conversational Response Generation},
  booktitle = {Proceedings of the 32nd ACM International Conference on Information and Knowledge Management},
  series = {CIKM '23},
  year = {2023},
  doi = {10.1145/3583780.3615132},
  publisher = {ACM}
}
```
Should you have any questions, please contact Weronika Łajewska at weronika.lajewska[AT]uis.no (with [AT] replaced by @).