This repository contains the data and source code for the following paper:
- M. Corsi and J. Urbano, "The Treatment of Ties in Rank-Biased Overlap", International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024.
A single ZIP file can be downloaded as well.
The rbo/ folder contains standalone implementations of all RBO variants in R and Python. Please refer to the respective README files for details.
This is the project structure:
- data/: Input data files.
- output/: Generated output files.
- rbo/: RBO implementation.
- src/: Source code in R to reproduce results from the paper.
- scratch/: Temporary files generated in the process.
All code is written in R. You will need the following packages installed from CRAN: dplyr, extraDistr, future.apply, ggplot2, glue, latex2exp, mvtnorm and rio.
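As a convenience, the package requirement above could be checked from the command line. The following is only a sketch, not part of the repository: the helper names `r_vector` and `install_missing` are hypothetical, and the CRAN mirror URL is an assumption you may want to change.

```shell
# CRAN packages listed above.
PKGS="dplyr extraDistr future.apply ggplot2 glue latex2exp mvtnorm rio"

# Hypothetical helper: print PKGS as an R character vector literal,
# e.g. c("dplyr","glue").
r_vector() {
    printf 'c('
    sep=''
    for p in $PKGS; do
        printf '%s"%s"' "$sep" "$p"
        sep=','
    done
    printf ')'
}

# Hypothetical helper: install only the packages that R does not already have.
# The mirror URL is an assumption; any CRAN mirror should work.
install_missing() {
    Rscript -e "pkgs <- $(r_vector); install.packages(setdiff(pkgs, rownames(installed.packages())), repos = 'https://cloud.r-project.org')"
}
```

Calling `install_missing` once before running the scripts should leave already-installed packages untouched.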
The source files in src/ need to be run in order. You can run each file individually by executing Rscript src/<file>.R. They will store intermediate data in scratch/ and the final data in output/.
It is important that you always run from the base directory.
- src/01-trec-download.R: downloads TREC runs (you will need a username and password; see https://trec.nist.gov/results.html). Results are stored in scratch/01-trec-download.
- src/02-trec-stats.R: computes statistics about TREC runs. Results are stored in output/trec-stats.
- src/11-rbo-trec.R: computes RBO scores between pairs of TREC runs. Results are stored in output/rbo-trec.
- src/12-rbo-synthetic.R: simulates synthetic data and computes RBO scores. Results are stored in output/rbo-synthetic.
- src/99-paper.R: generates tables and figures. Results are stored in output/figures.
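The run order listed above can be scripted. This is a minimal sketch rather than something shipped with the repository: the function name `run_pipeline` and the `RUNNER` variable are hypothetical, and it assumes the current directory is the repository base directory.

```shell
# RUNNER is a hypothetical override: leave it unset to use Rscript,
# or set it to "echo" for a dry run that only prints the script paths.
RUNNER="${RUNNER:-Rscript}"

run_pipeline() {
    # Script names as listed above, in the order they need to be run.
    for f in 01-trec-download 02-trec-stats 11-rbo-trec 12-rbo-synthetic 99-paper; do
        # Stop at the first failure instead of continuing with later steps.
        "$RUNNER" "src/$f.R" || return 1
    done
}
```

Run `run_pipeline` from the base directory to execute all five steps. Setting `RUNNER=echo` first prints the script paths without running anything, which is a quick way to check the order.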
- Databases and their contents are distributed under the terms of the Creative Commons Attribution-ShareAlike 4.0 International License.
- Software is distributed under the terms of the MIT License.
When using this archive, please cite this paper:
@inproceedings{corsi2024treatment,
author = {Corsi, Matteo and Urbano, Juli\'{a}n},
booktitle = {International ACM SIGIR Conference on Research and Development in Information Retrieval},
title = {{The Treatment of Ties in Rank-Biased Overlap}},
year = {2024},
pages = {xx--xx}
}