Code for the EACL 2021 paper "How to Evaluate a Summarizer: Study Design and Statistical Analysis for Manual Linguistic Quality Evaluation".

Most experiments can be reproduced by running the accompanying Jupyter notebook. For the significance analysis, run

scripts/r/analyse-ordinal.r anonymized_judgements/<data_file> crossed

Power analysis is not included in these steps because it is computationally expensive. To reproduce a single power-analysis step, run

python -m summaryanalysis.design_power -b <batch count> -d <docs per batch> -a <annotators per doc> <model_file> out.csv
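Since each invocation above covers one study design, it can help to generate the command lines for several designs up front and run them one at a time. The following is a minimal sketch, not part of the repository: the helper name `build_power_commands`, the output file naming pattern, and the example parameter values are all hypothetical. Only the module name and the `-b`, `-d`, and `-a` flags are taken from the command above.

```python
import itertools
import shlex


def build_power_commands(batch_counts, docs_per_batch, annotators_per_doc,
                         model_file, out_pattern="power_b{b}_d{d}_a{a}.csv"):
    """Return one argv list per (batches, docs, annotators) combination.

    The flags mirror the README command; the output naming pattern is an
    assumption made for this sketch so that runs do not overwrite each other.
    """
    commands = []
    for b, d, a in itertools.product(batch_counts, docs_per_batch,
                                     annotators_per_doc):
        commands.append([
            "python", "-m", "summaryanalysis.design_power",
            "-b", str(b),   # batch count
            "-d", str(d),   # documents per batch
            "-a", str(a),   # annotators per document
            model_file,
            out_pattern.format(b=b, d=d, a=a),
        ])
    return commands


if __name__ == "__main__":
    # Hypothetical design grid; replace MODEL_FILE with a real model file.
    for argv in build_power_commands([1, 2], [10], [3, 5], "MODEL_FILE"):
        print(shlex.join(argv))
```

The script only prints the commands. To execute one, each list can be passed to `subprocess.run(argv, check=True)`; given the cost of a single step, running them sequentially or on separate machines is probably preferable to launching them all at once.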

Repository contents:

anonymized_annotations
models
obspower
paper_lists
scripts
README.md
summary_lq.ipynb

julmaxi/summary_lq_analysis
