In this repository, we present an approach to text summarization evaluation that addresses the limitations of traditional metrics by combining several complementary scoring techniques into a single assessment.
To set up the environment for improved_summac, run:

```bash
pip install -r requirements.txt
```
The notebook contains all of the experiments we ran, along with the final results.
We combine three approaches:
- NLI - Natural Language Inference
- STS - Semantic Text Similarity
- MLM - Masked Language Modeling
We build on the central idea shown in the SummaC framework and vary the granularity at which each approach operates (minimal sketches of each component follow this list):
- We test NLI, backed by an MNLI-trained model, at the sentence level only (the granularity the SummaC authors showed to be superior)
- We test STS, backed by 'all-MiniLM-L6-v2', at the sentence level and at the paragraph level
- We test MLM, backed by 'BertForMaskedLM', at the paragraph level and at the document level
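As a concrete illustration of the NLI component, here is a minimal sketch of sentence-level scoring in the spirit of SummaC-ZS: each summary sentence is scored by the entailment probability of its best-supporting document sentence, and the scores are averaged. The `roberta-large-mnli` checkpoint and the max-then-mean aggregation are illustrative assumptions, not necessarily the exact setup in the notebook.

```python
import torch
from nltk.tokenize import sent_tokenize  # requires: nltk.download("punkt")
from transformers import AutoModelForSequenceClassification, AutoTokenizer

MODEL_NAME = "roberta-large-mnli"  # assumed MNLI checkpoint; the repo may use another
nli_tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
nli_model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME).eval()

def nli_score(document: str, summary: str) -> float:
    """Score each summary sentence by its best-supporting document sentence."""
    doc_sents = sent_tokenize(document)
    sent_scores = []
    for sum_sent in sent_tokenize(summary):
        # Pair every document sentence (premise) with this summary sentence (hypothesis)
        batch = nli_tokenizer(doc_sents, [sum_sent] * len(doc_sents),
                              padding=True, truncation=True, return_tensors="pt")
        with torch.no_grad():
            probs = nli_model(**batch).logits.softmax(dim=-1)
        # roberta-large-mnli label order: contradiction, neutral, entailment
        sent_scores.append(probs[:, 2].max().item())
    return sum(sent_scores) / len(sent_scores)
```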
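For the STS component, the sketch below scores at either granularity with sentence-transformers: each summary unit is matched to its most similar document unit by cosine similarity, and the matches are averaged. The blank-line paragraph split and the max-then-mean aggregation are assumptions for illustration.

```python
from nltk.tokenize import sent_tokenize
from sentence_transformers import SentenceTransformer, util

sts_model = SentenceTransformer("all-MiniLM-L6-v2")

def sts_score(document: str, summary: str, granularity: str = "sentence") -> float:
    """Average cosine similarity between each summary unit and its closest document unit."""
    if granularity == "sentence":
        doc_units, sum_units = sent_tokenize(document), sent_tokenize(summary)
    else:
        # Paragraph level: splitting on blank lines is an assumption
        doc_units = [p for p in document.split("\n\n") if p.strip()]
        sum_units = [p for p in summary.split("\n\n") if p.strip()]
    doc_emb = sts_model.encode(doc_units, convert_to_tensor=True)
    sum_emb = sts_model.encode(sum_units, convert_to_tensor=True)
    sims = util.cos_sim(sum_emb, doc_emb)  # shape: (summary units, document units)
    return sims.max(dim=1).values.mean().item()
```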
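For the MLM component, one natural formulation is pseudo-log-likelihood scoring: mask each summary token in turn and measure how well `BertForMaskedLM` predicts it given the source text as context. The `bert-base-uncased` checkpoint and this exact conditioning scheme are assumptions; note that BERT's 512-token limit constrains how much document-level context fits.

```python
import torch
from transformers import BertForMaskedLM, BertTokenizer

mlm_tok = BertTokenizer.from_pretrained("bert-base-uncased")  # assumed checkpoint
mlm_model = BertForMaskedLM.from_pretrained("bert-base-uncased").eval()

def mlm_score(context: str, summary: str) -> float:
    """Mean log-probability of summary tokens, each masked in turn, given the context."""
    # Context is a paragraph or the full document; truncation caps it at 512 tokens
    enc = mlm_tok(context, summary, truncation=True, return_tensors="pt")
    input_ids = enc["input_ids"][0]
    # Summary tokens are the second segment (token_type_id == 1)
    summary_positions = (enc["token_type_ids"][0] == 1).nonzero().squeeze(-1)
    log_probs = []
    for pos in summary_positions:
        if input_ids[pos] == mlm_tok.sep_token_id:
            continue  # do not score separator tokens
        masked = input_ids.clone()
        masked[pos] = mlm_tok.mask_token_id
        with torch.no_grad():
            logits = mlm_model(input_ids=masked.unsqueeze(0),
                               token_type_ids=enc["token_type_ids"]).logits[0, pos]
        log_probs.append(logits.log_softmax(dim=-1)[input_ids[pos]].item())
    return sum(log_probs) / len(log_probs)
```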
We use the datasets from the SummaC benchmark, except for CoGenSumm:
- XSumFaith: an extension of the XSum dataset that emphasizes the faithfulness of summarization models.
- Polytope: presents a comprehensive typology of summarization errors.
- FactCC: focuses on factual consistency in summaries.
- SummEval: comprises summarizer outputs from a variety of models, labeled using a 5-point Likert scale for coherence, consistency, fluency, and relevance.
- FRANK: human annotations of summaries produced by models trained on the CNN/DM and XSum datasets.
Because our datasets are imbalanced, we use balanced accuracy as our main metric; a short sketch of the computation is below.
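Balanced accuracy averages recall over the two classes, so a trivial majority-class predictor scores 0.5 even on a heavily skewed dataset. Here is a minimal sketch with scikit-learn; the threshold sweep used to binarize continuous consistency scores is an assumed protocol, not necessarily what the notebook does.

```python
import numpy as np
from sklearn.metrics import balanced_accuracy_score

def best_balanced_accuracy(scores, labels) -> float:
    """Balanced accuracy at the best decision threshold over the score range."""
    scores = np.asarray(scores, dtype=float)
    # Sweep candidate thresholds and keep the best one (an assumption)
    thresholds = np.linspace(scores.min(), scores.max(), num=100)
    return max(balanced_accuracy_score(labels, scores >= t) for t in thresholds)
```

Our results are as follows: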