Finetuning_BART_for_FACET_Summarization

New!! The dataset is now available at Hugging Face 🤗

Finetuning_BART_for_FACET_Summarization

Paper: ACL 2021, Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents

Introduction

There are some things that you need to familiarise yourself with / consider:

Fine-tuning

Prerequisites:

PyTorch
Fairseq
Download the pretrained BART-large model
Get the Emerald dataset

Preprocessing data

$> python preprocess_data.py
$> bash bpe.sh
$> bash binarize.sh

For parameters, check the finetune.sh script.

Although we did not find major differences with updating the max_tokens parameter during BART finetuning, in case you want to try it, the code allows to change the parameter (in the scripts/train.py file).

Find more information at fairseq bart repo!

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
model_output/BARTFacet_OA_Test		model_output/BARTFacet_OA_Test
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Finetuning_BART_for_FACET_Summarization

Introduction

Fine-tuning

Preprocessing data

About

Releases

Packages

Contributors 2

Languages

License

khushsi/Finetuning_BART_for_FACET_Summarization

Folders and files

Latest commit

History

Repository files navigation

Finetuning_BART_for_FACET_Summarization

Introduction

Fine-tuning

Preprocessing data

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages