Controlling Length in Abstractive Summarization Using a Convolutional Neural Network

This code is for EMNLP 2018 paper: Controlling Length in Abstractive Summarization Using a Convolutional Neural Network

In this paper, we propose an approach to constrain the summary length by extending a convolutional sequence to sequence model (Paper, Code(fairseq-py)).

Requirements

PyTorch version >= 0.4.0 Python version >= 3.6

pip install -r requirments

Data Preprocessing

We use CNN/Daily Mail as dataset and get train/valid/test sets according to See et al. 2017. The preprocessed data can be downloaded from here.

Binary

python -u preprocess.py --source-lang en --target-lang de --trainpref \
$TEXT/train --validpref $TEXT/valid --testpref $TEXT/test \
--destdir data-bin/cnndm41.tokenized.en-de \
--thresholdtgt 20 --thresholdsrc 20

Training

TEXT=data/cnndm41.tokenized.en-de

python length.py

cp $TEXT/*.len* data-bin/cnndm41.tokenized.en-de

mkdir -p checkpoints/fconv_cnndm41

CUDA_VISIBLE_DEVICES=0 python -u train.py data-bin/cnndm41.tokenized.en-de \
     --lr 0.2 --clip-norm 0.1 --dropout 0.2 --max-tokens 4000 \
     --label-smoothing 0.1 --force-anneal 200 \
     --save-dir checkpoints/fconv_cnndm41 --arch fconv_cnndm_en_de \
     --skip-invalid-size-inputs-valid-test --sample-without-replacement 3850 \
     --max-source-positions 500 --max-target-positions 200

The pretrained model can be downloaded from here

Testing

TEXT=data/cnndm41.tokenized.en-de

Gold Length

cp $TEXT/*.len* data-bin/cnndm41.tokenized.en-de

python generate.py data-bin/cnndm41.tokenized.en-de \
 --save-path 4120lenParam --path checkpoints/fconv_cnndm41/checkpoint_best.pt \
 --batch-size 128 --beam 5 --max-source-positions 500 \
 --max-target-positions 200 --max-len-b 100

Arbitrary Length

python fortest.py 10 #10 is the desired length.

python generate.py data-bin/cnndm41.tokenized.en-de \
 --save-path 4120lenParam --path checkpoints/fconv_cnndm41/checkpoint_best.pt \
 --batch-size 128 --beam 5 --max-source-positions 500 \
 --max-target-positions 200 --max-len-b 100

Citation

@inproceedings{summlen18,
  author    = {Yizhu Liu and
               Zhiyi Luo and
               Kenny Q. Zhu},
  title     = {Controlling Length in Abstractive Summarization Using a Convolutional
               Neural Network},
  booktitle = {Proceedings of the 2018 Conference on Empirical Methods in Natural
               Language Processing, Brussels, Belgium, October 31 - November 4, 2018},
  pages     = {4110--4119},
  publisher = {Association for Computational Linguistics},
  year      = {2018}
}

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
fairseq		fairseq
README.md		README.md
fortest.py		fortest.py
generate.py		generate.py
interactive.py		interactive.py
length.py		length.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
score.py		score.py
setting.py		setting.py
setup.py		setup.py
test.sh		test.sh
train.py		train.py
train.sh		train.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Controlling Length in Abstractive Summarization Using a Convolutional Neural Network

Requirements

Data Preprocessing

Training

Testing

Gold Length

Arbitrary Length

Citation

About

Releases

Packages

Languages

YizhuLiu/sumlen

Folders and files

Latest commit

History

Repository files navigation

Controlling Length in Abstractive Summarization Using a Convolutional Neural Network

Requirements

Data Preprocessing

Training

Testing

Gold Length

Arbitrary Length

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages