Skip to content

peterwestuw/BottleSum

Repository files navigation

BottleSumm

Code for "BottleSum: Self-Supervised and Unsupervised Sentence Summarization using the Information Bottleneck Principle"

Simple

####compatibility:

pytorch 0.4 
pytorch-pretrained-bert
sacremoses
nltk
tqdm

Precomputed outputs:

If your main goal is to compare model outputs, we have example outputs from the DUC datasets and CNN/DM in the outputs directory.

For CNN dailymail, we include the model inputs. For DUC datasets, order of outputs corresponds to the DUC data (we do not make this available.

Temporary code instructions:

Note: We are still working on improving this section.

To run bottleSumEx with the same settings as in the paper

python bottleEx_summarize.py -S1_path <S1FILE> -S2_path <S2FILE> -rem_words 3 -out_name <OUTNAME>

and would contain one source sentence (s1) or next sentence (s2) per line. See code for more options.

To generate data for training BottleSum^Self, use the above command (these are the same setting used in the paper.

We are currently producing more in detailed instructions for training BottleSum^Self. If you are planning to use BottleSum^self, please contact us for instructions, as this process is more involved.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages