This repository contains the dataset used in the paper "On the Helpfulness of Document Context to Sentence Simplification".
(1) The file "With_context.txt" contains sentence pairs together with their preceding and following sentences. The format is as follows:
The simplified sentence. The original sentence. The preceding sentences. The following sentences.
The file "With_context.txt" can be obtained at https://drive.google.com/file/d/1NcvgVd6PTZuMncLJnsgnS_CIRWiwO0wH/view?usp=sharing
(2) The file "Without_context.txt" contains sentence pairs without context. The format is as follows:
The simplified sentence. The original sentence.
The file "Without_context.txt" can be obtained at https://drive.google.com/file/d/1w-yXM2m7gFFrwQttcAU3tq1kZy4MAFG4/view?usp=sharing
Before training the model, lowercase all characters in the file you train on.
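The lowercasing step can be done with a few lines of Python. The following is a minimal sketch, not part of the original release: the function name lowercase_file and the output file name in the usage note are hypothetical, and UTF-8 encoding is an assumption about the downloaded files.

```python
def lowercase_file(src, dst, encoding="utf-8"):
    """Write a lowercased copy of the text file `src` to `dst`.

    The file is processed line by line, so large files do not need
    to fit in memory. The encoding is an assumption; change it if
    the downloaded file uses a different one.
    """
    with open(src, encoding=encoding) as fin, \
         open(dst, "w", encoding=encoding) as fout:
        for line in fin:
            fout.write(line.lower())
```

For example, lowercase_file("With_context.txt", "With_context.lower.txt") writes a lowercased copy and leaves the downloaded file unchanged, so the original casing remains available if it is needed later.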