This repository contains dataset for our EMNLP 2019 paper "Attribute-aware Sequence Network for Review Summarization".
Download our dataset from https://pan.baidu.com/s/1n256L9o3DVoshum65Efo3g with password "6tkw".
train.txt, test.txt, dev.txt represent training set, testing set and development set. Each line in each file is a sample. Each line makes up of 7 elements, which are split by "\t\t". Element 1 is the user ID, element 2 is the overall rating (which is not used in this paper), element 3 is the review content and element 4 is the summary of the review, element 5-7 are age, gender and travel state.