Skip to content

Latest commit

 

History

History

data

Data Pre-processing

Requirement

StanfordCoreNLP is required to obtain the dependency trees for all dataset. Please download version 3.9.2 and put the folder stanford-corenlp-full-2018-10-05 under the current directory.

Obtain the data

Download the dataset from official website and do the pre-processing in the format of sample_data.

Follow the script data_processes.sh to obtain the dependency trees for dataset.