We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent fac3e4b commit 469c782Copy full SHA for 469c782
README.md
@@ -18,11 +18,12 @@ The dataset are in directory `dataset` in `HDF5` format. There are three files f
18
-- text: first three sentences
19
-- datetime: date of publication
20
21
-The dataset was split 70-20-10 as train-test-val sets several times.
+The dataset was split 70-20-10 as train-test-val sets 51 times. The precomputed splits are available in:
22
-- train_idxs: Matrix with each row i containing training indexes of split i.
23
-- test_idxs: Matrix with each row i containing test indexes of split i.
24
-- val_idxs: Matrix with each row i containing val indexes of split i.
25
```
26
+The paper uses first 10 splits to compute error bars in automatic evaluations results.
27
Please see `news.py` for example loading of this dataset.
28
29
### 3. Code
0 commit comments