Skip to content

Commit 469c782

Browse files
authored
add splits info to readme
1 parent fac3e4b commit 469c782

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

README.md

+2-1
Original file line numberDiff line numberDiff line change
@@ -18,11 +18,12 @@ The dataset are in directory `dataset` in `HDF5` format. There are three files f
1818
-- text: first three sentences
1919
-- datetime: date of publication
2020
21-
The dataset was split 70-20-10 as train-test-val sets several times.
21+
The dataset was split 70-20-10 as train-test-val sets 51 times. The precomputed splits are available in:
2222
-- train_idxs: Matrix with each row i containing training indexes of split i.
2323
-- test_idxs: Matrix with each row i containing test indexes of split i.
2424
-- val_idxs: Matrix with each row i containing val indexes of split i.
2525
```
26+
The paper uses first 10 splits to compute error bars in automatic evaluations results.
2627
Please see `news.py` for example loading of this dataset.
2728

2829
### 3. Code

0 commit comments

Comments
 (0)