Partially Shuffling the Training Data to Improve Language Models

This repository contains the code for the Partial Shuffle method, and a modified version of the DOC language model that utilizes this method.

If you'd like to run the DOC + Partial Shuffle models, use the same commands as in the original DOC model, presented here.

The code for the Partial Shuffle method itself is in partial_shuffle.py. If you'd like to use this method in your own language model, simply import partial_shuffle.py, and call it before each epoch, as in line 196 in main.py. No other modifications are required.

Reference

If you found this code useful, please cite the following paper:

@article{press2019partially,
  title={Partially Shuffling the Training Data to Improve Language Models},
  author={Press, Ofir},
  journal={arXiv preprint arXiv:1903.04167},
  year={2019}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
cal_ppl.py		cal_ppl.py
data.py		data.py
embed_regularize.py		embed_regularize.py
finetune.py		finetune.py
generate.py		generate.py
get_data.sh		get_data.sh
locked_dropout.py		locked_dropout.py
main.py		main.py
model.py		model.py
partial_shuffle.py		partial_shuffle.py
utils.py		utils.py
weight_drop.py		weight_drop.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Partially Shuffling the Training Data to Improve Language Models

Reference

About

Releases

Packages

Languages

ofirpress/PartialShuffle

Folders and files

Latest commit

History

Repository files navigation

Partially Shuffling the Training Data to Improve Language Models

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages