This repo contains trainers for language model pre-training tasks. Currently, there are two kinds:
- `LMTrainer` (normal/causal LM as well as masked LM)
- `DiscLMTrainer` (the discriminative language modelling task from the ELECTRA paper; see the sketch below)
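For reference, the ELECTRA objective behind `DiscLMTrainer` is replaced token detection: a small generator fills in masked positions, and the discriminator classifies every token as original or replaced. A minimal sketch of how those per-token labels arise (toy tensors, not this repo's actual pipeline):

```python
import torch

# Toy illustration of ELECTRA-style replaced token detection labels.
original = torch.tensor([101, 2023, 2003, 1037, 7953, 102])  # original input ids
corrupted = original.clone()
corrupted[3] = 2061                      # a generator's replacement at one position
labels = (corrupted != original).long()  # 1 = replaced, 0 = original
print(labels)                            # tensor([0, 0, 0, 1, 0, 0])
```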
We've only built small models with this library (small enough to fit on one GPU), but the code should, in principle, generalize to bigger models. We don't have the resources to experiment with that ourselves, but it should be relatively easy to adapt the lightning modules to other needs.
This package is built on top of the following libraries (short usage sketches follow the list):
- huggingface/transformers
  - model implementations (`*ForMaskedLM`, `*ForTokenClassification`) and optimizers
- huggingface/tokenizers
  - their Rust-backed fast tokenizers
- pytorch-lightning
  - Abstracts training loops, checkpointing, multi-GPU/distributed training, and other training features.
  - Theoretically supports TPUs, but this is still a work in progress.
- pytorch-lamb
  - LAMB optimizer implementation.
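As a rough illustration of the transformers pieces above, here is how the relevant model classes are typically constructed (config values are illustrative, not this repo's defaults):

```python
from transformers import BertConfig, BertForMaskedLM, BertForTokenClassification

# A small config; the values here are illustrative only.
config = BertConfig(vocab_size=30000, hidden_size=256,
                    num_hidden_layers=4, num_attention_heads=4)

mlm_model = BertForMaskedLM(config)              # masked LM head, for LMTrainer-style tasks
disc_model = BertForTokenClassification(config)  # per-token classifier, as in ELECTRA's discriminator
```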
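The fast tokenizers can be trained directly from raw text. A minimal sketch, assuming a hypothetical `corpus.txt`:

```python
from tokenizers import BertWordPieceTokenizer

# Train a WordPiece tokenizer from scratch on a plain-text corpus
# ("corpus.txt" is a placeholder path).
tokenizer = BertWordPieceTokenizer()
tokenizer.train(files=["corpus.txt"], vocab_size=30000)

encoding = tokenizer.encode("Hello, world!")
print(encoding.tokens)  # e.g. ['[CLS]', 'hello', ',', 'world', '!', '[SEP]']
```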
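And a hypothetical sketch of how a lightning module can wrap one of these models; the repo's actual `LMTrainer` will differ in detail:

```python
import pytorch_lightning as pl
import torch

class ToyLMModule(pl.LightningModule):
    """Hypothetical wrapper around a transformers *ForMaskedLM model."""

    def __init__(self, model):
        super().__init__()
        self.model = model

    def training_step(self, batch, batch_idx):
        # *ForMaskedLM models compute the LM loss when `labels` are provided.
        outputs = self.model(input_ids=batch["input_ids"],
                             attention_mask=batch["attention_mask"],
                             labels=batch["labels"])
        return outputs.loss

    def configure_optimizers(self):
        return torch.optim.AdamW(self.parameters(), lr=1e-4)

# pl.Trainer handles the loop, checkpointing, and device placement:
# pl.Trainer(max_epochs=1).fit(ToyLMModule(mlm_model), train_dataloader)
```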
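Finally, the LAMB optimizer from pytorch-lamb plugs in like any torch optimizer (hyperparameters here are illustrative):

```python
import torch
from pytorch_lamb import Lamb

model = torch.nn.Linear(10, 10)  # stand-in for a transformer model
optimizer = Lamb(model.parameters(), lr=2e-3,
                 weight_decay=0.01, betas=(0.9, 0.999))
```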