LOCOST

This repo contains the code used to pretrain and finetune LOCOST.

The state-space model scripts are adapted from the official H3 repository.

Pre-trained models are available on the HuggingFace model hub.

Setup

Install both packages in the csrc/ folder:

cd csrc
cd fftconv
pip install ./
cd ../cauchy
pip install ./

Data

We expect the datasets to be tokenized with the base LongT5 tokenizer. This formatting can be done with the script preprocess_data.py.

Env

These scripts rely on a .env file, which is loaded through the python-dotenv package. Make sure it defines:

DATASET_PATH, the base folder where the datasets are stored.
TOKENIZER_PATH, the path to the model tokenizer (we used the LongT5 tokenizer).
CHECKPOINT_PATH, the folder where model checkpoints are saved during training.
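
A minimal sketch of how these three variables might be loaded and checked before launching a run. The helper names `missing_env_vars` and `load_and_check` and the example paths are hypothetical and not part of this repo; only `load_dotenv` comes from python-dotenv.

```python
import os

try:
    # load_dotenv is provided by the python-dotenv package.
    from dotenv import load_dotenv
except ImportError:
    # Fallback for this sketch only: rely on variables already exported in the shell.
    load_dotenv = None

# The three variables named in this README.
REQUIRED_VARS = ("DATASET_PATH", "TOKENIZER_PATH", "CHECKPOINT_PATH")

# A .env file could look like this (placeholder paths, adapt to your machine):
#   DATASET_PATH=/path/to/datasets
#   TOKENIZER_PATH=/path/to/tokenizer
#   CHECKPOINT_PATH=/path/to/checkpoints


def missing_env_vars(env=None):
    """Return the required variables that are unset or empty (hypothetical helper)."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


def load_and_check():
    """Load the .env file if python-dotenv is installed, then fail early on gaps."""
    if load_dotenv is not None:
        load_dotenv()  # looks for a .env file and loads it into os.environ
    missing = missing_env_vars()
    if missing:
        raise RuntimeError("Missing variables in .env: " + ", ".join(missing))
```

Calling `load_and_check()` at the top of a script would surface a missing variable immediately instead of partway through training.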

Pretraining

Pretraining is run with PyTorch Lightning and tracked with wandb.

TRANSFORMERS_NO_ADVISORY_WARNINGS="true" python pretrain_script.py --dataset path/to/pretraining/dataset --config configs/pretraining/locost.yaml --wandb_name locost-pretraining
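
If you prefer to launch the same command from Python (for instance to loop over several configs), the following sketch may help. It only reuses the flags and environment variable shown in the command above; the function names `build_pretrain_command` and `run_pretraining` are hypothetical and not part of this repo.

```python
import os
import subprocess
import sys


def build_pretrain_command(dataset, config, wandb_name):
    """Assemble the pretraining command from this README as an argument list."""
    return [
        sys.executable, "pretrain_script.py",
        "--dataset", dataset,
        "--config", config,
        "--wandb_name", wandb_name,
    ]


def run_pretraining(dataset, config, wandb_name):
    """Launch pretraining with the transformers advisory warnings silenced."""
    # Copy the current environment and add the variable used in the README command.
    env = dict(os.environ, TRANSFORMERS_NO_ADVISORY_WARNINGS="true")
    # check=True raises CalledProcessError if the script exits with a non-zero status.
    subprocess.run(build_pretrain_command(dataset, config, wandb_name), env=env, check=True)
```

For example, `run_pretraining("path/to/pretraining/dataset", "configs/pretraining/locost.yaml", "locost-pretraining")` would mirror the shell command above, assuming it is called from the repository root.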

