TOVA

This repository contains the code for the paper "Transformers are Multi-State RNNs" by Matanel Oren*, Michael Hassid*, Yossi Adi, and Roy Schwartz.

How to use

First, set up the environment:

pip install transformers==4.36.2 sentencepiece
git clone https://github.com/schwartz-lab-NLP/TOVA.git

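Next, use the following example code (currently supports LLaMA and Mistral only). The import from TOVA presumably requires running the script from the directory that contains the cloned TOVA folder: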

from transformers import AutoTokenizer, AutoModelForCausalLM
from TOVA import TOVACache, enable_tova_caching

tokenizer = AutoTokenizer.from_pretrained("your_model")
model = AutoModelForCausalLM.from_pretrained("your_model")

prompt = "Enter your prompt here"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids

# use TOVA
enable_tova_caching(model)
multi_state_size = 512
cache = TOVACache(multi_state_size)

output = model.generate(input_ids, past_key_values=cache)
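
The multi_state_size argument above caps how many tokens the cache retains. As a rough illustration of that idea, and not the repository's actual implementation, the sketch below assumes the policy described in the paper: once the cache exceeds its budget, the token that receives the lowest attention weight from the newest token is dropped. The function name evict_lowest_attention and its inputs are hypothetical, and the attention weights are assumed to be already averaged over heads.

```python
def evict_lowest_attention(cached_positions, attention_weights, multi_state_size):
    """Return the token positions kept after one decoding step.

    cached_positions: positions currently in the cache, newest token included.
    attention_weights: one weight per cached position, taken from the newest
        token's query (hypothetical input, assumed averaged over heads).
    multi_state_size: maximum number of tokens the cache may hold.
    """
    if len(cached_positions) != len(attention_weights):
        raise ValueError("need exactly one attention weight per cached position")

    # Under budget: nothing has to be dropped.
    if len(cached_positions) <= multi_state_size:
        return list(cached_positions)

    # Over budget: drop the single position with the lowest attention weight.
    drop = min(range(len(attention_weights)), key=attention_weights.__getitem__)
    return [p for i, p in enumerate(cached_positions) if i != drop]


# Toy example: five cached tokens, budget of four.
kept = evict_lowest_attention(
    cached_positions=[0, 1, 2, 3, 4],
    attention_weights=[0.40, 0.05, 0.25, 0.10, 0.20],
    multi_state_size=4,
)
print(kept)
```

In this toy run, position 1 has the smallest weight and is the one removed. Note that the oldest token is not necessarily the one dropped, which is what distinguishes this kind of policy from a plain sliding window.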

Citation

@misc{oren2024transformers,
    title  = {Transformers are Multi-State {RNNs}},
    author = {Matanel Oren and Michael Hassid and Yossi Adi and Roy Schwartz},
    year   = {2024},
    note   = {{arXiv}:2401.06104},
    url    = {https://arxiv.org/abs/2401.06104},
}
