This code trains a character-level language model using a variant of a convolutional neural network architecture called Hyena.
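As a rough illustration of what "character-level" means here, the sketch below maps each distinct character to an integer id and builds next-character training pairs. It is a minimal example under stated assumptions, not the script's actual preprocessing: the helper names `build_vocab`, `encode`, `decode`, and `make_pairs` are hypothetical and do not come from `hyena-split-memory.py`.

```python
# Hypothetical helpers for illustration only; the real script may differ.

def build_vocab(text):
    """Map each distinct character to an integer id (sorted for determinism)."""
    chars = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(chars)}
    itos = {i: ch for ch, i in stoi.items()}
    return stoi, itos


def encode(text, stoi):
    """Turn a string into a list of character ids."""
    return [stoi[ch] for ch in text]


def decode(ids, itos):
    """Turn a list of character ids back into a string."""
    return "".join(itos[i] for i in ids)


def make_pairs(ids, context_len):
    """Build (input, target) windows; each target is its input shifted by one."""
    return [
        (ids[i:i + context_len], ids[i + 1:i + context_len + 1])
        for i in range(len(ids) - context_len)
    ]


stoi, itos = build_vocab("hello world")
ids = encode("hello", stoi)
pairs = make_pairs(ids, context_len=3)
```

A model trained on such pairs learns to predict the next character at every position of the input window.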
To run this code, you need to have Python 3 and the following packages installed:
- torch

Then run the script:

python hyena-split-memory.py
This code is inspired by the papers
by Nal Kalchbrenner, Edward Grefenstette, and Phil Blunsom.
by Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y. Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, and Christopher Ré.
Affiliations:
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Mila - Quebec AI Institute and DIRO, Université de Montréal, Montréal, QC, Canada