Suro-One/Hyena-Hierarchy


Hyena Hierarchy

This code trains a character-level language model using a variant of a convolutional neural network architecture called Hyena.
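As a rough illustration of what "character-level" means here, the sketch below turns raw text into integer ids and next-character training pairs. It is not taken from this repository: the helper names (`build_vocab`, `encode`, `decode`, `next_char_pairs`) and the `context_len` parameter are hypothetical, and the script itself may prepare its data differently.

```python
# Minimal sketch of character-level data preparation (assumed, not the repo's code).

def build_vocab(text):
    """Map each distinct character to an integer id (sorted for determinism)."""
    chars = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(chars)}
    itos = {i: ch for ch, i in stoi.items()}
    return stoi, itos


def encode(text, stoi):
    """Convert a string into a list of character ids."""
    return [stoi[ch] for ch in text]


def decode(ids, itos):
    """Convert a list of character ids back into a string."""
    return "".join(itos[i] for i in ids)


def next_char_pairs(ids, context_len):
    """Build (context, target) pairs: each window of context_len ids
    is paired with the id of the character that follows it."""
    return [
        (ids[i:i + context_len], ids[i + context_len])
        for i in range(len(ids) - context_len)
    ]


text = "hello hyena"
stoi, itos = build_vocab(text)
ids = encode(text, stoi)
pairs = next_char_pairs(ids, context_len=4)
```

A model trained on such pairs learns to predict the next character from the preceding ones. In a torch-based script the id lists would typically be converted to tensors before training.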

Requirements

To run this code, you need to have Python 3 and the following packages installed:

  • torch
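One way to confirm the requirement before running the script is a small check like the one below. It is a sketch using only the standard library, and the helper name `missing_packages` is hypothetical. It only tests whether a package can be found, not whether its version is suitable.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names that cannot be found
    in the current Python environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_packages(["torch"])
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are installed.")
```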

Run

python hyena-split-memory.py

Credits

This code is inspired by papers by the following authors:

  • Nal Kalchbrenner, Edward Grefenstette, and Phil Blunsom
  • Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y. Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, and Christopher Ré

Affiliations:

  1. Department of Computer Science, Stanford University, Stanford, CA, USA
  2. Mila - Quebec AI Institute and DIRO, Université de Montréal, Montréal, QC, Canada