dsalnikov/wav2vec

wav2vec

This is a minimal wav2vec 2.0 implementation in plain NumPy, inspired by picoGPT.

This implementation:

  • Is about 130 lines of code, plus utilities to load and convert parameters.
  • Contains only the forward pass.
  • Is slow: it does not use a GPU, multithreading, etc.
  • Is written for educational purposes and may contain bugs.

Install

pip install -r requirements.txt

Tested on Python 3.9.

Usage

python wav2vec.py

Output

transcript:  I|HAD|THAT|CURIOSITY|BESIDE|ME|AT|THIS|MOMENT|
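
The `|` characters in the transcript look like word delimiters, which is the usual convention for character-level wav2vec 2.0 models fine-tuned with CTC. As an illustration of how such a string can be produced from per-frame scores, here is a minimal sketch of greedy CTC decoding. It is not taken from `wav2vec.py`: the function name `greedy_ctc_decode`, the toy `VOCAB`, and the blank index `BLANK_ID` are all assumptions made for this example.

```python
import numpy as np

# Hypothetical toy vocabulary; the real model's vocabulary and ordering may differ.
VOCAB = ["<pad>", "|", "A", "E", "H", "I", "T"]
BLANK_ID = 0  # assumed index of the CTC blank token


def greedy_ctc_decode(logits, vocab=VOCAB, blank_id=BLANK_ID):
    """Greedy CTC decoding: argmax per frame, merge repeats, drop blanks."""
    ids = np.argmax(logits, axis=-1)  # best token id for each frame
    if ids.size == 0:
        return ""
    # Keep a frame only if its id differs from the previous frame's id.
    keep = np.concatenate(([True], ids[1:] != ids[:-1]))
    collapsed = ids[keep]
    return "".join(vocab[i] for i in collapsed if i != blank_id)


# Toy input: one-hot "logits" for a hand-written sequence of frame ids.
frame_ids = [5, 5, 0, 1, 4, 4, 2, 0, 6, 6, 1]
logits = np.eye(len(VOCAB))[frame_ids]
print(greedy_ctc_decode(logits))  # prints I|HAT|
```

Repeated ids are merged before blanks are removed, so a blank between two identical characters keeps both of them (frames `A, <pad>, A` decode to `AA`).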

Files

  • wav2vec.py - implementation of the model layers
  • utils.py - helper functions that download and convert the model parameters and an example wav file

TODO

  • Add an option to load a user-specified wav file
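
One possible starting point for this item is sketched below, using only the standard-library `wave` module and NumPy. It is not part of this repository: `load_wav` and `normalize` are hypothetical helpers, and the sketch assumes 16-bit mono PCM input. Published wav2vec 2.0 checkpoints are commonly trained on 16 kHz audio, and zero-mean, unit-variance scaling is a common preprocessing step for them; whether `wav2vec.py` expects exactly this is an assumption.

```python
import os
import tempfile
import wave

import numpy as np


def load_wav(path):
    """Read a 16-bit mono PCM wav file; return (float32 samples, sample rate)."""
    with wave.open(path, "rb") as f:
        if f.getnchannels() != 1 or f.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM")
        rate = f.getframerate()
        raw = f.readframes(f.getnframes())
    # Little-endian int16 samples scaled to roughly [-1, 1).
    samples = np.frombuffer(raw, dtype="<i2").astype(np.float32) / 32768.0
    return samples, rate


def normalize(samples, eps=1e-7):
    """Scale the waveform to zero mean and unit variance."""
    return (samples - samples.mean()) / np.sqrt(samples.var() + eps)


# Demo: write one second of a 440 Hz tone at 16 kHz, then load it back.
rate = 16000
t = np.arange(rate) / rate
pcm = (0.5 * np.sin(2 * np.pi * 440 * t) * 32767).astype("<i2")
path = os.path.join(tempfile.mkdtemp(), "tone.wav")
with wave.open(path, "wb") as f:
    f.setnchannels(1)
    f.setsampwidth(2)
    f.setframerate(rate)
    f.writeframes(pcm.tobytes())

samples, loaded_rate = load_wav(path)
x = normalize(samples)
print(loaded_rate, x.shape)
```

A file with a different sample rate or more than one channel would need resampling or downmixing first, which this sketch leaves out.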