This is a minimal implementation of a GPT-style transformer using only numpy.
Because numpy runs only on the CPU, training is limited to relatively small models. Even so, I was able to run a miniaturized version of the grokking experiment from Nanda et al. (2023) on a one-layer toy model.
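To give a flavor of what a numpy-only transformer looks like, here is a minimal sketch of single-head causal self-attention, the core operation of a GPT block. The function name, weight shapes, and variable names are illustrative assumptions, not the repo's actual API.

```python
import numpy as np

def causal_self_attention(x, Wq, Wk, Wv):
    # x: (T, d) token embeddings; Wq/Wk/Wv: (d, d_head) projection matrices.
    # Names and shapes are hypothetical, for illustration only.
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    d_head = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_head)              # (T, T) attention logits
    mask = np.triu(np.ones_like(scores), k=1)       # 1s strictly above the diagonal
    scores = np.where(mask == 1, -np.inf, scores)   # forbid attending to future tokens
    scores -= scores.max(axis=-1, keepdims=True)    # numerically stable softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v                              # (T, d_head)

rng = np.random.default_rng(0)
T, d, d_head = 5, 8, 4
x = rng.normal(size=(T, d))
Wq, Wk, Wv = (rng.normal(size=(d, d_head)) for _ in range(3))
out = causal_self_attention(x, Wq, Wk, Wv)
print(out.shape)  # (5, 4)
```

Because of the causal mask, the first output row can only attend to the first token, so it reduces to that token's value projection; the rest of a GPT block (multiple heads, an MLP, layer norm, residual connections) wraps around this same computation.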