vision_transformer_numpy

Implementation of the Vision Transformer (ViT) in NumPy (CPU) / CuPy (GPU)

I wanted a deeper understanding of Vision Transformers (ViT), and I had not seen any previous work that demonstrates backward propagation alongside forward propagation. I therefore implemented a Vision Transformer in NumPy (CPU) / CuPy (GPU).

Here are the main benefits of implementing ViT in NumPy:

It aids in comprehending the underlying mathematics, because the learning process is not hidden behind framework abstractions.
It eliminates the need for the PyTorch framework.
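To make the first point concrete, here is a minimal sketch of what writing the forward and backward pass by hand looks like for a single linear layer. The class name `Linear`, its attributes, and the toy loss are assumptions for illustration; they are not taken from `linear.py` in this repository.

```python
import numpy as np


class Linear:
    """Hypothetical minimal linear layer: y = x @ W + b."""

    def __init__(self, in_features, out_features, rng):
        self.W = rng.standard_normal((in_features, out_features)) * 0.02
        self.b = np.zeros(out_features)

    def forward(self, x):
        # Cache the input; the backward pass needs it to compute dL/dW.
        self.x = x
        return x @ self.W + self.b

    def backward(self, grad_out):
        # grad_out is dL/dy with shape (batch, out_features).
        self.dW = self.x.T @ grad_out      # dL/dW
        self.db = grad_out.sum(axis=0)     # dL/db
        return grad_out @ self.W.T         # dL/dx, passed to the previous layer


rng = np.random.default_rng(0)
layer = Linear(4, 3, rng)
x = rng.standard_normal((5, 4))

y = layer.forward(x)
# Toy loss L = sum(y), so dL/dy is a matrix of ones.
grad_in = layer.backward(np.ones_like(y))
```

Every gradient here is an explicit matrix product, which is the kind of step an autograd framework would otherwise perform behind the scenes.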

Dataset

For the sake of simplicity, the code uses the MNIST dataset from here.
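MNIST images are 28x28 grayscale, and a ViT starts by cutting each image into non-overlapping patches. The sketch below shows one way to do that in NumPy. The function name `patchify` and the patch size of 7 are assumptions for illustration, not necessarily what `patch.py` does.

```python
import numpy as np


def patchify(images, patch_size):
    """Split (n, h, w) images into (n, num_patches, patch_size**2) flat patches."""
    n, h, w = images.shape
    if h % patch_size or w % patch_size:
        raise ValueError("image size must be divisible by patch_size")
    gh, gw = h // patch_size, w // patch_size
    # (n, gh, p, gw, p): split rows and columns into a grid of patches.
    x = images.reshape(n, gh, patch_size, gw, patch_size)
    # (n, gh, gw, p, p): bring the two grid axes next to each other.
    x = x.transpose(0, 1, 3, 2, 4)
    # Flatten the grid into a sequence and each patch into a vector.
    return x.reshape(n, gh * gw, patch_size * patch_size)


# Stand-in data with the MNIST image shape (not real MNIST digits).
images = np.arange(2 * 28 * 28, dtype=np.float32).reshape(2, 28, 28)
patches = patchify(images, patch_size=7)
```

With a patch size of 7, each 28x28 image becomes a sequence of 16 patches with 49 values each.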

Training

The trained model is currently not saved. Loss and metrics are provided.
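One possible way to persist weights is NumPy's `np.savez`, which stores several named arrays in one file. This is only a sketch under assumptions: the parameter names and the flat `params` dictionary are hypothetical, and the real model may organise its parameters differently.

```python
import os
import tempfile

import numpy as np

rng = np.random.default_rng(0)

# Hypothetical parameter dictionary: name -> array.
params = {
    "linear_W": rng.standard_normal((49, 64)),
    "linear_b": np.zeros(64),
}

# Write all arrays to a single .npz archive in a temporary directory.
path = os.path.join(tempfile.mkdtemp(), "weights_sketch.npz")
np.savez(path, **params)

# Load the arrays back into a plain dictionary.
with np.load(path) as data:
    restored = {name: data[name] for name in data.files}
```

Loading returns the arrays under the same names, so they can be copied back into the layers before evaluation or further training.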

Need to add/implement

Resolve overflow errors and the occurrence of NaN values
Save model weights
Unit tests
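A common source of overflow and NaN values in hand-written networks is exponentiating large logits in softmax or cross-entropy. Whether that is the cause here is not confirmed. The sketch below shows the usual remedy of subtracting the row maximum before calling `np.exp`. The function names are assumptions, not the ones in `softmax.py` or `cross_entropy_loss.py`.

```python
import numpy as np


def stable_softmax(logits):
    # Subtracting the row maximum leaves the result unchanged
    # but keeps every exponent <= 0, so np.exp cannot overflow.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)


def cross_entropy(logits, labels):
    # Compute log-softmax directly instead of log(softmax(x)),
    # which avoids log(0) when a probability underflows to zero.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()


# Logits this large would overflow a naive exp(x) / sum(exp(x)).
logits = np.array([[1000.0, 0.0, -1000.0],
                   [0.0, 0.0, 0.0]])
labels = np.array([0, 2])

probs = stable_softmax(logits)
loss = cross_entropy(logits, labels)
```

Both results stay finite for these inputs, whereas a naive `np.exp(1000.0)` evaluates to `inf` and turns the softmax into NaN.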

.gitignore
README.md
cross_entropy_loss.py
gelu.py
layer_normalization.py
linear.py
main.py
multi_head_attention.py
optimizer.py
parameter.py
patch.py
position_embedding.py
softmax.py
vit.py
vit_block.py
vit_weights.npy

sathishram12/vision_transformer_numpy

Folders and files

Latest commit

History

Repository files navigation

vision_transformer_numpy

Dataset

Training

Need to add/implement

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages