(PyTorch) Visual Transformers: Token-based Image Representation and Processing for Computer Vision

A PyTorch implementation of the paper "Visual Transformers: Token-based Image Representation and Processing for Computer Vision".

Find the original paper here.

  • This PyTorch implementation is based on this repo. The default dataset is CIFAR-10, which can easily be swapped for ImageNet or any other image classification dataset.
  • You might need to install einops (pip install einops).
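
Swapping the dataset mostly means changing the data loader, the number of classes, and the input resolution. The snippet below is a minimal sketch of that swap using torchvision, not code taken from this repo: `DatasetSpec`, `DATASET_SPECS`, and `build_train_dataset` are hypothetical names, and the resize-only transform is an assumption made for brevity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetSpec:
    """Hypothetical container for the settings that differ between datasets."""
    num_classes: int
    image_size: int  # side length, in pixels, of the images fed to the model


# CIFAR-10 has 10 classes of 32x32 images; ImageNet-1k has 1000 classes
# and is commonly trained at 224x224.
DATASET_SPECS = {
    "cifar10": DatasetSpec(num_classes=10, image_size=32),
    "imagenet": DatasetSpec(num_classes=1000, image_size=224),
}


def build_train_dataset(name: str, root: str):
    """Return (dataset, spec) for the named training set (illustrative helper)."""
    # Imported lazily so DATASET_SPECS stays usable without torchvision installed.
    from torchvision import datasets, transforms

    spec = DATASET_SPECS[name]  # raises KeyError for unknown dataset names
    transform = transforms.Compose([
        transforms.Resize((spec.image_size, spec.image_size)),
        transforms.ToTensor(),
    ])
    if name == "cifar10":
        dataset = datasets.CIFAR10(root, train=True, download=True, transform=transform)
    else:
        # torchvision cannot download ImageNet; the archives must already be in `root`.
        dataset = datasets.ImageNet(root, split="train", transform=transform)
    return dataset, spec
```

With something like this in place, `spec.num_classes` would set the size of the classifier head, and the returned dataset can be wrapped in a `torch.utils.data.DataLoader` as usual.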