Yixing Xu, Chao Li, Dong Li, Xiao Sheng, Fan Jiang, Lu Tian, Ashish Sirasao | Paper
Advanced Micro Devices, Inc.
Required package versions:

```
torch == 1.13.1
torchvision == 0.14.1
timm == 0.6.12
einops == 0.6.1
```
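If helpful, the pinned versions above can be installed with pip; note that the appropriate CUDA-enabled torch/torchvision wheels depend on your system, so adjust the install source as needed:

```
pip install torch==1.13.1 torchvision==0.14.1 timm==0.6.12 einops==0.6.1
```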
The image classification results of the FDViT models on the ImageNet dataset are shown in the table below.
| Model | Parameters (M) | FLOPs (G) | Top-1 Accuracy (%) |
|---|---|---|---|
| FDViT-Ti | 4.6 | 0.6 | 73.74 |
| FDViT-S | 21.6 | 2.8 | 81.45 |
| FDViT-B | 68.1 | 11.9 | 82.39 |
To evaluate a pre-trained FDViT-Ti model on the ImageNet validation set, run:

```
python -m torch.distributed.launch --nproc_per_node=1 --use_env main.py --model fdvit_ti --data-path /path/to/imagenet/ --resume /path/to/tiny_model/ --eval
```
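The same entry point should work for the small and base variants by swapping the `--model` name (the names `fdvit_s` and `fdvit_b` are taken from the training commands below); the checkpoint paths here are placeholders:

```
# Hypothetical: evaluate the other model sizes by changing --model and --resume
python -m torch.distributed.launch --nproc_per_node=1 --use_env main.py --model fdvit_s --data-path /path/to/imagenet/ --resume /path/to/small_model/ --eval
python -m torch.distributed.launch --nproc_per_node=1 --use_env main.py --model fdvit_b --data-path /path/to/imagenet/ --resume /path/to/base_model/ --eval
```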
To train the FDViT models on ImageNet with 8 GPUs, run the corresponding command below.

FDViT-Ti:

```
python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --model fdvit_ti --opt adamp --batch-size 256 --data-path /path/to/imagenet/ --output_dir ./output/fdvit_ti/ --epochs 300 --warmup-epochs 20 --ratio 0.03 --mask_thre 0.2
```

FDViT-S:

```
python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --model fdvit_s --opt adamp --batch-size 256 --data-path /path/to/imagenet/ --output_dir ./output/fdvit_s/ --epochs 300 --warmup-epochs 20 --ratio 0.03 --mask_thre 0.2
```

FDViT-B:

```
python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --model fdvit_b --opt adamp --batch-size 256 --data-path /path/to/imagenet/ --output_dir ./output/fdvit_b/ --epochs 300 --warmup-epochs 20 --ratio 0.03 --mask_thre 0.2
```
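Note that `torch.distributed.launch` is deprecated in newer PyTorch releases (it still works with the pinned torch==1.13.1). If you are on a version where it has been removed, `torchrun` should accept the same script arguments, since the environment-variable rank passing enabled here by `--use_env` is torchrun's default behavior. An untested sketch for FDViT-Ti:

```
# Assumption: main.py reads LOCAL_RANK from the environment, as --use_env implies
torchrun --nproc_per_node=8 main.py --model fdvit_ti --opt adamp --batch-size 256 --data-path /path/to/imagenet/ --output_dir ./output/fdvit_ti/ --epochs 300 --warmup-epochs 20 --ratio 0.03 --mask_thre 0.2
```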
```
@inproceedings{xu2023fdvit,
  title={FDViT: Improve the Hierarchical Architecture of Vision Transformer},
  author={Xu, Yixing and Li, Chao and Li, Dong and Sheng, Xiao and Jiang, Fan and Tian, Lu and Sirasao, Ashish},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={5950--5960},
  year={2023}
}
```