Skip to content
/ Vim-F Public

This project is based on Vim (paper, code) and we appreciate this excellent work.

Notifications You must be signed in to change notification settings

yws-wxs/Vim-F

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 

Repository files navigation

Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain

Introduction

This project is based on Vim (paper, code) and we appreciate this excellent work. You can simply replace main.py and models_mamba.py with our versions to reimplement our work. Among them, main.py has no substantial modifications, and only the code related to position embedding in the original file has been removed to fit our work.

ImageNet classification

Pre-training

V1

Model Dataset Resolution Top1 Ckpt/Logs
Vim-Ti-F(H) ImgNet 1K 224×224 76.0 ckpt/log
Vim-S-F(H) ImgNet 1K 224×224 80.5 ckpt/log

V2

Model Dataset Resolution Top1 Ckpt/Logs
Vim-Ti-F ImgNet 1K 224×224 76.7 ckpt/log
Vim-S-F ImgNet 1K 224×224 80.9 retraining

About

This project is based on Vim (paper, code) and we appreciate this excellent work.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages