GitHub - facebookresearch/long_seq_mae: code release of research paper "Exploring Long-Sequence Masked Autoencoders"

Exploring Long-Sequence Masked Autoencoders

This is the code release of the paper Exploring Long-Sequence Masked Autoencoders:

@Article{hu2022exploring,
  author  = {Ronghang Hu and Shoubhik Debnath and Saining Xie and Xinlei Chen},
  journal = {arXiv:2210.07224},
  title   = {Exploring Long-Sequence Masked Autoencoders},
  year    = {2022},
}

This repo is a modification on the MAE repo, and supports long-sequence pretraining on both GPUs and TPUs using PyTorch.
This repo is based on timm==0.4.12, which can be installed via pip3 install timm==0.4.12.

Fine-tuning with pre-trained checkpoints

The following table provides the pre-trained checkpoints used in the paper:

Model (pretrained w/ L=784, image size 448, patch size 16)	ViT-Base	ViT-Large
COCO (train2017 + unlabeled2017) 4000-epoch	download	download
ImageNet-1k 800-epoch	download	download
ImageNet-1k 1600-epoch	download	download

Using the codebase

Follow PRETRAIN_LONG_SEQ_TPU.md for long-sequence pretraining on Google Cloud TPUs (which we used for our experiments).
Follow PRETRAIN_LONG_SEQ_GPU.md for long-sequence pretraining on Nvidia GPUs.
Follow FINETUNE_DETECTION.md to fine-tune on the object detection task using the ViTDet codebase from Detectron2.

In addition, this codebase is also compatible with the features in the original MAE repo. Follow README_MAE.md to use the features of the original MAE repo (such as fine-tuning on image classification).

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data		data
demo		demo
tools		tools
util		util
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
DATA.md		DATA.md
FINETUNE.md		FINETUNE.md
FINETUNE_DETECTION.md		FINETUNE_DETECTION.md
LICENSE		LICENSE
PRETRAIN.md		PRETRAIN.md
PRETRAIN_LONG_SEQ_GPU.md		PRETRAIN_LONG_SEQ_GPU.md
PRETRAIN_LONG_SEQ_TPU.md		PRETRAIN_LONG_SEQ_TPU.md
README.md		README.md
README_MAE.md		README_MAE.md
engine_finetune.py		engine_finetune.py
engine_pretrain.py		engine_pretrain.py
main_finetune.py		main_finetune.py
main_linprobe.py		main_linprobe.py
main_pretrain.py		main_pretrain.py
models_mae.py		models_mae.py
models_vit.py		models_vit.py
submitit_finetune.py		submitit_finetune.py
submitit_linprobe.py		submitit_linprobe.py
submitit_pretrain.py		submitit_pretrain.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Exploring Long-Sequence Masked Autoencoders

Fine-tuning with pre-trained checkpoints

Using the codebase

License

About

Releases

Packages

Contributors 3

Languages

License

facebookresearch/long_seq_mae

Folders and files

Latest commit

History

Repository files navigation

Exploring Long-Sequence Masked Autoencoders

Fine-tuning with pre-trained checkpoints

Using the codebase

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages