Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problems in the use of pre training model #7

Open
wudizuixiaosa opened this issue Apr 24, 2022 · 5 comments
Open

Problems in the use of pre training model #7

wudizuixiaosa opened this issue Apr 24, 2022 · 5 comments

Comments

@wudizuixiaosa
Copy link

Hello, first of all, thank you for your amazing results. I obtained the pre training model of Imagenet 1K in Mae released by Facebook research before, and then loaded my own coco format data set for training, but an error occurred. Have you encountered this type of problem before. In addition, I read your vitaev2 paper, which mentioned that you used Mae to train vitae. Is there any specific operation method, such as where to modify the overall Mae.
Traceback (most recent call last):
File "tools/train.py", line 190, in
main()
File "tools/train.py", line 186, in main
meta=meta)
File "/home/lzh/miniconda3/envs/det2/lib/python3.7/site-packages/mmdet-2.18.0-py3.7.egg/mmdet/apis/train.py", line 177, in train_detector
runner.resume(cfg.resume_from)
File "/home/lzh/miniconda3/envs/det2/lib/python3.7/site-packages/mmcv/runner/base_runner.py", line 361, in resume
self._epoch = checkpoint['meta']['epoch']
KeyError: 'meta'

@Annbless
Copy link
Collaborator

The pretrained models should be loaded by specifying the model.pretrained options, not the resume_from options. Please refer to Usage for detailed commands.

@wudizuixiaosa
Copy link
Author

The pretrained models should be loaded by specifying the model.pretrained options, not the resume_from options. Please refer to Usage for detailed commands.

Ha ha, forgive me for not reading the usage carefully. Thank you for your timely reply. I wish you a happy life.

@wudizuixiaosa
Copy link
Author

If it's convenient, can you answer by the way? I read your vitaev2 paper, which mentioned that you train vitae with MAE. Whether there are any specific operation methods, such as where to modify the overall MAE. I am very interested in this because Mae cannot use PVT or swin due to its specific input method.

@Annbless
Copy link
Collaborator

Hi,

Please refer to Sec 3.3 and Sec 4.4.1 in the ViTAEv2 paper. We use MAE to train the isotropic design of ViTAE, not the hierarchical ViTAEv2 backbone. We will explore training hierarchical ViTAEv2 backbone using MIM series methods in the future.

@xmdgaoxin
Copy link

Hello, how can I solve the following error when I use my own coco dataset during training
捕获

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants