Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于预训练权重window_size过大导致爆显存问题 #44

Open
DoctorDream opened this issue Sep 27, 2024 · 3 comments
Open

关于预训练权重window_size过大导致爆显存问题 #44

DoctorDream opened this issue Sep 27, 2024 · 3 comments

Comments

@DoctorDream
Copy link

作者您好,非常感谢您开源的工作!

我在尝试应用过程中试图将其agent_swin_b的预训练权重替换至detr模型的backbone中,但是发现很容易出现显存爆炸的情况。
经过尝试我发现似乎是window_size设置为56/96导致的,相比detr中使用swin-transformer常用的12/14/16的window size,56对显存的负荷似乎过大了。

因为我做的试验少不太清楚,请问这么大的window_size对性能的影响是否显著,如果我想将其更改至12是否导致我将无法使用预训练权重?

非常感谢您的工作,希望您能为我解答疑惑,谢谢!

@Da1symeeting1
Copy link

@DoctorDream 你好,请问可以分享预训练权重吗,readme中的链接失效了 1241955089@qq.com
非常感谢!

@DoctorDream
Copy link
Author

@DoctorDream 你好,请问可以分享预训练权重吗,readme中的链接失效了 1241955089@qq.com 非常感谢!

我用的是agent-transformer的权重,因为显存占用过大无法正常训练,现在已经删掉了,刚刚看了一下好像还是能正常下载的,你试一下吧

@Da1symeeting1
Copy link

打扰啦,detection部分还是网页不存在,正常下载的是分类的预训练。感谢回复

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants