Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

对于每一个layer的第一个block,计算方式不是层级的,和论文中描述有差别 #73

Open
easy2star opened this issue May 6, 2023 · 3 comments

Comments

@easy2star
Copy link

easy2star commented May 6, 2023

第一个block计算方式

@gasvn
Copy link
Member

gasvn commented May 8, 2023

没有经过层级处理的block放在最开始(论文中)或者最后(代码实现)是等效的。x4只有在downsample的时候才需要pooling

@easy2star
Copy link
Author

easy2star commented May 8, 2023 via email

@mingjie-zheng
Copy link

@easy2star 你不看layer1的话,就明白了。layer2, layer3, layer4的第一个block里面,x1,x2,x3进行conv3x3的时候,stride=2(为了把size变小),也就是说 y_{i-1} 和 x_i 的size并不匹配,自然做不了concat。然后对于x4,为了保证最后y1, y2, y3, y4能concat,x4就只能做pooling来reduce size。
严格点的写法就是把layer1的block1独立出来,不跟着layer2-4那样写。比如改成:if i==0 or (self.stype=='stage' and self.stride==2)。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants