Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What do the two Subtract operations mean? #19

Open
CacatuaAlan opened this issue Jun 4, 2024 · 3 comments
Open

What do the two Subtract operations mean? #19

CacatuaAlan opened this issue Jun 4, 2024 · 3 comments
Assignees
Labels
question Further information is requested

Comments

@CacatuaAlan
Copy link

Hi! Mentioned that your paper has two Subtract operations which confuses me. Can I just consider them as decouple?

@Lupin1998
Copy link
Member

Lupin1998 commented Jul 8, 2024

Hi, @CacatuaAlan, sorry for the late reply! You can regard the two subtract operations in the feature decomposition (FD) and the channel aggregation (CA) modules as removing the low-pass components. Then, we adaptively combine the low-pass and the rest components to increase the diversity (i.e., enhancing the modeling of middle-order interactions). Feel free to ask me when new problems occur and star our repo if it's helpful to your project!

@Lupin1998 Lupin1998 self-assigned this Jul 8, 2024
@Lupin1998 Lupin1998 added the question Further information is requested label Jul 8, 2024
@CacatuaAlan
Copy link
Author

Thank you for your patient explanation. In the two subtraction operations within the FD and CA modules, the subtracted components are different operations (GAP and conv1x1). Is there any special design or experimental comparison for this?

Hi, @CacatuaAlan, sorry for the late reply! You can regard the two subtract operations in the feature decomposition (FD) and the channel aggregation (CA) modules as removing the low-pass components. Then, we adaptively combine the low-pass and the rest components to increase the diversity (i.e., enhancing the modeling of middle-order interactions). Feel free to ask me when new problems occur and star our repo if it's helpful to your project!

Thank you for your patient explanation. In the two subtraction operations within the FD and CA modules, the subtracted components are different operations (GAP and conv1x1). Is there any special design or experimental comparison for this?

@Lupin1998
Copy link
Member

Well, from my perspective, I try to regard the two subtraction operations in the FD and CA modules with a similar intuition, i.e., removing the low-order features (so-called DC component or low-frequency component) and reweighting them. As for ablation experiments, I have found the two subtractions useful. If you are interested in whether we could achieve this intuition with a different operation, I think the addition might be supported by a similar operation, GRN (proposed in ConvNeXt.V2).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants