Regarding visualization and router #3

Open
yinangit opened this issue Nov 8, 2024 · 0 comments

yinangit commented Nov 8, 2024

Great work!

  1. How were the visualizations in Figures 4 and 5 made? Taking Figure 4 as an example: the missing modality bank has shape [$2^{|M|}-1$, num_modality, num_patch, hidden_dim], so how is it used to compute and visualize cosine similarity?
  2. Is the only difference between G-Router and S-Router whether the proposed loss $L_{ce}$ is used? That is, does G-Router use only $L_{balance}$, while S-Router uses both $L_{ce}$ and $L_{balance}$? And are their structures identical, differing only in the loss functions applied during different training periods?
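
To make question 1 concrete, here is a minimal sketch of one way I could imagine computing such a plot. It is only my guess, not the authors' method: the function name `bank_cosine_similarity`, the choice to flatten each combination's entry into a single vector, and the random stand-in bank are all assumptions of mine.

```python
import numpy as np


def bank_cosine_similarity(bank: np.ndarray) -> np.ndarray:
    """Pairwise cosine similarity between the combination entries of a bank.

    bank:    [num_combinations, num_modality, num_patch, hidden_dim]
    returns: [num_combinations, num_combinations]
    """
    # Assumption: each combination is summarized by flattening its
    # (num_modality, num_patch, hidden_dim) block into one long vector.
    flat = bank.reshape(bank.shape[0], -1)
    # Normalize each row to unit length; the clip avoids division by zero.
    norms = np.linalg.norm(flat, axis=1, keepdims=True)
    unit = flat / np.clip(norms, 1e-12, None)
    # Dot products of unit vectors are cosine similarities.
    return unit @ unit.T


# Hypothetical sizes, for illustration only.
num_modality = 3
num_combinations = 2 ** num_modality - 1  # non-empty modality subsets
rng = np.random.default_rng(0)
bank = rng.standard_normal((num_combinations, num_modality, 4, 8))

sim = bank_cosine_similarity(bank)
print(sim.shape)
```

The resulting square matrix could then be drawn as a heatmap. If the figure instead compares modalities or patches within one combination, the reshape would need to keep a different axis, which is exactly the detail I am asking about.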

Looking forward to your answer!
