-
Notifications
You must be signed in to change notification settings - Fork 136
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Auxiliary losses implementations #24
Comments
Hey @ggonzalezp, I'm interested in the auxiliary losses, too. According to Issue #8, it didn't work so it's not in the codebase. But that's really confusing, because the paper describes the losses in detail in Section 3.3. @RexYing could you elaborate on this? |
Hi, Guadalupe is right that it was cross entropy loss for link pred loss. The reason is that since the assignment prediction contains values between [0,1], cross entropy is more effective compared to l2 in Frobenius norm. I also added the entropy regularization. It makes the assignment matrix more discrete and improves interpretability, but does not necessarily improve classification. |
Thanks for the clarification :) |
Hi Ying,
|
@RexYing |
Hi Rex,
I have a couple of questions regarding the implementation of the auxiliary losses.
'at each layer l, we minimize
denotes the Frobenius norm.'
However, in the code, what I find is:
self.link_loss = -adj * torch.log(pred_adj+eps) - (1-adj) * torch.log(1-pred_adj+eps)
which is the binary cross-entropy on pre_adj
Could you please explain why/how this is equivalent to the mathematical formulation? Also, I believe that the pre_adj used is created with the final assignation tensor, isn't it?
In theory you are also regularizing the entropy of the cluster assignment by minimizing
LE = 1/n Sum(H(Si))
but I can't see this anywhere in the code? Could you point me to this please?
A third comment, not related to the losses is that in the experiments section of the paper you say that you use GraphSAGE as a base for the model, but as far as I could see in the code, it is using a GConv. Could you also enlighten me a little bit on this please?
Thanks!
Guadalupe
The text was updated successfully, but these errors were encountered: