
Ganea model parameters #18

Open
lej-la opened this issue Jul 3, 2019 · 18 comments

Comments

@lej-la

lej-la commented Jul 3, 2019

I was trying to run your model with "rel-norm" type and 1 relation.
As you suggest in the paper, it should be equivalent to the Ganea & Hofmann (2017) model, but the result I got on the AIDA-B dataset was not the same as the one they reported.

Their reported number was 92.22 micro F1, while I got only 83.71 micro F1 on average (highest was 86.24).

Did you actually manage to replicate their results? Am I missing some parameter settings?

Thanks.

@lephong
Owner

lephong commented Jul 3, 2019

I wasn't able to replicate their paper in the beginning, but then reading their source code (https://github.com/dalab/deep-ed) was super helpful.

@lej-la
Author

lej-la commented Jul 3, 2019

Thank you, I've read their code and also tried to reimplement it (https://github.com/lej-la/deep-ed-pytorch), but I eventually gave up.

I thought your model was a generalized version of their model and that it would be able to produce the same results using only 1 relation (i.e., the number of relations = 1).

So were you, eventually, able to replicate their results with this code, please?

@lephong
Owner

lephong commented Jul 3, 2019

You are right: if the number of relations is set to 1, we will have their model.

Yes, I successfully replicated their results (even got a bit higher scores, but not significant).

@lej-la
Author

lej-la commented Jul 4, 2019

Please, just for clarification. My assumption is that if I run your code like this with the following parameters:

python -u -m nel.main --mode train --n_rels 1 --mulrel_type rel-norm --model_path model

I should be able to get a Ganea-like model with a performance of ~92 micro F1 on AIDA-B.
Is that correct?

I've tried that several times, but the results were only 83.71 micro F1 on average.

Thanks

@lephong
Owner

lephong commented Jul 4, 2019

Unfortunately, I don't have a facility to run the code.

When you ran the command line in the README

python -u -m nel.main --mode train --n_rels 3 --mulrel_type ment-norm --model_path model

did you get the reported number?

For Ganea-like, could you try:

python -u -m nel.main --mode train --n_rels 1 --mulrel_type rel-norm --model_path model

@lephong
Owner

lephong commented Jul 4, 2019

The reason for trying "rel-norm" instead of "ment-norm" is that for ment-norm the model uses mention padding, which the Ganea model doesn't have.

@lej-la
Author

lej-la commented Jul 4, 2019

The results of your best model that I was able to get (by running the first command) were on average 91.62 micro F1.

@lephong
Owner

lephong commented Jul 4, 2019

Hmm, could you please send me the log files (or what you get when running the cmd)?

@lej-la
Author

lej-la commented Jul 4, 2019

I don't have them, but I'll re-run the training and send it to you. Can I use your email from your last paper (https://arxiv.org/pdf/1906.01250.pdf)?

@lephong
Owner

lephong commented Jul 4, 2019

Yes, please send to my gmail address (I no longer use UoEdin email address). Thanks!

@lej-la
Author

lej-la commented Jul 4, 2019

Thank you :)

@martinj96

I have the same issue of not being able to achieve the claimed 93.07 score. Did you manage to find the issue?

@lephong
Owner

lephong commented Jul 24, 2019

Yes, I found the issue, but we had a private discussion which is not shown here. Long story short, lej-la commented out an important line.

Could you show your log file?

@martinj96

martinj96 commented Jul 24, 2019

Thanks for the fast response. I ran it once, got her result and assumed there's some issue. Let me run it at least five times and get back to you with the log file or a message that I've reproduced the result.

@lej-la
Author

lej-la commented Jul 24, 2019

Hey, I got a bit confused by the part in section 3.2 about rel-norm. The true parameters to replicate the Ganea global model actually use ment-norm with K=1. That way, the normalization factor becomes the same as in equation 3. Using rel-norm, the normalization factor becomes just 1, instead of 1/(n-1). I got as close as 91.6 micro F1 on AIDA-B.
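To make the difference concrete, here is a minimal NumPy sketch of the two normalization schemes. This is an illustration, not code from the repository: it assumes unnormalized pairwise scores arranged as (mention i, other mention j, relation k), with the self-pair j = i already excluded.

```python
import numpy as np

def rel_norm(scores):
    # Softmax over the relation axis (last). With K = 1 every
    # coefficient is exp(s) / exp(s) = 1, so no 1/(n-1) factor appears.
    e = np.exp(scores - scores.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def ment_norm(scores):
    # Softmax over the other-mention axis (axis 1). With K = 1 and
    # uniform scores every coefficient is 1/(n-1), matching the
    # normalization factor in Ganea & Hofmann's equation 3.
    e = np.exp(scores - scores.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

# n = 4 mentions, so each mention has n - 1 = 3 neighbours; K = 1.
scores = np.zeros((4, 3, 1))
print(rel_norm(scores)[0, :, 0])   # every coefficient is 1
print(ment_norm(scores)[0, :, 0])  # every coefficient is 1/(n-1) = 1/3
```

With K > 1, rel-norm distributes weight across relations per pair, while ment-norm distributes it across the other mentions, which is why only ment-norm with K=1 recovers the 1/(n-1) factor of the original model.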

@lephong
Owner

lephong commented Jul 24, 2019

Okay, so they differ in the normalization factor. Thanks for pointing that out.

@lej-la
Author

lej-la commented Jul 24, 2019

But then, actually the ment-norm model seems to have the same performance with K=1 and K=3.

@martinj96

All is good, I managed to reproduce the results. Very simple steps, with no issues; good job @lephong. Maybe you could also close this thread, as it seems to be resolved. Best.
