Fix `lrtest` for model families with dispersion (#261)
Conversation
`lrtest` relied on the deviance rather than the log-likelihood, which is not correct for model families where a dispersion parameter needs to be taken into account. Scaling the deviance would be more efficient than computing the log-likelihood, but there is currently no generic API for this and it may not work for non-GLM models, so simply call `loglikelihood`. We could imagine defining a `likelihoodratio(m1, m2) = loglikelihood(m1) - loglikelihood(m2)` method that packages could override for performance, but this may not be worth it. Also relax the check that more complex models have a strictly better fit than simpler nested ones: the more complex model may have the same deviance, and due to approximations it may even have a slightly higher deviance.
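The override point proposed above could be sketched as follows. Note that `likelihoodratio` is a hypothetical name from the discussion, not an actual API, and the "models" here are plain named tuples carrying a precomputed log-likelihood:

```julia
# Stand-in `loglikelihood` for the sketch: real models would come from
# GLM.jl or another StatsAPI-compatible package.
loglikelihood(m) = m.ll

# Hypothetical generic fallback: packages could override this method for
# performance, e.g. by scaling the deviance when a dispersion estimate
# is available.
likelihoodratio(m1, m2) = loglikelihood(m1) - loglikelihood(m2)

m_simple  = (ll = -120.5,)  # simpler nested model
m_complex = (ll = -118.2,)  # more complex model

likelihoodratio(m_complex, m_simple)  # positive when the complex model fits better
```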
@palday I've noticed that MixedModels.jl uses the deviance for
src/lrtest.jl (outdated)
```julia
for i in 2:length(ll)
    if ((forward && ll[i-1] > ll[i]) ||
        (!forward && ll[i-1] < ll[i])) &&
        ll[i-1] ≉ ll[i]
```
This was actually part of my earlier comment but got lost because the remainder of the comment was beefy. Should we allow the user to provide a tolerance for this check? We already have an `atol` argument used for checking whether the models are nested; perhaps it would make sense to reuse that here?
Yeah why not. I've pushed a commit to do that. BTW I discovered that the condition was backwards!
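The reuse of `atol` discussed above could look like the following sketch. The function name `check_ordering` and its standalone form are illustrative only; in the actual PR this logic lives inside `lrtest`, and `≉` is replaced by `isapprox` so the tolerance can be passed through:

```julia
# Sketch: verify that fits do not get worse as models grow more complex,
# assuming `ll` holds log-likelihoods and `forward` means the models are
# ordered from simplest to most complex. Violations within `atol` are
# tolerated, reusing the tolerance from the nestedness check.
function check_ordering(ll; forward = true, atol = 1e-8)
    for i in 2:length(ll)
        if ((forward && ll[i-1] > ll[i]) ||
            (!forward && ll[i-1] < ll[i])) &&
            !isapprox(ll[i-1], ll[i], atol = atol)
            throw(ArgumentError("models are not ordered by fit"))
        end
    end
    return true
end
```

With this form, a slightly higher deviance for the more complex model (within `atol`) no longer triggers an error, matching the relaxed check described in the PR description.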
@nalimilan we should change that (and will 😉), although currently we don't support GLMMs with a dispersion parameter, so we don't have to worry about the comparison to the GLM deviance. (The problem here with deviance vs. loglikelihood is actually deeply intertwined with the problem of fitting GLMMs with a dispersion parameter.) For LMMs it doesn't matter, since the "deviance" is indeed just the objective, which is -2 loglikelihood. (In practice, we actually compute -2 loglikelihood directly and then use that to compute the loglikelihood.)
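The LMM relationship described above can be sketched in one line; `objective_to_loglikelihood` is an illustrative name, not a MixedModels.jl function:

```julia
# For linear mixed models, the reported "deviance" is the optimizer
# objective, i.e. -2 * loglikelihood, so the log-likelihood can be
# recovered directly from it.
objective_to_loglikelihood(objective) = -objective / 2

objective_to_loglikelihood(246.8)  # recovers a log-likelihood near -123.4
```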
Can you make a release if you're OK with the PR? I won't be on my computer for the next two weeks.
Fixes JuliaStats/GLM.jl#490, #260.