Logreg vs pmm for imputing dichotomous variables #383
angelrodriguez2020
started this conversation in
Missing data methodology
Replies: 2 comments 3 replies
-
Methods |
Beta Was this translation helpful? Give feedback.
3 replies
-
See https://journals.sagepub.com/doi/full/10.1177/09622802231198795 for a comparison. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Dear all,
I'm a little bit confused about conflicting recommendations on this issue.
The default method to impute a dichotomous variable in mice is logreg, what makes conceptual sense. But I've found the following advice: 'For sparse categorical data, it may be better to use method pmm instead of logreg, polr or polyreg"
(https://stefvanbuuren.name/fimd/sec-modelform.html)'. I would consider a dichotomous variable a very sparse categorical variable. But in the mice paper of the Journal of Statistical Software (p42) it is said: 'For variable with many categories,
we therefore recommend pmm'. So apparently pmm is preferred over other methods in categorical variables with very few or many categories.
Is this advice based on efficiency of the pmm method over logreg and polyreg, or are there more substantive issues there?
Many thanks.
Beta Was this translation helpful? Give feedback.
All reactions