How to impute a categorical variable with MICE but prevent it from taking some values? #196

alexiasampri · 2019-10-07T10:31:33Z

I have a categorical variable, var1 , that can take on values of W, B, A, M, N or P. There are some NAs that I want to impute using the mice package in R, but I know that the missing values cannot be "W" or "B" because those people said that they do not belong in that category. I want to impute var1 but force mice to only choose from everything except B or W .

Here is sample code for you to use:

df=data.frame(age=c(24,37,58,65,70,84, 56, 36, 48,23,15), 
    var1 =c("B","W", NA, "A",NA, "P","N", NA, "M",NA, "B"), 
    var1categ=c(0,0, 1, 1, 1,1,1,1,1,1, 0),
    ht = c(156, 169, 180, 175, 168, 165, 171, 158, 160, 175, 160))

imp=mice(df, remove_collinear = FALSE)

Thank you for your help and please let me know if you need more information.

The text was updated successfully, but these errors were encountered:

stefvanbuuren · 2019-10-08T13:06:19Z

An easy way is perhaps to impute the subset of df without categories "B" and "W".

alexiasampri · 2019-10-08T13:37:04Z

Thanks for getting back to me. The problem with that is that I don't want to follow this approach because I lose power. The actual dataset is much bigger. df mentioned above is just an example. Preferably I want to do it with either post processing or create a function in mice. But I don't really know how to it. I also know that for integers I can use squeeze. Is there anything similar for categorical variables?

stefvanbuuren · 2019-10-08T13:44:13Z

Yes, I understand.

Another way is to start with the mice.impute.polyreg() function. At some point, you see the line post <- predict(fit, xy[wy, , drop = FALSE], type = "probs"), which contains the probabilities per categories. You can then nullify the probabilities of the categories that you want to exclude, and perhaps you need to restandardise so that they add up to 1. If all is well, the method will then only draw from the permitted categories.

Sorry, I don't have examples that implements this approach, but in principle it should work.

stefvanbuuren mentioned this issue Oct 8, 2019

remove collinearity false is not working #197

Closed

stefvanbuuren closed this as completed Dec 11, 2019

stefvanbuuren mentioned this issue Aug 8, 2023

Imputing categorical data by predictive mean matching #576

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to impute a categorical variable with MICE but prevent it from taking some values? #196

How to impute a categorical variable with MICE but prevent it from taking some values? #196

alexiasampri commented Oct 7, 2019

stefvanbuuren commented Oct 8, 2019

alexiasampri commented Oct 8, 2019

stefvanbuuren commented Oct 8, 2019

How to impute a categorical variable with MICE but prevent it from taking some values? #196

How to impute a categorical variable with MICE but prevent it from taking some values? #196

Comments

alexiasampri commented Oct 7, 2019

stefvanbuuren commented Oct 8, 2019

alexiasampri commented Oct 8, 2019

stefvanbuuren commented Oct 8, 2019