Pooling rules for creating synthetic data with mice #436

thomvolker · 2021-10-07T07:48:18Z

As discussed with @gerkovink, the pool.syn() and pool.scalar.syn() pooling functions apply the rules developed by Reiter (2003) to combine analyses on multiply imputed synthetic datasets. Note that these rules only apply to synthetic versions of completely observed datasets. If the data to synthesize contains missing values, different pooling rules apply that require a two-step approach to imputation (first impute missingness, than synthesize all m imputed datasets). Developing a one-step approach would be something for future research.

gerkovink · 2021-10-07T08:54:22Z

@stefvanbuuren Can we prioritise this PR?

stefvanbuuren · 2021-10-07T09:20:51Z

Thanks for the PR.

There's a lot of duplicated code. I will look into the possibility to integrate this functionality as an extra argument to the regular pool() function.

thomvolker · 2021-10-07T09:29:40Z

I completely agree that this PR is mostly duplicate code. The reason to still write an additional function was to protect uninformed users against using wrong pooling rules. Still, an additional argument is probably more elegant.

stefvanbuuren · 2021-10-08T15:45:40Z

mice 3.13.15 adds a new rule argument to pool() and pool.scalar() and redefines pool.syn() and pool.scalar.syn() as wrappers. This removes almost all duplication and is extendable as other pooling rule come along.

Use pool.syn() and pool.scalar.syn() in code for synthetic data, and reserve pool() and pool.scalar() for missing data uses.

gerkovink · 2021-10-09T04:32:35Z

Nice indeed to separate the workflow between pool() and pool.syn()

thomvolker added 7 commits September 25, 2021 17:46

Synthetic data pool function

048a54a

Create synthetic pool function mice

c4d0b3a

Mice pool functions synthetic data

7b0823d

Merge branch 'amices:master' into master

643dc36

Reference to pool.scalar.syn

d79d3da

Merge branch 'master' of https://github.com/thomvolker/mice

a2e8d86

Change to complete data example, to not confuse users.

c8ebb9f

gerkovink assigned stefvanbuuren Oct 7, 2021

gerkovink added the enhancement label Oct 7, 2021

gerkovink requested a review from stefvanbuuren October 7, 2021 08:53

stefvanbuuren merged commit c8ebb9f into amices:master Oct 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pooling rules for creating synthetic data with mice #436

Pooling rules for creating synthetic data with mice #436

thomvolker commented Oct 7, 2021 •

edited by gerkovink

Loading

gerkovink commented Oct 7, 2021

stefvanbuuren commented Oct 7, 2021

thomvolker commented Oct 7, 2021

stefvanbuuren commented Oct 8, 2021

gerkovink commented Oct 9, 2021

Pooling rules for creating synthetic data with mice #436

Pooling rules for creating synthetic data with mice #436

Conversation

thomvolker commented Oct 7, 2021 • edited by gerkovink Loading

gerkovink commented Oct 7, 2021

stefvanbuuren commented Oct 7, 2021

thomvolker commented Oct 7, 2021

stefvanbuuren commented Oct 8, 2021

gerkovink commented Oct 9, 2021

thomvolker commented Oct 7, 2021 •

edited by gerkovink

Loading