Graphical and numerical diagnostic tools to assess multiple imputation models by posterior predictive checking

Publication date

2022-08-27

Authors

Cai, MingyangISNI 0000000517912281
Buuren, Stef vanORCID 0000-0003-1098-2119ISNI 0000000032712898
Vink, GerkoORCID 0000-0001-9767-1924ISNI 0000000394871968

Editors

Advisors

Supervisors

Document Type

/dk/atira/pure/researchoutput/researchoutputtypes/workingpaper/preprint
Open Access logo

License

cc_by

Abstract

Missing data are often dealt with multiple imputation. A crucial part of the multiple imputation process is selecting sensible models to generate plausible values for incomplete data. A method based on posterior predictive checking is proposed to diagnose imputation models based on posterior predictive checking. To assess the congeniality of imputation models, the proposed diagnostic method compares the observed data with their replicates generated under corresponding posterior predictive distributions. If the imputation model is congenial with the substantive model, the observed data are expected to be located in the centre of corresponding predictive posterior distributions. Simulation and application are designed to investigate the proposed diagnostic method for parametric and semi-parametric imputation approaches, continuous and discrete incomplete variables, univariate and multivariate missingness patterns. The results show the validity of the proposed diagnostic method.

Keywords

stat.CO

Citation

Cai, M, Buuren, S V & Vink, G 2022 'Graphical and numerical diagnostic tools to assess multiple imputation models by posterior predictive checking' arXiv, pp. 1-48. https://doi.org/10.48550/arXiv.2208.12929