Real-time imputation of missing predictor values improved the application of prediction models in daily practice

Publication date

2021-06

Authors

Nijman, Steven W.J.
Groenhof, T. Katrien J.
Hoogland, Jeroen
Bots, Michiel LORCID 0000-0003-2871-9810ISNI 0000000391893395
Brandjes, Menno
Jacobs, John J.L.
Asselbergs, Folkert WORCID 0000-0002-1692-8669ISNI 0000000391548591
Moons, Karel G MISNI 0000000390720943
Debray, ThomasORCID 0000-0002-1790-2719ISNI 0000000390283878

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by

Abstract

Objectives: In clinical practice, many prediction models cannot be used when predictor values are missing. We, therefore, propose and evaluate methods for real-time imputation. Study Design and Setting: We describe (i) mean imputation (where missing values are replaced by the sample mean), (ii) joint modeling imputation (JMI, where we use a multivariate normal approximation to generate patient-specific imputations), and (iii) conditional modeling imputation (CMI, where a multivariable imputation model is derived for each predictor from a population). We compared these methods in a case study evaluating the root mean squared error (RMSE) and coverage of the 95% confidence intervals (i.e., the proportion of confidence intervals that contain the true predictor value) of imputed predictor values. Results: –RMSE was lowest when adopting JMI or CMI, although imputation of individual predictors did not always lead to substantial improvements as compared to mean imputation. JMI and CMI appeared particularly useful when the values of multiple predictors of the model were missing. Coverage reached the nominal level (i.e., 95%) for both CMI and JMI. Conclusion: Multiple imputations using either CMI or JMI is recommended when dealing with missing predictor values in real-time settings.

Keywords

Computerized decision support system, Electronic health records, Missing data, Multiple imputations, Prediction, Real-time imputation

Citation

Nijman, S W J, Groenhof, T K J, Hoogland, J, Bots, M L, Brandjes, M, Jacobs, J J L, Asselbergs, F W, Moons, K G M & Debray, T P A 2021, 'Real-time imputation of missing predictor values improved the application of prediction models in daily practice', Journal of Clinical Epidemiology, vol. 134, pp. 22-34. https://doi.org/10.1016/j.jclinepi.2021.01.003