Real-time imputation of missing predictor values in clinical practice

Publication date

2021-03-01

Authors

Nijman, Steven W.J.
Hoogland, Jeroen
Groenhof, T. Katrien J.
Brandjes, Menno
Jacobs, John J.L.
Bots, Michiel L.ORCID 0000-0003-2871-9810ISNI 0000000391893395
Asselbergs, Folkert WORCID 0000-0002-1692-8669ISNI 0000000391548591
Moons, Karel G.M.ISNI 0000000390720943
Debray, Thomas P AORCID 0000-0002-1790-2719ISNI 0000000390283878

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by_nc

Abstract

Aims: Use of prediction models is widely recommended by clinical guidelines, but usually requires complete information on all predictors, which is not always available in daily practice. We aim to describe two methods for real-time handling of missing predictor values when using prediction models in practice. Methods and results: We compare the widely used method of mean imputation (M-imp) to a method that personalizes the imputations by taking advantage of the observed patient characteristics. These characteristics may include both prediction model variables and other characteristics (auxiliary variables). The method was implemented using imputation from a joint multivariate normal model of the patient characteristics (joint modelling imputation; JMI). Data from two different cardiovascular cohorts with cardiovascular predictors and outcome were used to evaluate the real-time imputation methods. We quantified the prediction model's overall performance [mean squared error (MSE) of linear predictor], discrimination (c-index), calibration (intercept and slope), and net benefit (decision curve analysis). When compared with mean imputation, JMI substantially improved the MSE (0.10 vs. 0.13), c-index (0.70 vs. 0.68), and calibration (calibration-in-the-large: 0.04 vs. 0.06; calibration slope: 1.01 vs. 0.92), especially when incorporating auxiliary variables. When the imputation method was based on an external cohort, calibration deteriorated, but discrimination remained similar. Conclusions: We recommend JMI with auxiliary variables for real-time imputation of missing values, and to update imputation models when implementing them in new settings or (sub)populations.

Keywords

Computerized decision support system, Electronic health records, Joint modelling imputation, Missing data, Prediction, Real-time imputation, Cardiology and Cardiovascular Medicine

Citation

Nijman, S W J, Hoogland, J, Groenhof, T K J, Brandjes, M, Jacobs, J J L, Bots, M L, Asselbergs, F W, Moons, K G M & Debray, T P A 2021, 'Real-time imputation of missing predictor values in clinical practice', European Heart Journal - Digital Health, vol. 2, no. 1, pp. 154-164. https://doi.org/10.1093/ehjdh/ztaa016