Population median imputation was noninferior to complex approaches for imputing missing values in cardiovascular prediction models in clinical practice

Publication date

2022-05

Authors

Berkelmans, Gijs H K
Read, S H
Gudbjörnsdottir, S
Wild, S H
Franzen, S
van der Graaf, YolandaISNI 0000000388026709
Eliasson, B
Visseren, Frank L JISNI 0000000389493675
Paynter, N P
Dorresteijn, Jannick AnORCID 0000-0002-0190-8526ISNI 0000000419437536

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by

Abstract

Objectives: To compare the validity and robustness of five methods for handling missing characteristics when using cardiovascular disease risk prediction models for individual patients in a real-world clinical setting. Study design and setting: The performance of the missing data methods was assessed using data from the Swedish National Diabetes Registry (n = 419,533) with external validation using the Scottish Care Information ˗ diabetes database (n = 226,953). Five methods for handling missing data were compared. Two methods using submodels for each combination of available data, two imputation methods: conditional imputation and median imputation, and one alternative modeling method, called the naïve approach, based on hazard ratios and populations statistics of known risk factors only. The validity was compared using calibration plots and c-statistics. Results: C-statistics were similar across methods in both development and validation data sets, that is, 0.82 (95% CI 0.82–0.83) in the Swedish National Diabetes Registry and 0.74 (95% CI 0.74–0.75) in Scottish Care Information-diabetes database. Differences were only observed after random introduction of missing data in the most important predictor variable (i.e., age). Conclusion: Validity and robustness of median imputation was not dissimilar to more complex methods for handling missing values, provided that the most important predictor variables, such as age, are not missing.

Keywords

Cardiovascular risk prediction, clinical practise, Epidemiology, Missing patient characteristics, Real-world setting, Epidemiology, Journal Article

Citation

Berkelmans, G F N, Read, S H, Gudbjörnsdottir, S, Wild, S H, Franzen, S, van der Graaf, Y, Eliasson, B, Visseren, F L J, Paynter, N P & Dorresteijn, J A N 2022, 'Population median imputation was noninferior to complex approaches for imputing missing values in cardiovascular prediction models in clinical practice', Journal of Clinical Epidemiology, vol. 145, pp. 70-80. https://doi.org/10.1016/j.jclinepi.2022.01.011