Propensity-based standardization to enhance the validation and interpretation of prediction model discrimination for a target population

Publication date

2023-08-30

Authors

de Jong, V. M.T.ORCID 0000-0001-9921-3468
Hoogland, Jeroen
Moons, Karel G.M.ISNI 0000000390720943
Riley, Richard D.
Nguyen, Tri Long
Debray, Thomas P AORCID 0000-0002-1790-2719ISNI 0000000390283878

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by_nc

Abstract

External validation of the discriminative ability of prediction models is of key importance. However, the interpretation of such evaluations is challenging, as the ability to discriminate depends on both the sample characteristics (ie, case-mix) and the generalizability of predictor coefficients, but most discrimination indices do not provide any insight into their respective contributions. To disentangle differences in discriminative ability across external validation samples due to a lack of model generalizability from differences in sample characteristics, we propose propensity-weighted measures of discrimination. These weighted metrics, which are derived from propensity scores for sample membership, are standardized for case-mix differences between the model development and validation samples, allowing for a fair comparison of discriminative ability in terms of model characteristics in a target population of interest. We illustrate our methods with the validation of eight prediction models for deep vein thrombosis in 12 external validation data sets and assess our methods in a simulation study. In the illustrative example, propensity score standardization reduced between-study heterogeneity of discrimination, indicating that between-study variability was partially attributable to case-mix. The simulation study showed that only flexible propensity-score methods (allowing for non-linear effects) produced unbiased estimates of model discrimination in the target population, and only when the positivity assumption was met. Propensity score-based standardization may facilitate the interpretation of (heterogeneity in) discriminative ability of a prediction model as observed across multiple studies, and may guide model updating strategies for a particular target population. Careful propensity score modeling with attention for non-linear relations is recommended.

Keywords

concordance, external validation, prediction model, propensity score, standardization, Epidemiology, Statistics and Probability, Journal Article

Citation

de Jong, V M T, Hoogland, J, Moons, K G M, Riley, R D, Nguyen, T L & Debray, T P A 2023, 'Propensity-based standardization to enhance the validation and interpretation of prediction model discrimination for a target population', Statistics in Medicine, vol. 42, no. 19, pp. 3508-3528. https://doi.org/10.1002/sim.9817