Evaluating propensity score balance measures in typical pharmacoepidemiological settings

Files

Access status: Embargo until 2050-01-01 , pages.pdf (83.39 KB)

Publication date

2012-08-01

Authors

Ali, M. SanniISNI 0000000419508349
Groenwold, Rolf H.H.ISNI 0000000394374611
Pestman, Wiebe R.
Belitser, S.ISNI 000000041942150X
Hoes, Arno W.
Roes, Kit C. B.
de Boer, A.ISNI 0000000389596105
Klungel, Olaf H.ISNI 0000000390199414

Editors

Advisors

Supervisors

Document Type

Abstract

License

Abstract

Background: Several propensity score (PS) balance measures have been compared in simulated data with normally distributed covariates. Comparisons in data with binary or mixed covariate distributions and rare outcomes, typical of pharmacoepidemiologic data sets, are scarce. Objectives: To compare balance measures in simulated data with various covariate distributions and rare outcomes. Methods: We performed Monte Carlo simulations to examine the relative ability of different balance measures to select PS models that yielded the least biased estimates. In different simulations, covariates were binary, normal or gamma distributed, considering sample sizes of n = 400, 1,600, and 3,000, incidence of outcomes of 10% and 25%, and strength of exposure-outcome association of OR = 1 and 2. Bias was estimated as the difference between the true marginal effect and the effect estimate obtained from a logistic regression model with PS as a covariate. The balance of covariates between treatment groups was assessed using the standardised difference (SD), Kolmogorov Smirnov (KS) distance, Lévy distance (Lévy) and overlapping coefficient (OVL). Pearson's correlation coefficients (r) between these balance measures and bias were calculated. Results: With large sample sizes, all balance measures were similarly correlated with bias irrespective of covariate distributions, strength of the effect, and prevalence of outcome (e.g., when all covariates were binary, OR = 2.0, n = 3,000 and incidence of outcome = 25%: correlations were 0.76, 0.79, 0.79, and -0.79 for SD, KSD, Lévy and OVL, respectively). These correlations decreased with smaller sample sizes (e.g., for n = 400: 0.51, 0.20, 0.17, and -0.43, for SD, KSD, Lévy distance and OVL, respectively). Incidence of the outcome and strength of the exposure-outcome relation didn't have much impact. For sample sizes smaller than 800, SD showed a better correlation with bias than the other balance measures. Conclusions: The SD or KS performed best across different simulation scenarios and are recommended for reporting the amount of balance reached and selecting the final PS model.

Keywords

pharmacoepidemiology, risk management, propensity score, sample size, model, simulation, exposure, logistic regression analysis, Monte Carlo method, prevalence, correlation coefficient

Citation

Ali, M S, Groenwold, R H H, Pestman, W R, Belitser, S, Hoes, A W, Roes, K C B, De Boer, A & Klungel, O H 2012, 'Evaluating propensity score balance measures in typical pharmacoepidemiological settings', Pharmacoepidemiology and Drug Safety, vol. 21, no. S3, 72, pp. 36-37. https://doi.org/10.1002/pds.3324