Reporting to Improve Reproducibility and Facilitate Validity Assessment for Healthcare Database Studies V1.0

Publication date

2017-09-01

Authors

Wang, Shirley V.
Schneeweiss, Sebastian
Berger, Marc L.
Brown, Jeffrey
de Vries, FrankORCID 0000-0003-3837-8319ISNI 0000000393640594
Douglas, Ian
Gagne, Joshua J.
Gini, Rosa
Klungel, Olaf H.ISNI 0000000390199414
Mullins, C. Daniel

Editors

Advisors

Supervisors

Document Type

Article
Open Access logo

License

Abstract

Purpose: Defining a study population and creating an analytic dataset from longitudinal healthcare databases involves many decisions. Our objective was to catalogue scientific decisions underpinning study execution that should be reported to facilitate replication and enable assessment of validity of studies conducted in large healthcare databases. Methods: We reviewed key investigator decisions required to operate a sample of macros and software tools designed to create and analyze analytic cohorts from longitudinal streams of healthcare data. A panel of academic, regulatory, and industry experts in healthcare database analytics discussed and added to this list. Conclusion: Evidence generated from large healthcare encounter and reimbursement databases is increasingly being sought by decision-makers. Varied terminology is used around the world for the same concepts. Agreeing on terminology and which parameters from a large catalogue are the most essential to report for replicable research would improve transparency and facilitate assessment of validity. At a minimum, reporting for a database study should provide clarity regarding operational definitions for key temporal anchors and their relation to each other when creating the analytic dataset, accompanied by an attrition table and a design diagram. A substantial improvement in reproducibility, rigor and confidence in real world evidence generated from healthcare databases could be achieved with greater transparency about operational study parameters used to create analytic datasets from longitudinal healthcare databases.

Keywords

healthcare databases, longitudinal data, methods, pharmacoepidemiology, replication, reproducibility, Transparency, Epidemiology, Pharmacology (medical)

Citation

Wang, S V, Schneeweiss, S, Berger, M L, Brown, J, de Vries, F, Douglas, I, Gagne, J J, Gini, R, Klungel, O, Mullins, C D, Nguyen, M D, Rassen, J A, Smeeth, L, Sturkenboom, M C J M & on behalf of the joint ISPE‐ISPOR Special Task Force on Real World Evidence in Health Care Decision Making 2017, 'Reporting to Improve Reproducibility and Facilitate Validity Assessment for Healthcare Database Studies V1.0', Pharmacoepidemiology and Drug Safety, vol. 26, no. 9, pp. 1018-1032. https://doi.org/10.1002/pds.4295