The value of statistical tools to detect data fabrication

Publication date

2016

Authors

Hartgerink, Chris
Wicherts, Jelte
van Assen, Marcel (ISNI 0000000377508681)

Document Type

Article
Open Access

Abstract

We aim to investigate how statistical tools can help detect potential data fabrication in the social and medical sciences. In this proposal we outline three projects that assess the value of such tools and take the first steps toward applying them automatically to detect data anomalies that are potentially due to data fabrication. In Project 1, we examine how well statistical methods detect data fabrication in a mixture of genuine and fabricated data sets, where the fabricated data sets are generated by actual researchers who participate in our study. In Project 2, we interview these researchers to investigate the characteristics of different ways of fabricating data and whether data fabricated with certain characteristics are detected better by current statistical tools than others. In Project 3, we use software to semi-automatically screen research articles for data anomalies that are potentially due to fabrication, and we develop and test new software that forms the basis for future automated screening of research articles for such anomalies.

Keywords

data fabrication, statistics, scientific misconduct, integrity

Citation

Hartgerink, C, Wicherts, J & Van Assen, M 2016, 'The value of statistical tools to detect data fabrication', Research Ideas and Outcomes, vol. 2, e8860. https://doi.org/10.3897/rio.2.e8860