Systematic review finds "Spin" practices and poor reporting standards in studies on machine learning-based prediction models

Publication date

2023-06

Authors

Navarro, Constanza L AndaurORCID 0000-0002-7745-2887
Damen, Johanna A A GORCID 0000-0001-7401-4593
Takada, Toshihiko
Nijman, Steven W.J.
Dhiman, Paula
Ma, Jie
Collins, Gary S.
Bajpai, Ram
Riley, Richard D.
Moons, Karel G MISNI 0000000390720943

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by

Abstract

Objectives: We evaluated the presence and frequency of spin practices and poor reporting standards in studies that developed and/or validated clinical prediction models using supervised machine learning techniques. Study Design and Setting: We systematically searched PubMed from 01/2018 to 12/2019 to identify diagnostic and prognostic prediction model studies using supervised machine learning. No restrictions were placed on data source, outcome, or clinical specialty. Results: We included 152 studies: 38% reported diagnostic models and 62% prognostic models. When reported, discrimination was described without precision estimates in 53/71 abstracts (74.6% [95% CI 63.4–83.3]) and 53/81 main texts (65.4% [95% CI 54.6–74.9]). Of the 21 abstracts that recommended the model to be used in daily practice, 20 (95.2% [95% CI 77.3–99.8]) lacked any external validation of the developed models. Likewise, 74/133 (55.6% [95% CI 47.2–63.8]) studies made recommendations for clinical use in their main text without any external validation. Reporting guidelines were cited in 13/152 (8.6% [95% CI 5.1–14.1]) studies. Conclusion: Spin practices and poor reporting standards are also present in studies on prediction models using machine learning techniques. A tailored framework for the identification of spin will enhance the sound reporting of prediction model studies.

Keywords

Development, Diagnosis, Misinterpretation, Overextrapolation, Overinterpretation, Prognosis, Spin, Validation, Epidemiology, Review, Journal Article

Citation

Andaur Navarro, C L, Damen, J A A, Takada, T, Nijman, S W J, Dhiman, P, Ma, J, Collins, G S, Bajpai, R, Riley, R D, Moons, K G M & Hooft, L 2023, 'Systematic review finds "Spin" practices and poor reporting standards in studies on machine learning-based prediction models', Journal of Clinical Epidemiology, vol. 158, pp. 99-110. https://doi.org/10.1016/j.jclinepi.2023.03.024