Systematic review finds "Spin" practices and poor reporting standards in studies on machine learning-based prediction models

Andaur Navarro, Constanza L.; Damen, Johanna A.A.; Takada, Toshihiko; Nijman, Steven W.J.; Dhiman, Paula; Ma, Jie; Collins, Gary S.; Bajpai, Ram; Riley, Richard D.; Moons, Karel G.M.; Hooft, Lotty

doi:https://doi.org/10.1016/j.jclinepi.2023.03.024

Systematic review finds "Spin" practices and poor reporting standards in studies on machine learning-based prediction models

Files

1_s2.0_S0895435623000756_main.pdf (347.94 KB)

Publication date

2023-06

Authors

Navarro, Constanza L Andaur

Damen, Johanna A A G

Takada, Toshihiko

Nijman, Steven W.J.

Dhiman, Paula

Ma, Jie

Collins, Gary S.

Bajpai, Ram

Riley, Richard D.

Moons, Karel G M

DOI

https://doi.org/10.1016/j.jclinepi.2023.03.024

Document Type

Article

Metadata

Show full item record

Collections

UMC Repository

License

cc_by

Abstract

Objectives: We evaluated the presence and frequency of spin practices and poor reporting standards in studies that developed and/or validated clinical prediction models using supervised machine learning techniques. Study Design and Setting: We systematically searched PubMed from 01/2018 to 12/2019 to identify diagnostic and prognostic prediction model studies using supervised machine learning. No restrictions were placed on data source, outcome, or clinical specialty. Results: We included 152 studies: 38% reported diagnostic models and 62% prognostic models. When reported, discrimination was described without precision estimates in 53/71 abstracts (74.6% [95% CI 63.4–83.3]) and 53/81 main texts (65.4% [95% CI 54.6–74.9]). Of the 21 abstracts that recommended the model to be used in daily practice, 20 (95.2% [95% CI 77.3–99.8]) lacked any external validation of the developed models. Likewise, 74/133 (55.6% [95% CI 47.2–63.8]) studies made recommendations for clinical use in their main text without any external validation. Reporting guidelines were cited in 13/152 (8.6% [95% CI 5.1–14.1]) studies. Conclusion: Spin practices and poor reporting standards are also present in studies on prediction models using machine learning techniques. A tailored framework for the identification of spin will enhance the sound reporting of prediction model studies.

Keywords

Development, Diagnosis, Misinterpretation, Overextrapolation, Overinterpretation, Prognosis, Spin, Validation, Epidemiology, Review, Journal Article

Citation

Andaur Navarro, C L, Damen, J A A, Takada, T, Nijman, S W J, Dhiman, P, Ma, J, Collins, G S, Bajpai, R, Riley, R D, Moons, K G M & Hooft, L 2023, 'Systematic review finds "Spin" practices and poor reporting standards in studies on machine learning-based prediction models', Journal of Clinical Epidemiology, vol. 158, pp. 99-110. https://doi.org/10.1016/j.jclinepi.2023.03.024

URI

https://dspace.library.uu.nl/handle/1874/448950

Systematic review finds "Spin" practices and poor reporting standards in studies on machine learning-based prediction models

Files

Publication date

Authors

Editors

Advisors

Supervisors

DOI

Document Type

Metadata

Collections

License

Abstract

Keywords

Citation

URI