AI as an intervention: improving clinical outcomes relies on a causal approach to AI development and validation

Publication date

2025-03-01

Authors

Joshi, Shalmali
Urteaga, Iñigo
van Amsterdam, Wouter A.C.ORCID 0000-0002-3181-0810
Hripcsak, George
Elias, Pierre
Recht, Benjamin
Elhadad, Noémie
Fackler, James
Sendak, Mark P.
Wiens, Jenna

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by

Abstract

The primary practice of healthcare artificial intelligence (AI) starts with model development, often using state-of-the-art AI, retrospectively evaluated using metrics lifted from the AI literature like AUROC and DICE score. However, good performance on these metrics may not translate to improved clinical outcomes. Instead, we argue for a better development pipeline constructed by working backward from the end goal of positively impacting clinically relevant outcomes using AI, leading to considerations of causality in model development and validation, and subsequently a better development pipeline. Healthcare AI should be "actionable,"and the change in actions induced by AI should improve outcomes. Quantifying the effect of changes in actions on outcomes is causal inference. The development, evaluation, and validation of healthcare AI should therefore account for the causal effect of intervening with the AI on clinically relevant outcomes. Using a causal lens, we make recommendations for key stakeholders at various stages of the healthcare AI pipeline. Our recommendations aim to increase the positive impact of AI on clinical outcomes.

Keywords

artificial intelligence, causal inference, healthcare, Health Informatics

Citation

Joshi, S, Urteaga, I, Van Amsterdam, W A C, Hripcsak, G, Elias, P, Recht, B, Elhadad, N, Fackler, J, Sendak, M P, Wiens, J, Deshpande, K, Wald, Y, Fiterau, M, Lipton, Z, Malinsky, D, Nayan, M, Namkoong, H, Park, S, Vogt, J E & Ranganath, R 2025, 'AI as an intervention : improving clinical outcomes relies on a causal approach to AI development and validation', Journal of the American Medical Informatics Association, vol. 32, no. 3, pp. 589-594. https://doi.org/10.1093/jamia/ocae301