Machine learning vs. rule-based methods for document classification of electronic health records within mental health care: A systematic literature review

Publication date

2025-03

Authors

Rijcken, EmilISNI 0000000511052268
Zervanou, KalliopiORCID 0000-0001-9036-354XISNI 0000000138923183
Mosteiro Romero, PabloORCID 0000-0001-7231-2773ISNI 0000000493075828
Scheepers, Floortje
Spruit, MarcoISNI 0000000077172004
Kaymak, Uzay

Editors

Advisors

Supervisors

Document Type

Article
Open Access logo

License

cc_by

Abstract

Document classification is a widely used task for analyzing mental healthcare texts. This systematic literature review focuses on the document classification of electronic health records in mental healthcare. Over the last decade, there has been a shift from rule-based to machine-learning methods. Despite this shift, no systematic comparison of these two approaches exists for mental healthcare applications. This review examines the evolution, applications, and performance of these methods over time. We find that for most of the last decade, rule-based methods have outperformed machine-learning approaches. However, with the development of more advanced machine-learning techniques, performance has improved. In particular, Transformer-based models enable machine learning approaches to outperform rule-based methods for the first time.

Keywords

Document classification, Electronic health records, Machine learning, Mental healthcare, Natural language processing, Rule-based methods, SDG 3 - Good Health and Well-being

Citation

Rijcken, E, Zervanou, K, Mosteiro, P, Scheepers, F, Spruit, M & Kaymak, U 2025, 'Machine learning vs. rule-based methods for document classification of electronic health records within mental health care : A systematic literature review', Natural Language Processing, vol. 10, 100129. https://doi.org/10.1016/j.nlp.2025.100129