Machine learning vs. rule-based methods for document classification of electronic health records within mental health care: A systematic literature review
Publication date
2025-03
Editors
Advisors
Supervisors
Document Type
Article
Metadata
Show full item recordCollections
License
cc_by
Abstract
Document classification is a widely used task for analyzing mental healthcare texts. This systematic literature review focuses on the document classification of electronic health records in mental healthcare. Over the last decade, there has been a shift from rule-based to machine-learning methods. Despite this shift, no systematic comparison of these two approaches exists for mental healthcare applications. This review examines the evolution, applications, and performance of these methods over time. We find that for most of the last decade, rule-based methods have outperformed machine-learning approaches. However, with the development of more advanced machine-learning techniques, performance has improved. In particular, Transformer-based models enable machine learning approaches to outperform rule-based methods for the first time.
Keywords
Document classification, Electronic health records, Machine learning, Mental healthcare, Natural language processing, Rule-based methods, SDG 3 - Good Health and Well-being
Citation
Rijcken, E, Zervanou, K, Mosteiro, P, Scheepers, F, Spruit, M & Kaymak, U 2025, 'Machine learning vs. rule-based methods for document classification of electronic health records within mental health care : A systematic literature review', Natural Language Processing, vol. 10, 100129. https://doi.org/10.1016/j.nlp.2025.100129