A data-driven decision model for selecting ML models in research software

Baninemeh, Elena; Steffens, Lex; Jansen, Slinger; Farshidi, Siamak

doi:https://doi.org/10.1016/j.jss.2026.112821

Files

1-s2.0-S0164121226000555-main.pdf (12.01 MB)

Publication date

2026-06

Authors

Baninemeh, Elena

Steffens, Lex

Jansen, Slinger

Farshidi, Siamak

DOI

https://doi.org/10.1016/j.jss.2026.112821

Document Type

Article

Collections

Utrecht University Repository

License

CC BY

Abstract

Context: The process of selecting machine learning models is complex for research software engineers, requiring careful consideration of factors like trainability and comprehensibility to ensure long-term usability and success.

Objective: This study aims to develop and evaluate a data-driven decision model that supports research software engineers in systematically selecting suitable ML models for integration into research software.

Method: A meta-model was created to guide model selection, drawing from systematic literature reviews, expert interviews, case studies, and design science. Each phase contributed valuable insights and refined the decision-making framework.

Results: The study analyzed 43 models across 72 attributes, resulting in a taxonomy of ML paradigms, approaches, and domains. Key findings include trends in model selection, combinations, evaluation metrics, and datasets. The decision model was further refined through expert feedback and validated with 11 case studies.

Contribution: This data-driven decision model supports research software engineers in selecting optimal ML models for integration into research software. Continued development is recommended to enhance its accuracy and applicability across varied research scenarios.

Keywords

Data-driven decision model, Decision-making meta-model, Machine learning model, Model taxonomy, Research software engineering, Software, Information Systems, Hardware and Architecture

Citation

Baninemeh, E, Steffens, L, Jansen, S & Farshidi, S 2026, 'A data-driven decision model for selecting ML models in research software', Journal of Systems and Software, vol. 236, 112821. https://doi.org/10.1016/j.jss.2026.112821

URI

https://dspace.library.uu.nl/handle/1874/480917
