Optimizing ASReview simulations: A generic multiprocessing solution for ‘light-data’ and ‘heavy-data’ users

Publication date

2024

Authors

Romanov, Sergei
Siqueira, Abel Soares
Bruin, Jonathan deORCID 0000-0002-4297-0502ISNI 000000051803672X
Teijema, Jelle JasperISNI 0000000507449721
Hofstee, LauraISNI 000000050674864X
Schoot, Rens van deISNI 0000000393562696

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by_nc

Abstract

Active learning can be used for optimizing and speeding up the screening phase of systematic reviews. Running simulation studies mimicking the screening process can be used to test the performance of different machine-learning models or to study the impact of different training data. This paper presents an architecture design with a multiprocessing computational strategy for running many such simulation studies in parallel, using the ASReview Makita workflow generator and Kubernetes software for deployment with cloud technologies. We provide a technical explanation of the proposed cloud architecture and its usage. In addition to that, we conducted 1140 simulations investigating the computational time using various numbers of CPUs and RAM settings. Our analysis demonstrates the degree to which simulations can be accelerated with multiprocessing computing usage. The parallel computation strategy and the architecture design that was developed in the present paper can contribute to future research with more optimal simulation time and, at the same time, ensure the safe completion of the needed processes.

Keywords

Active learning, Cloud architecture, Multiprocessing, Simulation study, Systematic review, Information Systems, Computer Science Applications, Library and Information Sciences, Artificial Intelligence

Citation

Romanov, S, Siqueira, A S, Bruin, J D, Teijema, J, Hofstee, L & Schoot, R V D 2024, 'Optimizing ASReview simulations: A generic multiprocessing solution for ‘light-data’ and ‘heavy-data’ users', Data Intelligence, vol. 6, no. 2, pp. 320-343. https://doi.org/10.1162/dint_a_00244