Event Log Extraction for Process Mining Using Large Language Models

Publication date

2025-02-14

Authors

Stein Dani, ViniciusISNI 0000000507443397
Dees, Marcus
Leopold, Henrik
Busch, Kiran
Beerepoot, I.M.ISNI 0000000492835880
Van Der Werf, Jan MartijnORCID 0000-0002-7264-381XISNI 0000000119806432
Reijers, H.A.ORCID 0000-0001-9634-5852ISNI 0000000037238136

Editors

Advisors

Supervisors

Document Type

Part of book
Open Access logo

License

taverne

Abstract

Process mining is a discipline that enables organizations to discover and analyze their work processes. A prerequisite for conducting a process mining initiative is the so-called event log, which is not always readily available. In such cases, extracting an event log involves various time-consuming tasks, such as creating tailor-made structured query language (SQL) scripts to extract an event log from a relational database. With this work, we investigate the use of large language models (LLMs) to support event log extraction, particularly by leveraging LLMs ability to produce SQL scripts. In this paper, we report on how effectively an LLM can assist with event log extraction for process mining. Despite the intrinsic non-deterministic nature of LLMs, our results show the potential of future LLM-assisted event log extraction tools, especially when domain and data knowledge are available. The implementation of such tools can increase access to event log extraction to a broader range of users within an organization by reducing the reliance on specialized technical skills for producing relational database query scripts and minimizing manual effort.

Keywords

Taverne

Citation

Stein Dani, V, Dees, M, Leopold, H, Busch, K, Beerepoot, I, van der Werf, J M & Reijers, H 2025, Event Log Extraction for Process Mining Using Large Language Models. in 30th International Conference on Cooperative Information Systems (CoopIS 2024). Lecture Notes in Computer Science, vol. 15506, Springer, pp. 56-72. https://doi.org/10.1007/978-3-031-81375-7_4