The impact of LLM-generated models on novice domain modelers: a comparative experiment

Bragilovski, Maxim; van Can, Ashley T.; Dalpiaz, Fabiano; Sturm, Arnon

doi:https://doi.org/10.1007/s10664-026-10831-5

The impact of LLM-generated models on novice domain modelers: a comparative experiment

Files

s10664-026-10831-5.pdf (2.9 MB)

Publication date

2026-03-20

Authors

Bragilovski, Maxim

van Can, Ashley T.

Dalpiaz, Fabiano

Sturm, Arnon

DOI

https://doi.org/10.1007/s10664-026-10831-5

Document Type

Article

Metadata

Show full item record

Collections

Utrecht University Repository

License

cc_by

Abstract

In software development, domain models are conceptual blueprints that capture the structure, relationships, and key entities of a problem domain. Automated techniques can support analysts and developers by extracting such models from existing artifacts. However, this is a non-trivial task, especially when the input consists of informal artifacts such as user stories. This paper investigates how providing an initial, automatically generated domain model influences novice developers’ ability to construct their own domain models. We conducted an experiment involving 127 undergraduate students, divided into three groups: one receiving an LLM-generated model that maximizes precision (validity), one receiving an LLM-generated model that boosts recall (completeness), and a control group that did not receive any initial domain model. Our findings show that novices who received an initial LLM-generated model produced more complete class identification in simple user story projects and improved relationship identification accuracy in both simple and complex projects. While initial domain models appear to aid novices in refining domain models effectively, our results also suggest a strong tendency among participants to rely heavily on these initial models. On average, students included 85% of the correct classes and 67% of the incorrect classes from the initial domain models in their own derived models. Such reliance can scaffold novice learning and refinement, but this reliance may limit creativity and hinder the deeper reasoning required to develop robust domain modeling skills.

Keywords

Comparative experiment, Domain model, Large language model, User story, Software

Citation

Bragilovski, M, van Can, A T, Dalpiaz, F & Sturm, A 2026, 'The impact of LLM-generated models on novice domain modelers : a comparative experiment', Empirical Software Engineering, vol. 31, no. 4, 96. https://doi.org/10.1007/s10664-026-10831-5

URI

https://dspace.library.uu.nl/handle/1874/483857

The impact of LLM-generated models on novice domain modelers: a comparative experiment

Files

Publication date

Authors

Editors

Advisors

Supervisors

DOI

Document Type

Metadata

Collections

License

Abstract

Keywords

Citation

URI