Towards Interpreting Topic Models with ChatGPT

Publication date

2023-08-31

Authors

Rijcken, EmilISNI 0000000511052268
Scheepers, Floortje
Zervanou, KalliopiORCID 0000-0001-9036-354XISNI 0000000138923183
Spruit, MarcoISNI 0000000077172004
Mosteiro, PabloORCID 0000-0001-7231-2773ISNI 0000000493075828
Kaymak, Uzay

Editors

Advisors

Supervisors

DOI

Document Type

Part of book
Open Access logo

License

Abstract

Topic modeling has become a popular approach to identify semantic structures in text corpora. Despite its wide applications, interpreting the outputs of topic models remains challenging. This paper presents an initial study regarding a new approach to better understand this output, leveraging the large language model ChatGPT. Our approach is built on a three- stage process where we first use topic modeling to identify the main topics in the corpus. Then, we ask a domain expert to assign themes to these topics and prompt ChatGPT to generate human- readable summaries of the topics. Lastly, we compare the human- and machine-produced interpretations. The domain expert found half of ChatGPT’s descriptions useful. This explorative work demonstrates ChatGPT’s capability to describe topics accurately and provide useful insights if prompted accurately.

Keywords

ChatGPT, Electronic Health Records, Fuzzy Topic Models, Large Language Models, Prompt Engineering, Topic Modeling

Citation

Rijcken, E, Scheepers, F, Zervanou, K, Spruit, M, Mosteiro Romero, P & Kaymak, U 2023, Towards Interpreting Topic Models with ChatGPT. in The 20th World Congress of the International Fuzzy Systems Association (IFSA 2023). International Fuzzy Systems Association, pp. 269-275, The 20th World Congress of the International Fuzzy Systems Association, Daegu, Korea, Republic of, 20/08/23., conference