JST and rJST: joint estimation of sentiment and topics in textual data using a semi-supervised approach

Pipal, Christian; Schoonvelde, Martijn; Schumacher, Gijs; Boiten, Max

doi:https://doi.org/10.1080/19312458.2024.2383453

JST and rJST: joint estimation of sentiment and topics in textual data using a semi-supervised approach

Files

s00382-018-4444-4.pdf (9.7 MB)

Publication date

2025

Authors

Pipal, Christian

Schoonvelde, Martijn

Schumacher, Gijs

Boiten, Max

DOI

https://doi.org/10.1080/19312458.2024.2383453

Document Type

Article

Metadata

Show full item record

Collections

Utrecht University Repository

License

cc_by

Abstract

This paper demonstrates the performance of the Joint Sentiment Topic model (JST) and the reversed Joint Sentiment Topic model (rJST) in measuring sentiment in political speeches, comparing them against a set of popular methods for sentiment analysis: widely used off-the-shelf sentiment dictionaries; an embeddings-enhanced dictionary approach; Latent Semantic Scaling, a semi-supervised approach; and a zero-shot transformer-based approach using a large language model (GPT-4). The findings reveal JST’s superiority over all non-transformer-based approaches in predicting human-coded sentiment in multiple languages and its ability to replicate known sentiment trends in legislative speech. rJST, meanwhile, provides valuable topic-specific sentiment estimates, responsive to political dynamics and significant events. Both models are, however, outperformed by transformer-based models like GPT-4. Additionally, the paper introduces the ’sentitopics’ R-package, designed to facilitate the use of JST and rJST in computational text analysis workflows. This package is compatible with popular text analysis tools, making the models accessible for applied researchers in communication science.

Keywords

Communication

Citation

Pipal, C, Schoonvelde, M, Schumacher, G & Boiten, M 2025, 'JST and rJST : joint estimation of sentiment and topics in textual data using a semi-supervised approach', Communication Methods and Measures, vol. 19, no. 2, pp. 112-130. https://doi.org/10.1080/19312458.2024.2383453

URI

https://dspace.library.uu.nl/handle/1874/469180

JST and rJST: joint estimation of sentiment and topics in textual data using a semi-supervised approach

Files

Publication date

Authors

Editors

Advisors

Supervisors

DOI

Document Type

Metadata

Collections

License

Abstract

Keywords

Citation

URI