Annotator disagreement in RST annotation schemes

Publication date

2025-06-13

Authors

Ignatev, Daniil (ORCID 0009-0006-0455-5224)
Paperno, Denis (ISNI 000000037085651X)
Poesio, Massimo (ORCID 0000-0001-8469-2072; ISNI 0000000124478066)

Document Type

Article
Open Access

License

CC BY

Abstract

Discourse parsing within the Rhetorical Structure Theory (RST) framework has inspired extensive research; however, it remains prone to significant levels of annotator disagreement, particularly in the labeling of relations and nuclearity. This paper investigates systematic discrepancies in RST annotations, focusing on two expert-annotated corpora of closely related languages. We first compare different RST treebanks to assess the availability of parallel-labeled data and highlight their usefulness for studying disagreement. We then perform both quantitative and qualitative analyses of annotation divergences, identifying factors that contribute significantly to inconsistent interpretations. Finally, we propose two practical approaches for addressing disagreement: (1) filtering out unhelpful biases and (2) capturing legitimate ambiguity through more flexible annotation schemes.

Citation

Ignatev, D, Paperno, D & Poesio, M 2025, 'Annotator disagreement in RST annotation schemes', Society for Computation in Linguistics, vol. 8, no. 1, 7. https://doi.org/10.7275/scil.3137