Does It Capture STEL? A Modular, Similarity-based Linguistic Style Evaluation Framework

Publication date

2021-11

Authors

Wegmann, Anna MariaISNI 0000000493074999
Nguyen, DongISNI 0000000419527451

Editors

Moens, Marie-Francine
Huang, Xuanjing
Specia, Lucia
Yih, Scott Wen-tau

Advisors

Supervisors

Document Type

Part of book
Open Access logo

License

cc_by

Abstract

Style is an integral part of natural language. However, evaluation methods for style measures are rare, often task-specific and usually do not control for content. We propose the modular, fine-grained and content-controlled similarity-based STyle EvaLuation framework (STEL) to test the performance of any model that can compare two sentences on style. We illustrate STEL with two general dimensions of style (formal/informal and simple/complex) as well as two specific characteristics of style (contrac'tion and numb3r substitution). We find that BERT-based methods outperform simple versions of commonly used style measures like 3-grams, punctuation frequency and LIWC-based approaches. We invite the addition of further tasks and task instances to STEL and hope to facilitate the improvement of style-sensitive measures.

Keywords

Citation

Wegmann, A & Nguyen, D 2021, Does It Capture STEL? A Modular, Similarity-based Linguistic Style Evaluation Framework. in M-F Moens, X Huang, L Specia & S W Yih (eds), Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Dominican Republic, pp. 7109-7130. https://doi.org/10.18653/v1/2021.emnlp-main.569