Undesirable Biases in NLP: Addressing Challenges of Measurement

Van der Wal, Oskar; Bachmann, D; Leidinger, A; van Maanen, L; Zuidema, W; Schulz, K

doi:https://doi.org/10.1613/jair.1.15195

Undesirable Biases in NLP: Addressing Challenges of Measurement

Files

jair.1.15195.pdf (528.91 KB)

Publication date

2024

Authors

Van der Wal, Oskar

Bachmann, Dominik

Leidinger, A

van Maanen, Leendert

Zuidema, W

Schulz, K

DOI

https://doi.org/10.1613/jair.1.15195

Document Type

Article

Metadata

Show full item record

Collections

Utrecht University Repository

License

cc_by

Abstract

As Large Language Models and Natural Language Processing (NLP) technology rapidly develop and spread into daily life, it becomes crucial to anticipate how their use could harm people. One problem that has received a lot of attention in recent years is that this technology has displayed harmful biases, from generating derogatory stereotypes to producing disparate outcomes for different social groups. Although a lot of effort has been invested in assessing and mitigating these biases, our methods of measuring the biases of NLP models have serious problems and it is often unclear what they actually measure. In this paper, we provide an interdisciplinary approach to discussing the issue of NLP model bias by adopting the lens of psychometrics — a field specialized in the measurement of concepts like bias that are not directly observable. In particular, we will explore two central notions from psychometrics, the construct validity and the reliability of measurement tools, and discuss how they can be applied in the context of measuring model bias. Our goal is to provide NLP practitioners with methodological tools for designing better bias measures, and to inspire them more generally to explore tools from psychometrics when working on bias measurement tools.

Keywords

Artificial Intelligence

Citation

Van der Wal, O, Bachmann, D, Leidinger, A, van Maanen, L, Zuidema, W & Schulz, K 2024, 'Undesirable Biases in NLP : Addressing Challenges of Measurement', Journal of Artificial Intelligence Research, vol. 79, pp. 1-40. https://doi.org/10.1613/jair.1.15195

URI

https://dspace.library.uu.nl/handle/1874/438112

Undesirable Biases in NLP: Addressing Challenges of Measurement

Files

Publication date

Authors

Editors

Advisors

Supervisors

DOI

Document Type

Metadata

Collections

License

Abstract

Keywords

Citation

URI