Validity of Comparative Judgment Scores: How Assessors Evaluate Aspects of Text Quality When Comparing Argumentative Texts

Lesterhuis, Marije; Bouwer, Renske; Van Daal, Tine; Donche, Vincent; De Maeyer, Sven

doi:https://doi.org/10.3389/feduc.2022.823895

Validity of Comparative Judgment Scores: How Assessors Evaluate Aspects of Text Quality When Comparing Argumentative Texts

Files

feduc_07_823895.pdf (416.26 KB)

Publication date

2022-05-13

Authors

Lesterhuis, Marije

Bouwer, I.R.

Van Daal, Tine

Donche, Vincent

De Maeyer, Sven

DOI

https://doi.org/10.3389/feduc.2022.823895

Document Type

Article

Metadata

Show full item record

Collections

Utrecht University Repository

License

cc_by

Abstract

The advantage of comparative judgment is that it is particularly suited to assess multidimensional and complex constructs as text quality. This is because assessors are asked to compare texts holistically and to make a quality judgment for each text in a pairwise comparison based upon on the most salient and critical differences. Also, the resulted rank order is based on the judgment of all assessors, representing the shared consensus. In order to be able to select the right number of assessors, the question is to what extent the conceptualization of assessors prevails in the aspects they base their judgment on, or whether comparative judgment minimizes the differences between assessors. In other words, can we detect types of assessors who tend to consider certain aspects of text quality more often than others? A total of 64 assessors compared argumentative texts, after which they provided decision statements on what aspects of text quality had informed their judgment. These decision statements were coded on six overarching themes of text quality: argumentation, organization, language use, language conventions, source use, references, and layout. Using a multilevel-latent class analysis, four different types of assessors could be distinguished: narrowly focused, broadly focused, source-focused, and language-focused. However, the analysis also showed that all assessor types mainly focused on argumentation and organization, and that assessor types only partly explained whether the aspect of text quality was mentioned in a decision statement. We conclude that comparative judgment is a strong method for comparing complex constructs like text quality. First, because the rank order combines different views on text quality, but foremost because the method of comparative judgment minimizes differences between assessors.

Keywords

assessor cognition, comparative judgment, latent class analysis, validity, writing assessment, Education

Citation

Lesterhuis, M, Bouwer, R, Van Daal, T, Donche, V & De Maeyer, S 2022, 'Validity of Comparative Judgment Scores : How Assessors Evaluate Aspects of Text Quality When Comparing Argumentative Texts', Frontiers in Education, vol. 7, 823895. https://doi.org/10.3389/feduc.2022.823895

URI

http://hdl.handle.net/1874/426470

Validity of Comparative Judgment Scores: How Assessors Evaluate Aspects of Text Quality When Comparing Argumentative Texts

Files

Publication date

Authors

Editors

Advisors

Supervisors

DOI

Document Type

Metadata

Collections

License

Abstract

Keywords

Citation

URI