Validity of Comparative Judgment Scores: How Assessors Evaluate Aspects of Text Quality When Comparing Argumentative Texts

Publication date

2022-05-13

Authors

Lesterhuis, Marije
Bouwer, I.R.ORCID 0000-0003-0434-0224ISNI 0000000419448657
Van Daal, Tine
Donche, Vincent
De Maeyer, Sven

Editors

Advisors

Supervisors

Document Type

Article
Open Access logo

License

cc_by

Abstract

The advantage of comparative judgment is that it is particularly suited to assess multidimensional and complex constructs as text quality. This is because assessors are asked to compare texts holistically and to make a quality judgment for each text in a pairwise comparison based upon on the most salient and critical differences. Also, the resulted rank order is based on the judgment of all assessors, representing the shared consensus. In order to be able to select the right number of assessors, the question is to what extent the conceptualization of assessors prevails in the aspects they base their judgment on, or whether comparative judgment minimizes the differences between assessors. In other words, can we detect types of assessors who tend to consider certain aspects of text quality more often than others? A total of 64 assessors compared argumentative texts, after which they provided decision statements on what aspects of text quality had informed their judgment. These decision statements were coded on six overarching themes of text quality: argumentation, organization, language use, language conventions, source use, references, and layout. Using a multilevel-latent class analysis, four different types of assessors could be distinguished: narrowly focused, broadly focused, source-focused, and language-focused. However, the analysis also showed that all assessor types mainly focused on argumentation and organization, and that assessor types only partly explained whether the aspect of text quality was mentioned in a decision statement. We conclude that comparative judgment is a strong method for comparing complex constructs like text quality. First, because the rank order combines different views on text quality, but foremost because the method of comparative judgment minimizes differences between assessors.

Keywords

assessor cognition, comparative judgment, latent class analysis, validity, writing assessment, Education

Citation

Lesterhuis, M, Bouwer, R, Van Daal, T, Donche, V & De Maeyer, S 2022, 'Validity of Comparative Judgment Scores : How Assessors Evaluate Aspects of Text Quality When Comparing Argumentative Texts', Frontiers in Education, vol. 7, 823895. https://doi.org/10.3389/feduc.2022.823895