Comparative approaches to the assessment of writing: Reliability and validity of benchmark rating and comparative judgement

Publication date

2024-03

Authors

Bouwer, Renske (ORCID 0000-0003-0434-0224; ISNI 0000000419448657)
Lesterhuis, Marije
De Smedt, Fien
Van Keer, Hilde
De Maeyer, Sven

Document Type

Article
Open Access

License

CC BY-NC-ND

Abstract

In recent years, comparative assessment approaches have gained ground as a viable method to assess text quality. Instead of assigning absolute scores to a text, as in holistic or analytic scoring methods, raters in comparative assessments judge text quality either by comparing texts to pre-selected benchmarks representing different levels of writing quality (i.e., the benchmark rating method) or by making a series of pairwise comparisons with other texts in the sample (i.e., comparative judgement; CJ). In the present study, text quality scores from the benchmark rating method and CJ are compared in terms of their reliability, convergent validity and scoring distribution. Results show that benchmark ratings and CJ ratings were highly consistent and converged on the same construct of text quality. However, the distribution of benchmark ratings showed a central tendency. It is discussed how both methods can be integrated and used so that writing can be assessed reliably, validly and efficiently in both writing research and practice.
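
As a rough illustration of how a series of pairwise comparisons can be turned into a quality scale, the sketch below fits a Bradley-Terry model with a simple iterative (minorisation-maximisation) update. This is an assumption made for illustration only: the abstract does not state which model or software the study used. The function name `bradley_terry`, the text labels and the comparison data are all hypothetical.

```python
import math
from collections import Counter


def bradley_terry(comparisons, n_iter=1000, tol=1e-10):
    """Estimate log-strengths from (winner, loser) pairs.

    Illustrative sketch only. Assumes every text wins at least once and
    loses at least once; otherwise the estimates are not finite.
    Returns a dict mapping each text to a score centred on zero.
    """
    texts = sorted({t for pair in comparisons for t in pair})
    wins = Counter(winner for winner, _ in comparisons)
    losses = Counter(loser for _, loser in comparisons)
    if any(wins[t] == 0 or losses[t] == 0 for t in texts):
        raise ValueError("each text needs at least one win and one loss")

    # Number of times each unordered pair of texts was compared.
    n_pair = Counter(frozenset(pair) for pair in comparisons)

    log_s = {t: 0.0 for t in texts}  # start with equal strengths
    for _ in range(n_iter):
        s = {t: math.exp(v) for t, v in log_s.items()}
        new = {}
        for i in texts:
            denom = sum(
                n_pair[frozenset((i, j))] / (s[i] + s[j])
                for j in texts
                if j != i
            )
            new[i] = math.log(wins[i] / denom)
        # Centre the scores: the scale is only identified up to a shift.
        mean = sum(new.values()) / len(new)
        new = {t: v - mean for t, v in new.items()}
        change = max(abs(new[t] - log_s[t]) for t in texts)
        log_s = new
        if change < tol:
            break
    return log_s


# Hypothetical judgements: each tuple is (preferred text, other text).
comparisons = (
    [("A", "B")] * 3 + [("B", "A")]
    + [("B", "C")] * 3 + [("C", "B")]
    + [("A", "C")] * 3 + [("C", "A")]
)
scores = bradley_terry(comparisons)
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking)
```

With these made-up judgements, text A is preferred most often and text C least often, so the estimated scores order the texts A, B, C. Scores from a benchmark rating of the same texts could then be correlated with such estimates to examine convergent validity.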

Keywords

benchmark rating, comparative judgement, convergent validity, reliability, writing assessment

Citation

Bouwer, R, Lesterhuis, M, De Smedt, F, Van Keer, H & De Maeyer, S 2024, 'Comparative approaches to the assessment of writing: Reliability and validity of benchmark rating and comparative judgement', Journal of Writing Research, vol. 15, no. 3, pp. 498-518. https://doi.org/10.17239/jowr-2024.15.03.03