Written expression curriculum-based measurement (WE-CBM) is a formative assessment approach for screening and progress monitoring. To extend evaluation of WE-CBM, we compared hand-calculated and automated scoring approaches in relation to the number of screening samples needed per student for valid scores, the long-term predictive validity and diagnostic accuracy of scores, and predictive and diagnostic bias for underrepresented student groups. Second- to fifth-grade students (n = 609) completed five WE-CBM tasks during one academic year and a standardised writing test in fourth and seventh grade. Averaging WE-CBM scores across multiple samples improved validity. Complex hand-calculated metrics and automated tools outperformed simpler metrics for the long-term prediction of writing performance. No evidence of bias was observed between African American and Hispanic students. The study illustrates the absence of test bias as a necessary condition for fair and equitable screening procedures and the importance of future research that includes comparisons with majority groups.

Matta, M., Mercer, S., & Keller-Margulis, M. (2022). Evaluating validity and bias for hand-calculated and automated written expression curriculum-based measurement scores. Assessment in Education, 29(2), 200-218 [10.1080/0969594X.2022.2043240].

Evaluating validity and bias for hand-calculated and automated written expression curriculum-based measurement scores

Matta M.
2022

Journal article - Scientific article
automated text evaluation; curriculum-based measurement; predictive bias; predictive validity; written expression
English
28 Feb 2022
2022, volume 29, issue 2, pp. 200-218

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/505539