Journal: Language Assessment Quarterly
Publisher: Routledge Journals, Taylor & Francis Ltd
DOI: 10.1080/15434303.2023.2288253
Using machines for scoring writing has significant implications for formative assessment, but the validity of machine scoring in the K-12 context needs further research.
Abstract
Machines have a long-demonstrated ability to find statistical relationships between qualities of texts and surface-level linguistic indicators of writing. More recently, advances in artificial intelligence have unlocked the potential of using machines to identify content-related writing trait criteria. This development is significant, especially in formative assessment contexts where feedback is key. Yet the extent to which writing traits can be validly scored by machines remains under-researched, especially in the K-12 context. The present study investigated the validity of machine learning (ML) models designed to score three writing traits for students in grades 3-6: task fulfillment, organization and coherence, and vocabulary and expression. The study utilized an argument-based approach, focusing on two primary inferences: evaluation and explanation. The evaluation inference investigated human-machine score alignment, the ability of the models to detect off-topic and gibberish responses, and the consistency of human-machine score alignment across grades and language backgrounds. The explanation inference investigated the relevance of the features used in the models. Results indicated that human-machine score alignment was sufficient for all writing traits; however, validity concerns were raised regarding the models' performance in detecting off-topic and gibberish responses and the consistency of alignment across sub-groups. Implications for language assessment professionals and other educators are discussed.
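The abstract does not specify how human-machine score alignment was quantified. A common choice in automated essay scoring research is quadratic weighted kappa (QWK), which rewards agreement on ordinal rubric scores and penalizes larger disagreements more heavily. Below is a minimal illustrative sketch, assuming integer trait scores on a fixed rubric scale; the function name and example data are hypothetical and not taken from the study.

```python
# Illustrative sketch: quadratic weighted kappa (QWK), a common (assumed,
# not confirmed by the abstract) metric for human-machine score alignment.
from itertools import product

def quadratic_weighted_kappa(human, machine, min_rating, max_rating):
    """Compute QWK between two equal-length lists of integer trait scores."""
    n = max_rating - min_rating + 1
    # Observed agreement matrix: rows = human scores, columns = machine scores
    O = [[0] * n for _ in range(n)]
    for h, m in zip(human, machine):
        O[h - min_rating][m - min_rating] += 1
    total = len(human)
    # Marginal histograms give the expected matrix under independence
    hist_h = [sum(row) for row in O]
    hist_m = [sum(O[i][j] for i in range(n)) for j in range(n)]
    num = den = 0.0
    for i, j in product(range(n), repeat=2):
        w = ((i - j) ** 2) / ((n - 1) ** 2)  # quadratic disagreement weight
        num += w * O[i][j]
        den += w * hist_h[i] * hist_m[j] / total
    return 1.0 - num / den

# Hypothetical example: human and machine scores on a 1-4 trait rubric
human = [1, 2, 3, 4, 2, 3]
machine = [1, 2, 3, 3, 2, 4]
print(round(quadratic_weighted_kappa(human, machine, 1, 4), 3))  # → 0.818
```

In K-12 scoring studies, a QWK threshold (often around 0.7, alongside human-human agreement benchmarks) is typically used to judge whether machine scores align sufficiently with human raters; the thresholds applied in this study are not stated in the abstract.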