Contrasting State-of-the-Art in the Machine Scoring of Short-Form Constructed Responses

EDUCATIONAL ASSESSMENT (2015)

Journal

EDUCATIONAL ASSESSMENT

Volume 20, Issue 1, Pages 46-65

Publisher

ROUTLEDGE JOURNALS, TAYLOR & FRANCIS LTD

DOI: 10.1080/10627197.2015.997617

Abstract

This study compared the scoring of short-form constructed responses by human raters and by machine scoring algorithms. The context was a public competition in which both public competitors and commercial vendors vied to develop machine scoring algorithms that would match or exceed the performance of operational human raters in a summative high-stakes testing environment. The data (N = 25,683) came from three states, covered 10 prompts, and spanned two secondary grade levels. Samples ranging in size from 2,130 to 2,999 were randomly selected from the data sets provided by the states and then randomly divided into three sets: a training set, a test set, and a validation set. On every agreement measure, machine performance failed to match that of the human raters. The study concludes with recommendations on steps that might improve machine scoring algorithms before they are used operationally.
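
The random division into training, test, and validation sets described in the abstract can be illustrated with a minimal Python sketch. The abstract does not report the split proportions, so the 60/20/20 fractions, the fixed seed, and the function name `three_way_split` are assumptions chosen only for illustration, not the procedure the study used.

```python
import random


def three_way_split(items, fractions=(0.6, 0.2, 0.2), seed=0):
    """Shuffle a copy of `items` and cut it into three disjoint sets.

    The fractions are hypothetical; the abstract does not state them.
    """
    if abs(sum(fractions) - 1.0) > 1e-9:
        raise ValueError("fractions must sum to 1")
    shuffled = list(items)                 # copy so the caller's list is untouched
    random.Random(seed).shuffle(shuffled)  # seeded for a reproducible split
    n = len(shuffled)
    n_train = int(n * fractions[0])
    n_test = int(n * fractions[1])
    train = shuffled[:n_train]
    test = shuffled[n_train:n_train + n_test]
    validation = shuffled[n_train + n_test:]  # remainder goes to validation
    return train, test, validation


# Stand-in response IDs for the smallest sample size reported (2,130).
responses = list(range(2130))
train, test, validation = three_way_split(responses)
print(len(train), len(test), len(validation))
```

Because the validation set takes whatever remains after the first two cuts, every response lands in exactly one set regardless of rounding.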
