4.2 Article

Combining Human and Automated Scoring Methods in Experimental Assessments of Writing: A Case Study Tutorial

出版社

SAGE PUBLICATIONS INC
DOI: 10.3102/10769986231207886

关键词

text analysis; randomized controlled trial; automated scoring; argumentative writing

向作者/读者索取更多资源

This article introduces a pipeline for using machine-based text analysis and data mining tools to analyze the impacts of text outcomes, providing a more comprehensive understanding of experimental evaluations. Through a case study in the field of education, it demonstrates how machine learning can enrich impact evaluations by providing a detailed picture of the mechanisms behind stronger argumentative writing.
In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome. This process is both time and labor-intensive, which creates a persistent barrier for large-scale assessments of text. Furthermore, enriching one's understanding of a found impact on text outcomes via secondary analyses can be difficult without additional scoring efforts. The purpose of this article is to provide a pipeline for using machine-based text analytic and data mining tools to augment traditional text-based impact analysis by analyzing impacts across an array of automatically generated text features. In this way, we can explore what an overall impact signifies in terms of how the text has evolved due to treatment. Through a case study based on a recent field trial in education, we show that machine learning can indeed enrich experimental evaluations of text by providing a more comprehensive and fine-grained picture of the mechanisms that lead to stronger argumentative writing in a first- and second-grade content literacy intervention. Relying exclusively on human scoring, by contrast, is a lost opportunity. Overall, the workflow and analytical strategy we describe can serve as a template for researchers interested in performing their own experimental evaluations of text.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据