Article

Assessing L2 English speaking using automated scoring technology: examining automarker reliability

Journal

Assessment in Education: Principles, Policy & Practice

Publisher

Routledge Journals, Taylor & Francis Ltd
DOI: 10.1080/0969594X.2021.1979467

Keywords

Automated scoring; L2 speaking assessment; limits of agreement

This study found that the automarker used in an online oral English test showed good reliability but tended to be slightly more lenient than examiners, particularly for low-proficiency speakers. An uncertainty measure termed Language Quality, which reflects the confidence of the speech recogniser, proved useful for predicting automarker reliability and flagging abnormal speech.
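To make the flagging idea concrete, here is a minimal sketch of how an ASR confidence measure could be used to gate automarker scores. The function name, the `language_quality` value, and the 0.5 cut-off are all invented for illustration; the paper does not publish the metric's definition or any operational threshold.

```python
# Hypothetical illustration: routing responses by speech-recognition confidence.
# 'language_quality' and the 0.5 threshold are assumptions for this sketch only.
def route_response(automarker_score: float, language_quality: float,
                   threshold: float = 0.5) -> str:
    """Accept the automarker score when recognition confidence is adequate,
    otherwise flag the response for human marking."""
    if language_quality < threshold:
        return "flag: route response to a human examiner"
    return f"accept automarker score {automarker_score:.1f}"

print(route_response(4.5, 0.82))  # confident recognition -> score accepted
print(route_response(3.0, 0.31))  # low confidence (e.g. abnormal speech) -> flagged
```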
Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing scoring algorithms and research evidence about the reliability of automated scoring. This paper reports on a study that investigated the reliability of an automarker using candidate responses produced in an online oral English test. Based on 'limits of agreement' and multi-faceted Rasch analyses on automarker scores and individual examiner scores, the study found that the automarker, while exhibiting excellent internal consistency, was slightly more lenient than examiner fair average scores, particularly for low-proficiency speakers. Additionally, it was found that an automarker uncertainty measure termed Language Quality, which indicates the confidence of speech recognition, was useful for predicting automarker reliability and flagging abnormal speech.
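For readers unfamiliar with the 'limits of agreement' method cited in the abstract, the following is a minimal sketch of a Bland-Altman analysis of paired automarker and examiner scores. The score arrays are invented for illustration and are not the study's data.

```python
import numpy as np

# Invented paired scores, one pair per candidate (illustrative values only):
# the automarker score and the corresponding examiner fair-average score.
automarker = np.array([4.5, 3.0, 5.5, 2.5, 4.0, 3.5, 5.0, 2.0])
examiner   = np.array([4.0, 3.0, 5.5, 2.0, 4.5, 3.0, 5.0, 1.5])

diff = automarker - examiner   # a positive mean => the automarker scores more leniently
bias = diff.mean()             # systematic difference between the two raters
sd = diff.std(ddof=1)          # standard deviation of the differences

# Bland-Altman 95% limits of agreement: bias +/- 1.96 * SD of the differences
lower, upper = bias - 1.96 * sd, bias + 1.96 * sd
print(f"bias = {bias:.2f}; 95% limits of agreement = [{lower:.2f}, {upper:.2f}]")
```

Narrow limits centred near zero would indicate close automarker-examiner agreement; a positive bias, as reported in the study, indicates automarker leniency.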
