☆ 4.5 Article

On the Validity of Machine Learning-based Next Generation Science Assessments: A Validity Inferential Network

JOURNAL OF SCIENCE EDUCATION AND TECHNOLOGY (2021)

Journal

JOURNAL OF SCIENCE EDUCATION AND TECHNOLOGY

Volume 30, Issue 2, Pages 298-312

Publisher

SPRINGER

DOI: 10.1007/s10956-020-09879-9

Keywords

Machine learning; Science assessment; Validity

Funding

Chan Zuckerberg Initiative [194933]
National Science Foundation [DUE-1561159]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This study examines the impact of machine learning on science assessments and identifies seven critical validity issues of ML-based NGSAs. A validity inferential network is proposed to address these validity issues and ensure accountable assessment design and valid interpretation and use of machine scores.

This study provides a solid validity inferential network to guide the development, interpretation, and use of machine learning-based next-generation science assessments (NGSAs). Given that machine learning (ML) has been broadly implemented in the automatic scoring of constructed responses, essays, simulations, educational games, and interdisciplinary assessments to advance the evidence collection and inference of student science learning, we contend that additional validity issues arise for science assessments due to the involvement of ML. These emerging validity issues may not be addressed by prior validity frameworks developed for either non-science or non-ML assessments. We thus examine the changes brought in by ML to science assessments and identify seven critical validity issues of ML-based NGSAs: potential risk of misrepresenting the construct of interest, potential confounders due to that more variables may involve, nonalignment between interpretation and use of scores and designed learning goals, nonalignment between interpretation and use of scores and actual learning quality, nonalignment between machine scores and rubrics, limited generalizable ability of machine algorithmic models, and limited extrapolating ability of machine algorithmic models. Based on the seven validity issues identified, we propose a validity inferential network to address the cognitive, instructional, and inferential validity of ML-based NGSAs. To demonstrate the utility of this network, we present an exemplar of ML-based next-generation science assessments that was developed using a seven-step ML framework. We articulate how we used the validity inferential network to ensure accountable assessment design, as well as valid interpretation and use of machine scores.

On the Validity of Machine Learning-based Next Generation Science Assessments: A Validity Inferential Network

Journal

JOURNAL OF SCIENCE EDUCATION AND TECHNOLOGY

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

On the Validity of Machine Learning-based Next Generation Science Assessments: A Validity Inferential Network

Journal

JOURNAL OF SCIENCE EDUCATION AND TECHNOLOGY

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper