4.6 Article

A predictive analytics framework as a countermeasure for attrition of students

期刊

INTERACTIVE LEARNING ENVIRONMENTS
卷 30, 期 6, 页码 1028-1043

出版社

ROUTLEDGE JOURNALS, TAYLOR & FRANCIS LTD
DOI: 10.1080/10494820.2019.1709209

关键词

Educational data mining; big data; machine learning; distance learning; learning analytics

向作者/读者索取更多资源

This article presents a method using big data mining and machine learning techniques to address the issue of student attrition in distance learning. By analyzing and predicting different datasets from the Hellenic Open University, at-risk students can be identified in a timely manner, allowing for personalized intervention.
Attrition is one of the main concerns in distance learning due to the impact on the incomes and institutions reputation. Timely identification of students at risk has high practical value in effective students' retention services. Big Data mining and machine learning methods are applied to manipulate, analyze, and predict students' failure, supporting self-directed learning. Despite the extensive application of data mining to education, the imbalance problem in minority classes of students' attrition is often overlooked in conventional models. This document proposes a large data frame using the Hadoop ecosystem and the application of machine learning techniques to different datasets of an academic year at the Hellenic Open University. Datasets were divided into 35 weeks; 32 classifiers were created, compared and statistically analyzed to address the minority classes' imbalance of student's failure. The algorithms MetaCost-SMO and C4.5 provide the most accurate performance for each target class. Early predictions of timeframes determine a remarkable performance, while the importance of written assignments and specific quizzes is noticeable. The models' performance in any week is exploited by developing a prediction tool for student attrition, contributing to timely and personalized intervention.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据