Journal
2021 18TH CONFERENCE ON ROBOTS AND VISION (CRV 2021)
Volume -, Issue -, Pages 151-157
Publisher
IEEE
DOI: 10.1109/CRV52889.2021.00028
Keywords
engagement detection; spatio-temporal; Temporal Convolutional Network; Residual Neural Network
A novel hybrid neural network architecture, combining 2D ResNet and TCN, is proposed for detecting students' engagement levels in videos, outperforming other methods and setting a new baseline for future research.
Automatic detection of students' engagement in online learning settings is key to improving the quality of learning and to delivering personalized learning materials. The varying level of engagement that students exhibit in an online classroom is an affective behavior that takes place over space and time. Therefore, we formulate detecting levels of students' engagement from videos as a spatio-temporal classification problem. In this paper, we present a novel end-to-end Residual Network (ResNet) and Temporal Convolutional Network (TCN) hybrid neural network architecture for students' engagement level detection in videos. The 2D ResNet extracts spatial features from consecutive video frames, and the TCN analyzes the temporal changes in video frames to detect the level of engagement. The spatial and temporal arms of the hybrid network are jointly trained on raw video frames of a large publicly available students' engagement detection dataset, DAiSEE. We compared our method with several competing students' engagement detection methods on this dataset. The ResNet+TCN architecture outperforms all other studied methods, improves the state-of-the-art engagement level detection accuracy, and sets a new baseline for future research.
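The temporal arm described above can be illustrated with a minimal sketch of the causal dilated convolution that TCNs are generally built from. This is not the paper's implementation: the function names, the scalar per-frame feature (a stand-in for a ResNet feature vector), the difference kernel, and the dilations are all assumptions chosen for illustration, and a real TCN would add learned weights, nonlinearities, and residual connections.

```python
from typing import List, Sequence


def causal_dilated_conv1d(
    signal: Sequence[float], kernel: Sequence[float], dilation: int = 1
) -> List[float]:
    """Causal 1D convolution: output[t] uses signal[t], signal[t - d], ... only."""
    out = []
    for t in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = t - j * dilation  # look back j * dilation steps, never forward
            if idx >= 0:  # positions before the first frame act as zero padding
                acc += w * signal[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size: int, dilations: Sequence[int]) -> int:
    """Number of time steps visible after stacking one conv layer per dilation."""
    return 1 + (kernel_size - 1) * sum(dilations)


# Hypothetical input: one scalar per frame stands in for a ResNet feature vector.
frame_features = [0.0, 0.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0]

# Hypothetical fixed kernel that responds to change between a frame and an
# earlier one; in a trained TCN these weights would be learned.
difference_kernel = [1.0, -1.0]

# Two stacked layers with growing dilation widen the temporal context.
layer1 = causal_dilated_conv1d(frame_features, difference_kernel, dilation=1)
layer2 = causal_dilated_conv1d(layer1, difference_kernel, dilation=2)
```

Stacking layers with increasing dilation is what lets a TCN cover long frame sequences with few layers: in this sketch, two layers with kernel size 2 and dilations 1 and 2 already see four consecutive frames, and each output depends only on the current and earlier frames.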