Article

Deep Reinforcement Learning for Adaptive Learning Systems

Journal

JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS
Volume 48, Issue 2, Pages 220-243

Publisher

SAGE PUBLICATIONS INC
DOI: 10.3102/10769986221129847

Keywords

adaptive learning system; transition model estimator; Markov decision process; deep reinforcement learning; deep Q-learning; neural networks; model-free

Abstract

This article addresses the adaptive learning problem with continuous latent traits and proposes a deep Q-learning algorithm, together with a transition model estimator, to efficiently find the optimal learning policy for learners.
The adaptive learning problem concerns how to create an individualized learning plan (also referred to as a learning policy) that chooses the most appropriate learning materials based on a learner's latent traits. In this article, we study an important yet less-addressed version of the adaptive learning problem, one that assumes continuous latent traits. Specifically, we formulate the adaptive learning problem as a Markov decision process. We assume the latent traits to be continuous with an unknown transition model and apply a model-free deep reinforcement learning algorithm, the deep Q-learning algorithm, which can effectively find the optimal learning policy from data on learners' learning processes without knowing the actual transition model of the learners' continuous latent traits. To use the available data efficiently, we also develop a transition model estimator that emulates the learner's learning process using neural networks. The transition model estimator can be incorporated into the deep Q-learning algorithm so that the optimal learning policy for a learner is discovered more efficiently. Numerical simulation studies verify that the proposed algorithm is highly efficient at finding a good learning policy; in particular, with the aid of the transition model estimator, it can find the optimal learning policy after training on only a small number of learners.
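
The sketch below illustrates, in schematic form, how a transition model estimator can be combined with deep Q-learning as described above; it is not the authors' implementation. The trait dimension, number of learning materials, network sizes, the reward defined as trait gain, and the Dyna-style use of model-simulated transitions are all illustrative assumptions.

```python
# Minimal sketch (not the paper's code): deep Q-learning over continuous latent
# traits, with a neural "transition model estimator" that emulates how a
# learner's traits change after studying a material. Simulated transitions from
# the learned model provide extra Q-updates, so fewer real learners are needed.
import random
import torch
import torch.nn as nn

TRAIT_DIM = 3        # assumed dimension of the continuous latent trait vector
N_MATERIALS = 5      # assumed number of candidate learning materials (actions)
GAMMA = 0.9          # discount factor (illustrative)

class QNetwork(nn.Module):
    """Maps a latent-trait state to one Q-value per learning material."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(TRAIT_DIM, 64), nn.ReLU(), nn.Linear(64, N_MATERIALS))
    def forward(self, state):
        return self.net(state)

class TransitionModel(nn.Module):
    """Emulates the learner: predicts the next latent-trait vector from the
    current traits and a one-hot encoding of the chosen material."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(TRAIT_DIM + N_MATERIALS, 64), nn.ReLU(),
            nn.Linear(64, TRAIT_DIM))
    def forward(self, state, action_onehot):
        return self.net(torch.cat([state, action_onehot], dim=-1))

q_net = QNetwork()
model = TransitionModel()
q_opt = torch.optim.Adam(q_net.parameters(), lr=1e-3)
m_opt = torch.optim.Adam(model.parameters(), lr=1e-3)
replay = []  # observed (state, action, reward, next_state) tuples

def q_update(s, a, r, s_next):
    """One temporal-difference update of the Q-network on a single transition."""
    with torch.no_grad():
        target = r + GAMMA * q_net(s_next).max()
    loss = (q_net(s)[a] - target) ** 2
    q_opt.zero_grad(); loss.backward(); q_opt.step()

def train_step(s, a, r, s_next, n_simulated=5):
    """Process one real transition from a learner, then learn from the model."""
    replay.append((s, a, r, s_next))
    # 1) Fit the transition model estimator to the observed learning data.
    onehot = torch.eye(N_MATERIALS)[a]
    m_loss = nn.functional.mse_loss(model(s, onehot), s_next)
    m_opt.zero_grad(); m_loss.backward(); m_opt.step()
    # 2) Standard deep Q-learning update on the real transition.
    q_update(s, a, r, s_next)
    # 3) Extra updates on transitions simulated by the learned model.
    for _ in range(n_simulated):
        s_sim, _, _, _ = random.choice(replay)
        a_new = random.randrange(N_MATERIALS)
        with torch.no_grad():
            s_next_sim = model(s_sim, torch.eye(N_MATERIALS)[a_new])
        r_sim = (s_next_sim - s_sim).sum()  # assumed reward: gain in traits
        q_update(s_sim, a_new, r_sim, s_next_sim)
```

In a real deployment, `train_step` would be driven by observed learner responses rather than the random replay sampling shown here; the point of the sketch is only that a learned transition model lets the Q-network train on many emulated learning trajectories per real one.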
