☆ 4.7 Article

Bias-Corrected Q-Learning With Multistate Extension

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2019)

Journal

IEEE TRANSACTIONS ON AUTOMATIC CONTROL

Volume 64, Issue 10, Pages 4011-4023

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TAC.2019.2912443

Keywords

Bias correction; electricity storage; Q-learning; smart grid

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Q-learning is a sample-based model-free algorithm that solves Markov decision problems asymptotically, but in finite time, it can perform poorly when random rewards and transitions result in large variance of value estimates. We pinpoint its cause to be the estimation bias due to the maximum operator in Q-learning algorithm, and present the evidence of max-operator bias in its Q value estimates. We then present an asymptotically optimal bias-correction strategy and construct an extension to bias-corrected Q-learning algorithm to multistate Markov decision processes, with asymptotic convergence properties as strong as those from Q-learning. We report the empirical performance of the bias-corrected Q-learning algorithm with multistate extension in two model problems: A multiarmed bandit version of Roulette and an electricity storage control simulation. The bias-corrected Q-learning algorithm with multistate extension is shown to control max-operator bias effectively, where the bias-resistance can be tuned predictably by adjusting a correction parameter.

Bias-Corrected Q-Learning With Multistate Extension

Journal

IEEE TRANSACTIONS ON AUTOMATIC CONTROL

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Bias-Corrected Q-Learning With Multistate Extension

Journal

IEEE TRANSACTIONS ON AUTOMATIC CONTROL

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper