
Q-learning with heterogeneous update strategy

Related references

Note: Only some of the references are listed.
Article Computer Science, Artificial Intelligence

Sub-AVG: Overestimation reduction for cooperative multi-agent reinforcement learning

Haolin Wu et al.

Summary: Decomposing the centralized joint action value into per-agent individual action values is attractive in cooperative multi-agent reinforcement learning. However, such Q-learning-based methods suffer from overestimation. This paper presents a solution called Sub-AVG, which eliminates excessive overestimation errors by using a lower update target.

NEUROCOMPUTING (2022)

Article Mathematics, Interdisciplinary Applications

Markov Chain Monte Carlo in Practice

Galin L. Jones et al.

Summary: This article reviews methods for assessing the reliability of Markov chain Monte Carlo (MCMC) simulation results, with a focus on those most useful in practical settings.

ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION (2022)

Article Energy & Fuels

Energy saving evaluation of an energy efficient data center using a model-free reinforcement learning approach

Muhammad Haiqal Bin Mahbod et al.

Summary: In this study, a hybrid data center model is built using a deep reinforcement learning algorithm to reduce energy consumption in tropical climates. The results show that using a floating setpoint and lowering the temperature value can achieve significant energy savings, with the reduction of server fan usage being the main contributor.

APPLIED ENERGY (2022)

Article Computer Science, Information Systems

A Swapping Target Q-Value Technique for Data Augmentation in Offline Reinforcement Learning

Ho-Taek Joo et al.

Summary: This study introduces a novel data augmentation technique called Swapping Target Q-Value (SQV) to enhance offline RL algorithms and improve pixel-based learning. The technique matches the Q-values of transformed images with the target Q-values of the original images, treating similar states as the same and different states as more distinct. Performance is reported to increase significantly in the Atari 2600 game domain.

IEEE ACCESS (2022)

Article Engineering, Civil

Multiagent Reinforcement Learning-Based Taxi Predispatching Model to Balance Taxi Supply and Demand

Yongjian Yang et al.

JOURNAL OF ADVANCED TRANSPORTATION (2020)

Article Automation & Control Systems

Bias-Corrected Q-Learning With Multistate Extension

Donghun Lee et al.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2019)

Article Computer Science, Information Systems

Cooperative Deep Q-Learning With Q-Value Transfer for Multi-Intersection Signal Control

Hongwei Ge et al.

IEEE ACCESS (2019)

Article Multidisciplinary Sciences

Human-level control through deep reinforcement learning

Volodymyr Mnih et al.

NATURE (2015)

Article Mathematics

Some Reverses of the Jensen Inequality with Applications

S. S. Dragomir

BULLETIN OF THE AUSTRALIAN MATHEMATICAL SOCIETY (2013)