Related references
Note: Only part of the references are listed.
A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments
Sindhu Padakandla
ACM COMPUTING SURVEYS (2021)
Mastering the game of Go with deep neural networks and tree search
David Silver et al.
NATURE (2016)
Coordinate descent algorithms
Stephen J. Wright
MATHEMATICAL PROGRAMMING (2015)
Human-level control through deep reinforcement learning
Volodymyr Mnih et al.
NATURE (2015)
More Risk-Sensitive Markov Decision Processes
Nicole Baeuerle et al.
MATHEMATICS OF OPERATIONS RESEARCH (2014)
Reinforcement learning in robotics: A survey
Jens Kober et al.
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH (2013)
Robust Markov Decision Processes
Wolfram Wiesemann et al.
MATHEMATICS OF OPERATIONS RESEARCH (2013)
On the Convergence of Block Coordinate Descent Type Methods
Amir Beck et al.
SIAM JOURNAL ON OPTIMIZATION (2013)
Markov Decision Processes with Average-Value-at-Risk criteria
Nicole Baeuerle et al.
MATHEMATICAL METHODS OF OPERATIONS RESEARCH (2011)
Robust control of Markov decision processes with uncertain transition matrices
A Nilim et al.
OPERATIONS RESEARCH (2005)
Risk-sensitive reinforcement learning
O Mihatsch et al.
MACHINE LEARNING (2002)
Q-learning for risk-sensitive control
VS Borkar
MATHEMATICS OF OPERATIONS RESEARCH (2002)
Risk-sensitive optimal control for Markov decision processes with monotone cost
VS Borkar et al.
MATHEMATICS OF OPERATIONS RESEARCH (2002)
Learning to trade via direct reinforcement
J Moody et al.
IEEE TRANSACTIONS ON NEURAL NETWORKS (2001)
Convergence of a block coordinate descent method for nondifferentiable minimization
P Tseng
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS (2001)