4.7 Article

Risk-averse policy optimization via risk-neutral policy optimization

Related references

Note: Only part of the references are listed.
Article Computer Science, Theory & Methods

A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments

Sindhu Padakandla

Summary: This article surveys RL methods developed for handling dynamically varying environment models, aiming to help autonomous agents adapt to changing operating conditions. These methods aim to minimize reward loss during learning by the RL agent or finding suitable policies for efficient operation of the underlying system.

ACM COMPUTING SURVEYS (2021)

Article Multidisciplinary Sciences

Mastering the game of Go with deep neural networks and tree search

David Silver et al.

NATURE (2016)

Article Computer Science, Software Engineering

Coordinate descent algorithms

Stephen J. Wright

MATHEMATICAL PROGRAMMING (2015)

Article Multidisciplinary Sciences

Human-level control through deep reinforcement learning

Volodymyr Mnih et al.

NATURE (2015)

Article Operations Research & Management Science

More Risk-Sensitive Markov Decision Processes

Nicole Baeuerle et al.

MATHEMATICS OF OPERATIONS RESEARCH (2014)

Review Robotics

Reinforcement learning in robotics: A survey

Jens Kober et al.

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH (2013)

Article Operations Research & Management Science

Robust Markov Decision Processes

Wolfram Wiesemann et al.

MATHEMATICS OF OPERATIONS RESEARCH (2013)

Article Mathematics, Applied

ON THE CONVERGENCE OF BLOCK COORDINATE DESCENT TYPE METHODS

Amir Beck et al.

SIAM JOURNAL ON OPTIMIZATION (2013)

Article Operations Research & Management Science

Markov Decision Processes with Average-Value-at-Risk criteria

Nicole Baeuerle et al.

MATHEMATICAL METHODS OF OPERATIONS RESEARCH (2011)

Article Computer Science, Artificial Intelligence

Risk-sensitive reinforcement learning

O Mihatsch et al.

MACHINE LEARNING (2002)

Article Operations Research & Management Science

Q-learning for risk-sensitive control

VS Borkar

MATHEMATICS OF OPERATIONS RESEARCH (2002)

Article Operations Research & Management Science

Risk-sensitive optimal control for Markov decision processes with monotone cost

VS Borkar et al.

MATHEMATICS OF OPERATIONS RESEARCH (2002)

Article Computer Science, Artificial Intelligence

Learning to trade via direct reinforcement

J Moody et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS (2001)

Article Operations Research & Management Science

Convergence of a block coordinate descent method for nondifferentiable minimization

P Tseng

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS (2001)