4.3 Article

Variance reduced value iteration and faster algorithms for solving Markov decision processes

Related references

Note: Only part of the references are listed.
Proceedings Paper Computer Science, Theory & Methods

Efficient Inverse Maintenance and Faster Algorithms for Linear Programming

Yin Tat Lee et al.

2015 IEEE 56TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (2015)

Article Operations Research & Management Science

The value iteration algorithm is not strongly polynomial for discounted dynamic programming

Eugene A. Feinberg et al.

OPERATIONS RESEARCH LETTERS (2014)

Article Computer Science, Hardware & Architecture

Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor

Thomas Dueholm Hansen et al.

JOURNAL OF THE ACM (2013)

Article Computer Science, Artificial Intelligence

Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model

Mohammad Gheshlaghi Azar et al.

MACHINE LEARNING (2013)

Article Operations Research & Management Science

The Simplex and Policy-Iteration Methods Are Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate

Yinyu Ye

MATHEMATICS OF OPERATIONS RESEARCH (2011)

Article Operations Research & Management Science

A new complexity result on solving the Markov decision problem

YY Ye

MATHEMATICS OF OPERATIONS RESEARCH (2005)

Article Computer Science, Artificial Intelligence

A sparse sampling algorithm for near-optimal planning in large Markov decision processes

M Kearns et al.

MACHINE LEARNING (2002)