☆ 4.3 Article

Variance reduced value iteration and faster algorithms for solving Markov decision processes

NAVAL RESEARCH LOGISTICS (2023)

Related references

Note: Only part of the references are listed.

Proceedings Paper Computer Science, Theory & Methods

Efficient Inverse Maintenance and Faster Algorithms for Linear Programming

Yin Tat Lee et al.

2015 IEEE 56TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (2015)

Add to Collection

Article Operations Research & Management Science

The value iteration algorithm is not strongly polynomial for discounted dynamic programming

Eugene A. Feinberg et al.

OPERATIONS RESEARCH LETTERS (2014)

Add to Collection

Article Computer Science, Hardware & Architecture

Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor

Thomas Dueholm Hansen et al.

JOURNAL OF THE ACM (2013)

Add to Collection

Article Computer Science, Artificial Intelligence

Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model

Mohammad Gheshlaghi Azar et al.

MACHINE LEARNING (2013)

Add to Collection

Article Operations Research & Management Science

The Simplex and Policy-Iteration Methods Are Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate

Yinyu Ye

MATHEMATICS OF OPERATIONS RESEARCH (2011)

Add to Collection

Article Operations Research & Management Science

A new complexity result on solving the Markov decision problem

YY Ye

MATHEMATICS OF OPERATIONS RESEARCH (2005)

Add to Collection

Article Computer Science, Artificial Intelligence

A sparse sampling algorithm for near-optimal planning in large Markov decision processes

M Kearns et al.

MACHINE LEARNING (2002)

Add to Collection

Variance reduced value iteration and faster algorithms for solving Markov decision processes

Related references

Efficient Inverse Maintenance and Faster Algorithms for Linear Programming

The value iteration algorithm is not strongly polynomial for discounted dynamic programming

Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor

Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model

The Simplex and Policy-Iteration Methods Are Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate

A new complexity result on solving the Markov decision problem

A sparse sampling algorithm for near-optimal planning in large Markov decision processes

Export Citation

Share Paper