☆ 4.1 Review

A survey of recent results on continuous-time Markov decision processes

TOP (2006)

期刊

TOP

卷 14, 期 2, 页码 177-243

出版社

SPRINGER

DOI: 10.1007/BF02837562

关键词

continuous-time Markov decision processes (also known as controlled Markov chains); unbounded reward and transition rates; discounted reward; average reward; bias optimality; sensitive discount criteria

类别

Operations Research & Management Science

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

This paper is a survey of recent results on continuous-time Markov decision processes (MDPs) with unbounded transition rates, and reward rates that may be unbounded from above and from below. These results pertain to discounted and average reward optimality criteria, which are the most commonly used criteria, and also to more selective concepts, such as bias optimality and sensitive discount criteria. For concreteness, we consider only MDPs with a countable state space, but we indicate how the results can be extended to more general MDPs or to Markov games.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

主要评分

4.1

评分不足

A survey of recent results on continuous-time Markov decision processes

期刊

TOP

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A survey of recent results on continuous-time Markov decision processes

期刊

TOP

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文