4.1 Review

A survey of recent results on continuous-time Markov decision processes

期刊

TOP
卷 14, 期 2, 页码 177-243

出版社

SPRINGER
DOI: 10.1007/BF02837562

关键词

continuous-time Markov decision processes (also known as controlled Markov chains); unbounded reward and transition rates; discounted reward; average reward; bias optimality; sensitive discount criteria

向作者/读者索取更多资源

This paper is a survey of recent results on continuous-time Markov decision processes (MDPs) with unbounded transition rates, and reward rates that may be unbounded from above and from below. These results pertain to discounted and average reward optimality criteria, which are the most commonly used criteria, and also to more selective concepts, such as bias optimality and sensitive discount criteria. For concreteness, we consider only MDPs with a countable state space, but we indicate how the results can be extended to more general MDPs or to Markov games.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.1
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据