☆ 4.2 Article

Average optimality inequality for continuous-time Markov decision processes in Polish spaces

MATHEMATICAL METHODS OF OPERATIONS RESEARCH (2007)

期刊

MATHEMATICAL METHODS OF OPERATIONS RESEARCH

卷 66, 期 2, 页码 299-313

出版社

SPRINGER HEIDELBERG

DOI: 10.1007/s00186-007-0157-x

关键词

continuous-time Markov decision process; average optimality inequality; general state space; unbounded cost; optimal stationary policy

类别

Operations Research & Management Science Mathematics, Applied

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

In this paper, we study the average optimality for continuous-time controlled jump Markov processes in general state and action spaces. The criterion to be minimized is the average expected costs. Both the transition rates and the cost rates are allowed to be unbounded. We propose another set of conditions under which we first establish one average optimality inequality by using the well-known vanishing discounting factor approach. Then, when the cost (or reward) rates are nonnegative (or nonpositive), from the average optimality inequality we prove the existence of an average optimal stationary policy in all randomized history dependent policies by using the Dynkin formula and the Tauberian theorem. Finally, when the cost (or reward) rates have neither upper nor lower bounds, we also prove the existence of an average optimal policy in all (deterministic) stationary policies by constructing a new cost (or reward) rate.

Average optimality inequality for continuous-time Markov decision processes in Polish spaces

期刊

MATHEMATICAL METHODS OF OPERATIONS RESEARCH

出版社

SPRINGER HEIDELBERG

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Average optimality inequality for continuous-time Markov decision processes in Polish spaces

期刊

MATHEMATICAL METHODS OF OPERATIONS RESEARCH

出版社

SPRINGER HEIDELBERG

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文