Article

Some advances on constrained Markov decision processes in Borel spaces with random state-dependent discount factors

OPTIMIZATION (2022)

Journal

OPTIMIZATION

Volume -, Issue -, Pages -

Publisher

TAYLOR & FRANCIS LTD

DOI: 10.1080/02331934.2022.2130699

Keywords

Markov decision processes; random non-constant discount factor; constrained control problems; convex programming; Pareto optimality

Categories

Operations Research & Management Science; Mathematics, Applied

Funding

Consejo Nacional de Ciencia y Tecnologia (CONACYT)-Mexico [Ciencia Frontera 2019-87787, PRODEP-2021, CA-38]

Abstract

This paper studies a class of discrete-time Markov decision processes with cost constraints. It proves the existence of optimal control policies and characterizes them under certain optimality criteria by reformulating the problem on a space of occupation measures and as a convex program.

This paper addresses a class of discrete-time Markov decision processes in Borel spaces with a finite number of cost constraints. The constrained control model considers costs of discounted type with state-dependent discount factors which are subject to external disturbances. Our objective is to prove the existence of optimal control policies and characterize them according to certain optimality criteria. Specifically, by appropriately rewriting our original constrained problem as a new one on a space of occupation measures, we apply the direct method to show solvability. Next, the problem is formulated as a convex program, and we prove that the existence of a saddle point of the associated Lagrangian operator is equivalent to the existence of an optimal control policy for the constrained problem. Finally, we turn our attention to multi-objective optimization problems, where the existence of Pareto optimal policies can be obtained from the existence of saddle points of the aforementioned Lagrangian or, equivalently, from the existence of optimal control policies of constrained problems.
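
For orientation, the objects named in the abstract can be sketched in generic notation. Everything below is an assumption made for illustration and is not taken from the paper: the symbols $x_t$, $a_t$, $\xi_t$ (state, action, disturbance), the discount function $\alpha$, the costs $c_0,\dots,c_q$, the bounds $\theta_i$, and the exact arguments of $\alpha$ may all differ from the authors' formulation.

```latex
% Illustrative notation only (assumed, not the paper's):
% discounted cost with a random, state-dependent discount factor
\[
  V_i(\pi) \;=\; \mathbb{E}^{\pi}\!\left[\sum_{t=0}^{\infty} \Gamma_t\, c_i(x_t,a_t)\right],
  \qquad
  \Gamma_0 = 1,\quad
  \Gamma_t = \prod_{k=0}^{t-1} \alpha(x_k,\xi_k),
  \qquad i = 0,1,\dots,q .
\]
% constrained problem with finitely many cost constraints
\[
  \min_{\pi}\; V_0(\pi)
  \quad\text{subject to}\quad
  V_i(\pi) \le \theta_i,\qquad i = 1,\dots,q .
\]
% Lagrangian and saddle-point condition, with multipliers \lambda \ge 0
\[
  L(\pi,\lambda) \;=\; V_0(\pi) + \sum_{i=1}^{q} \lambda_i\bigl(V_i(\pi) - \theta_i\bigr),
  \qquad
  L(\pi^{*},\lambda) \;\le\; L(\pi^{*},\lambda^{*}) \;\le\; L(\pi,\lambda^{*})
  \quad\text{for all } \pi \text{ and all } \lambda \ge 0 .
\]
```

In this generic form, a pair $(\pi^{*},\lambda^{*})$ satisfying the two inequalities is a saddle point of the Lagrangian; the abstract states that the existence of such a point is equivalent to the existence of an optimal policy for the constrained problem.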
