RISK-SENSITIVE AVERAGE OPTIMALITY FOR DISCRETE-TIME MARKOV DECISION PROCESSES

SIAM JOURNAL ON CONTROL AND OPTIMIZATION (2023)

Journal

SIAM JOURNAL ON CONTROL AND OPTIMIZATION

Volume 61, Issue 1, Pages 72-104

Publisher

SIAM PUBLICATIONS

DOI: 10.1137/22M1476757

Keywords

Markov decision processes; risk-sensitive average cost criterion; optimal policies; policy iteration algorithm

Categories

Automation & Control Systems; Mathematics, Applied

Abstract

In this paper we study risk-sensitive average optimality for discrete-time Markov decision processes with denumerable states and unbounded costs. We derive the multiplicative Poisson equation under suitable ergodicity conditions via an approximation method. Moreover, we prove the existence of a unique solution to the risk-sensitive average cost optimality equation and give an equivalent characterization of the set of all optimal stationary policies. Finally, we present a policy iteration algorithm and show its convergence.
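
To make the objects named in the abstract concrete, the following is a minimal sketch of risk-sensitive policy iteration in a much simpler setting than the paper's. It is not the authors' algorithm. It assumes a finite state space, bounded costs, strictly positive transition probabilities (so every stationary policy is irreducible), and a risk-sensitivity parameter fixed at 1. Under those assumptions the multiplicative Poisson equation for a policy f, exp(lam) * h(x) = exp(c(x, f(x))) * sum_y P(y | x, f(x)) * h(y), reduces to a Perron eigenvalue problem. The names `evaluate`, `policy_iteration`, `P`, and `c`, and the numbers in the example, are hypothetical.

```python
import numpy as np

def evaluate(P, c, policy):
    """Solve the multiplicative Poisson equation for a fixed stationary policy.

    P[a, x, y] is the transition probability, c[x, a] the one-step cost.
    Returns (lam, h) with exp(lam) * h = Q @ h, where
    Q[x, y] = exp(c[x, policy[x]]) * P[policy[x], x, y].
    """
    idx = np.arange(len(policy))
    Q = np.exp(c[idx, policy])[:, None] * P[policy, idx, :]
    eigvals, eigvecs = np.linalg.eig(Q)
    k = np.argmax(eigvals.real)        # Perron root has the largest real part
    h = np.abs(eigvecs[:, k].real)     # Perron vector, made positive
    return np.log(eigvals[k].real), h / h[0]

def policy_iteration(P, c, max_iter=100):
    """Alternate policy evaluation and greedy improvement until stable."""
    n = c.shape[0]
    idx = np.arange(n)
    policy = np.zeros(n, dtype=int)
    for _ in range(max_iter):
        lam, h = evaluate(P, c, policy)
        # scores[x, a] = exp(c[x, a]) * sum_y P[a, x, y] * h[y]
        scores = np.exp(c) * (P @ h).T
        current = scores[idx, policy]
        # Switch only on strict improvement, so ties keep the current action.
        improve = scores.min(axis=1) < current * (1 - 1e-12)
        new_policy = policy.copy()
        new_policy[improve] = scores.argmin(axis=1)[improve]
        if np.array_equal(new_policy, policy):
            return policy, lam, h
        policy = new_policy
    lam, h = evaluate(P, c, policy)
    return policy, lam, h

# Hypothetical two-state, two-action example (assumed data, not from the paper).
P = np.array([
    [[0.9, 0.1], [0.4, 0.6]],   # transitions under action 0
    [[0.5, 0.5], [0.8, 0.2]],   # transitions under action 1
])
c = np.array([[1.0, 1.5], [3.0, 2.0]])  # c[x, a]
policy, lam, h = policy_iteration(P, c)
print(policy, lam)
```

In this sketch `lam` plays the role of the optimal risk-sensitive average cost and `h` that of the solution to the optimality equation. The denumerable-state, unbounded-cost setting treated in the paper needs the ergodicity conditions and approximation argument described in the abstract, which this finite example does not attempt to reproduce.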
