☆ 4.2 Article

Ambiguous partially observable Markov decision processes: Structural results and applications

JOURNAL OF ECONOMIC THEORY (2018)

期刊

JOURNAL OF ECONOMIC THEORY

卷 178, 期 -, 页码 1-35

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

DOI: 10.1016/j.jet.2018.08.006

关键词

POMDP; Unknown probabilities; Model ambiguity; Structural results; Control-limit policies

类别

Economics

资金

National Science Foundation [CMMI-1562645]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Markov Decision Processes (MDPs) have been widely used as invaluable tools in dynamic decision making, which is a central concern for economic agents operating at both the micro and macro levels. Often the decision maker's information about the state is incomplete; hence, the generalization to Partially Observable MDPs (POMDPs). Unfortunately, POMDPs may require a large state and/or action space, creating the well-known curse of dimensionality. However, recent computational contributions and blindingly fast computers have helped to dispel this curse. This paper introduces and addresses a second curse termed curse of ambiguity, which refers to the fact that the exact transition probabilities are often hard to quantify, and are rather ambiguous. For instance, for a monetary authority concerned with dynamically setting the inflation rate so as to control the unemployment, the dynamics of unemployment rate under any given inflation rate is often ambiguous. Similarly, in worker-job matching, the dynamics of worker-job match/proficiency level is typically ambiguous. This paper addresses the curse of ambiguity by developing a generalization of POMDPs termed Ambiguous POMDPs (APOMDPs), which not only allows the decision maker to take into account imperfect state information, but also tackles the inevitable ambiguity with respect to the correct probabilistic model of transitions. Importantly, this paper extends various structural results from POMDPs to APOMDPs. These results enable the decision maker to make robust decisions. Robustness is achieved by using alpha-maximin expected utility (alpha-MEU), which (a) differentiates between ambiguity and ambiguity attitude, (b) avoids the over conservativeness of traditional maximin approaches, and (c) is found to be suitable in laboratory experiments in various choice behaviors including those in portfolio selection. The structural results provided also help to handle the curse of dimensionality, since they significantly simplify the search for an optimal policy. The analysis also identifies a performance guarantee for the proposed approach by developing a bound for its maximum reward loss due to model ambiguity. (C) 2018 Elsevier Inc. All rights reserved.

Ambiguous partially observable Markov decision processes: Structural results and applications

期刊

JOURNAL OF ECONOMIC THEORY

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Ambiguous partially observable Markov decision processes: Structural results and applications

期刊

JOURNAL OF ECONOMIC THEORY

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文