4.7 Article

A Unified Approach to Dynamic Decision Problems With Asymmetric Information: Nonstrategic Agents

Journal

IEEE TRANSACTIONS ON AUTOMATIC CONTROL
Volume 67, Issue 3, Pages 1105-1119

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TAC.2021.3060835

Keywords

Asymmetric information; decision making; dynamic programming; multiagent systems; stochastic systems

Funding

  1. NSF [CNS-1238962, CCF-1111061]
  2. ARO-MURI [W911NF-13-1-0421]
  3. ARO [W911NF-17-1-0232]

Ask authors/readers for more resources

We study a class of dynamic multi-agent decision problems with asymmetric information and nonstrategic agents, including signaling phenomenon. By introducing the notion of sufficient information, we propose an information state for each agent that is sufficient for decision-making purposes. We generalize the policy-independence property of belief in partially observed Markov decision processes to dynamic multi-agent decision problems.
We study a general class of dynamic multi- agent decision problems with asymmetric information and nonstrategic agents, which include dynamic teams as a special case. When agents are nonstrategic, an agent's strategy is known to the other agents. Nevertheless, the agents' strategy choices and beliefs are interdependent over times, a phenomenon known as signaling. We introduce the notion of sufficient information that effectively compresses the agents' information in a mutually consistent manner. Based on the notion of sufficient information, we propose an information state for each agent that is sufficient for decision-making purposes. We present instances of dynamic multiagent decision problems where we can determine an information state with a time-invariant domain for each agent. Furthermore, we present a generalization of the policy-independence property of belief in partially observed Markov decision processes (POMDP) to dynamic multiagent decision problems. Within the context of dynamic teams with asymmetric information, the proposed set of information states leads to a sequential decomposition that decouples the interdependence between the agents' strategies and beliefs over time and enables us to formulate a dynamic program to determine a globally optimal policy via backward induction.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available