4.7 Article

A differentiable path-following method to compute subgame perfect equilibria in stationary strategies in robust stochastic games and its applications

期刊

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH
卷 298, 期 3, 页码 1032-1050

出版社

ELSEVIER
DOI: 10.1016/j.ejor.2021.06.059

关键词

Game theory; Robust stochastic game; Subgame perfect equilibrium in stationary strategies; Logarithmic-barrier differentiable path-following method; Convex-quadratic-penalty differentiable

资金

  1. Government of Hong Kong SAR [GRF: CityU 11304620]

向作者/读者索取更多资源

This paper presents a globally convergent differentiable path-following method for computing subgame perfect equilibria in robust stochastic games. By incorporating a logarithmic-barrier term and an extra variable, the method solves a convex optimization problem in each state and establishes a polynomial equilibrium system. Numerical comparisons demonstrate the superiority of the logarithmic-barrier method over the convex-quadratic-penalty method.
As an effective paradigm to address uncertainty in payoffs and transition probabilities, robust stochastic games have been formulated in the literature. This paper is concerned with the computation of subgame perfect equilibria in stationary strategies (SSPEs) in robust stochastic games. To tackle this problem, we develop in this paper a globally convergent differentiable path-following method by exploiting the structures of the games. Incorporating a logarithmic-barrier term into each player's payoff function with an extra variable between zero and one, we constitute a logarithmic-barrier robust stochastic game in which each player solves in each state a convex optimization problem. An application of the optimality conditions to the barrier game together with a fixed-point argument yields a polynomial equilibrium system for the barrier game. As a result of this system, we establish the existence of a smooth path that starts from an arbitrary mixed strategy profile and ends at an SSPE as the extra variable descends from one to zero. As an alternative scheme, we make up a convex-quadratic-penalty robust stochastic game and attain a globally convergent convex-quadratic-penalty differentiable path-following method for SSPEs in robust stochastic games. Numerical comparisons show that the logarithmic-barrier path-following method significantly outperforms the convex-quadratic-penalty path-following method. To further evince the value of the proposed methods, we apply the logarithmic-barrier path-following method to solve a supply chain configuration problem and a market entry problem from medical waste recycling. (c) 2021 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据