☆ 4.7 Article

A differentiable path-following method to compute subgame perfect equilibria in stationary strategies in robust stochastic games and its applications

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2022)

期刊

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH

卷 298, 期 3, 页码 1032-1050

出版社

ELSEVIER

DOI: 10.1016/j.ejor.2021.06.059

关键词

Game theory; Robust stochastic game; Subgame perfect equilibrium in stationary strategies; Logarithmic-barrier differentiable path-following method; Convex-quadratic-penalty differentiable

类别

Management Operations Research & Management Science

资金

Government of Hong Kong SAR [GRF: CityU 11304620]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper presents a globally convergent differentiable path-following method for computing subgame perfect equilibria in robust stochastic games. By incorporating a logarithmic-barrier term and an extra variable, the method solves a convex optimization problem in each state and establishes a polynomial equilibrium system. Numerical comparisons demonstrate the superiority of the logarithmic-barrier method over the convex-quadratic-penalty method.

As an effective paradigm to address uncertainty in payoffs and transition probabilities, robust stochastic games have been formulated in the literature. This paper is concerned with the computation of subgame perfect equilibria in stationary strategies (SSPEs) in robust stochastic games. To tackle this problem, we develop in this paper a globally convergent differentiable path-following method by exploiting the structures of the games. Incorporating a logarithmic-barrier term into each player's payoff function with an extra variable between zero and one, we constitute a logarithmic-barrier robust stochastic game in which each player solves in each state a convex optimization problem. An application of the optimality conditions to the barrier game together with a fixed-point argument yields a polynomial equilibrium system for the barrier game. As a result of this system, we establish the existence of a smooth path that starts from an arbitrary mixed strategy profile and ends at an SSPE as the extra variable descends from one to zero. As an alternative scheme, we make up a convex-quadratic-penalty robust stochastic game and attain a globally convergent convex-quadratic-penalty differentiable path-following method for SSPEs in robust stochastic games. Numerical comparisons show that the logarithmic-barrier path-following method significantly outperforms the convex-quadratic-penalty path-following method. To further evince the value of the proposed methods, we apply the logarithmic-barrier path-following method to solve a supply chain configuration problem and a market entry problem from medical waste recycling. (c) 2021 Elsevier B.V. All rights reserved.

A differentiable path-following method to compute subgame perfect equilibria in stationary strategies in robust stochastic games and its applications

期刊

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A differentiable path-following method to compute subgame perfect equilibria in stationary strategies in robust stochastic games and its applications

期刊

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文