☆ 3.8 Proceedings Paper

A Distributed Primal-Dual Algorithm for Bandit Online Convex Optimization with Time-Varying Coupled Inequality Constraints

2020 AMERICAN CONTROL CONFERENCE (ACC) (2020)

期刊

2020 AMERICAN CONTROL CONFERENCE (ACC)

卷 -, 期 -, 页码 327-332

出版社

IEEE

DOI: 10.23919/acc45564.2020.9147354

关键词

类别

Automation & Control Systems Engineering, Electrical & Electronic

资金

Knut and Alice Wallenberg Foundation
Swedish Foundation for Strategic Research
Ministry of Education of Republic of Singapore under Grant MoE Tier 1 [RG72/19]
National Natural Science Foundation of China [61991403, 61991404, 61991400]
Swedish Research Council

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

This paper considers distributed bandit online optimization with time-varying coupled inequality constraints. The global cost and the coupled constraint functions are the summations of local convex cost and constraint functions, respectively. The local cost and constraint functions are held privately and only at the end of each period the constraint functions are fully revealed, while only the values of cost functions at queried points are revealed, i.e., in a so called bandit manner. A distributed bandit online primal-dual algorithm with two queries for the cost functions per period is proposed. The performance of the algorithm is evaluated using its expected regret, i.e., the expected difference between the outcome of the algorithm and the optimal choice in hindsight, as well as its constraint violation. We show that O(T-c) expected regret and O(T1-c/2) constraint violation are achieved by the proposed algorithm, where T is the total number of iterations and c is an element of [0.5, 1) is a user-defined trade-off parameter. Assuming Slater's condition, we show that O(root T) expected regret and O(root T) constraint violation are achieved. The theoretical results are illustrated by numerical simulations.

A Distributed Primal-Dual Algorithm for Bandit Online Convex Optimization with Time-Varying Coupled Inequality Constraints

期刊

2020 AMERICAN CONTROL CONFERENCE (ACC)

出版社

IEEE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A Distributed Primal-Dual Algorithm for Bandit Online Convex Optimization with Time-Varying Coupled Inequality Constraints

期刊

2020 AMERICAN CONTROL CONFERENCE (ACC)

出版社

IEEE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文