☆ 4.6 Article

Distributed online convex optimization with a bandit primal-dual mirror descent push-sum algorithm

NEUROCOMPUTING (2022)

期刊

NEUROCOMPUTING

卷 497, 期 -, 页码 204-215

出版社

ELSEVIER

DOI: 10.1016/j.neucom.2022.05.024

关键词

Distributed online convex optimization; Time-varying directed network topology; Time-varying constraint; Bandit feedback

类别

Computer Science, Artificial Intelligence

资金

NSFC [62073166, 61673215, 62022042, 61922044]
333 Project
Priority Academic Program Development of Jiangsu
Key Laboratory of Jiangsu Province
Open Project of the Key Laboratory of Advanced Perception and Intelligent Control of High-end Equipment [GDSC202017]
Shandong Provincial Natural Science Foundation [ZR2021ZD13]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This article addresses the distributed online convex optimization problem with time-varying constraints for multi-agent networks. It proposes the BDPDMDPS algorithm to optimize cost functions and constraint functions, and measures its operational performance in terms of expected regret and expected constraint violation. The algorithm is proven to have sublinear properties with respect to the total iteration span.

The distributed online convex optimization problem with time-varying constraints for multi-agent networks is addressed in this article. The purpose is to optimize a sequence of time-varying global cost functions defined as the accumulated values of local cost functions, also attempt to meet the requirement of a sequence of time-varying coupled constraint functions which denote the sum of local constraint functions. Cost functions and constraint functions are unknown to agents beforehand. It is supposed that each agent in the network communicates with its neighbours through a uniformly strongly connected sequence of time-varying directed communication topologies. This paper proposes the bandit distributed primal-dual mirror descent push-sum (BDPDMDPS) algorithm constructed by bandit primal-dual, mirror descent and push-sum methods. Operational performance of the presented algorithm is measured by expected regret and expected constraint violation, both of which are proved to be sublinear with respect to the total iteration span T in this paper. Finally, a numerical simulation example is shown, which confirms the results for expected regret and expected constraint violation of BDPDMDPS algorithm. (c) 2022 Elsevier B.V. All rights reserved.

Distributed online convex optimization with a bandit primal-dual mirror descent push-sum algorithm

期刊

NEUROCOMPUTING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Distributed online convex optimization with a bandit primal-dual mirror descent push-sum algorithm

期刊

NEUROCOMPUTING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文