4.6 Article

Distributed online convex optimization with a bandit primal-dual mirror descent push-sum algorithm

期刊

NEUROCOMPUTING
卷 497, 期 -, 页码 204-215

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2022.05.024

关键词

Distributed online convex optimization; Time-varying directed network topology; Time-varying constraint; Bandit feedback

资金

  1. NSFC [62073166, 61673215, 62022042, 61922044]
  2. 333 Project
  3. Priority Academic Program Development of Jiangsu
  4. Key Laboratory of Jiangsu Province
  5. Open Project of the Key Laboratory of Advanced Perception and Intelligent Control of High-end Equipment [GDSC202017]
  6. Shandong Provincial Natural Science Foundation [ZR2021ZD13]

向作者/读者索取更多资源

This article addresses the distributed online convex optimization problem with time-varying constraints for multi-agent networks. It proposes the BDPDMDPS algorithm to optimize cost functions and constraint functions, and measures its operational performance in terms of expected regret and expected constraint violation. The algorithm is proven to have sublinear properties with respect to the total iteration span.
The distributed online convex optimization problem with time-varying constraints for multi-agent networks is addressed in this article. The purpose is to optimize a sequence of time-varying global cost functions defined as the accumulated values of local cost functions, also attempt to meet the requirement of a sequence of time-varying coupled constraint functions which denote the sum of local constraint functions. Cost functions and constraint functions are unknown to agents beforehand. It is supposed that each agent in the network communicates with its neighbours through a uniformly strongly connected sequence of time-varying directed communication topologies. This paper proposes the bandit distributed primal-dual mirror descent push-sum (BDPDMDPS) algorithm constructed by bandit primal-dual, mirror descent and push-sum methods. Operational performance of the presented algorithm is measured by expected regret and expected constraint violation, both of which are proved to be sublinear with respect to the total iteration span T in this paper. Finally, a numerical simulation example is shown, which confirms the results for expected regret and expected constraint violation of BDPDMDPS algorithm. (c) 2022 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据