4.7 Article

Riemannian gradient methods for stochastic composition problems

期刊

NEURAL NETWORKS
卷 153, 期 -, 页码 224-234

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.neunet.2022.06.004

关键词

Riemannian manifold; Composition optimization; Deep neural networks; Stiefel manifold; Principal component analysis; Grassmann manifold

资金

  1. NSFC, China [61806093]

向作者/读者索取更多资源

The paper investigates novel stochastic composition optimization problems over Riemannian manifolds, proposing RCG and M-RCG algorithms with different sample complexities for solving these problems. Extensive numerical experiments demonstrate the effectiveness of these algorithms in training DNNs and learning PCA. This study represents the first exploration of composition optimization problems over Riemannian manifolds.
In the paper, we study a class of novel stochastic composition optimization problems over Riemannian manifold, which have been raised by multiple emerging machine learning applications such as distributionally robust learning in Riemannian manifold setting. To solve these composition problems, we propose an effective Riemannian compositional gradient (RCG) algorithm, which has a sample complexity of O(epsilon(-4)) for finding an epsilon-stationary point. To further reduce sample complexity, we propose an accelerated momentum-based Riemannian compositional gradient (M-RCG) algorithm. Moreover, we prove that the M-RCG obtains a lower sample complexity of O(epsilon(-3)) without large batches, which achieves the best known sample complexity for its Euclidean counterparts. Extensive numerical experiments on training deep neural networks (DNNs) over Stiefel manifold and learning principal component analysis (PCA) over Grassmann manifold demonstrate effectiveness of our proposed algorithms. To the best of our knowledge, this is the first study of the composition optimization problems over Riemannian manifold. (C) 2022 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据