4.6 Article

Higher order derivatives of quantum neural networks with barren plateaus

期刊

QUANTUM SCIENCE AND TECHNOLOGY
卷 6, 期 3, 页码 -

出版社

IOP PUBLISHING LTD
DOI: 10.1088/2058-9565/abf51a

关键词

barren; plateau; Hessian; variational; algorithm

资金

  1. Los Alamos National Laboratory [20180628ECR]
  2. Center for Nonlinear Studies at LANL
  3. LANL ASC Beyond Moore's Law project
  4. U.S. Department of Energy (DOE), Office of Science, Office of Advanced Scientific Computing Research, under the Accelerated Research in Quantum Computing (ARQC) program

向作者/读者索取更多资源

The phenomenon of barren plateaus in quantum neural networks causes exponential suppression of elements in the Hessian matrix, making estimation at this situation require exponential precision with system size n. This indicates that Hessian-based approaches do not overcome the exponential scaling associated with barren plateaus. Additionally, higher order derivatives are also exponentially suppressed, impacting optimization strategies beyond first-order gradient descent.
Quantum neural networks (QNNs) offer a powerful paradigm for programming near-term quantum computers and have the potential to speed up applications ranging from data science to chemistry to materials science. However, a possible obstacle to realizing that speed-up is the barren plateau (BP) phenomenon, whereby the gradient vanishes exponentially in the system size n for certain QNN architectures. The question of whether high-order derivative information such as the Hessian could help escape a BP was recently posed in the literature. Here we show that the elements of the Hessian are exponentially suppressed in a BP, so estimating the Hessian in this situation would require a precision that scales exponentially with n. Hence, Hessian-based approaches do not circumvent the exponential scaling associated with BPs. We also show the exponential suppression of higher order derivatives. Hence, BPs will impact optimization strategies that go beyond (first-order) gradient descent. In deriving our results, we prove novel, general formulas that can be used to analytically evaluate any high-order partial derivative on quantum hardware. These formulas will likely have independent interest and use for training QNNs (outside of the context of BPs).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据