4.6 Article

Parameter Conjugate Gradient with Secant Equation Based Elman Neural Network and its Convergence Analysis

期刊

ADVANCED THEORY AND SIMULATIONS
卷 5, 期 9, 页码 -

出版社

WILEY-V C H VERLAG GMBH
DOI: 10.1002/adts.202200047

关键词

conjugate gradient; Elman; secant equation; Wolfe condition

资金

  1. Natural Science Basic Research Plan in Shaanxi Province of China
  2. National Science Foundation of China [11771347]
  3. 65th China Postdoctoral Science Foundation [2019M652837]
  4. Natural Science Foundation Guidance Project of Liaoning Province

向作者/读者索取更多资源

This paper presents a novel parametric conjugate gradient method based on the secant equation for training Elman neural network. The theoretical convergence of the algorithm is rigorously proved, and the feasibility and correctness of the method are demonstrated through numerical experiments.
Elman neural network (ENN) is one of the local recursive networks with a feedback mechanism. The parameter conjugate gradient method is a promising alternative to the gradient descent method, due to its faster convergence speed that results from searching for the conjugate descent direction with an adaptive step size (obtained by Wolfe conditions). However, there are still some challenges such as how to avoid the sawtooth phenomenon in gradient algorithms to improve the learning accuracy of the second-order curvature of an objective function. As such, this paper presents a novel parametric conjugate gradient method that is based on the secant equation for training ENN in an effective way. Strict proof of the theoretical convergence of the proposed algorithm is provided in detail. In particular, the weak convergence and strong convergence of the algorithm, as well as the monotonicity of the error function are proved. Except for the theoretical analysis, the three numerical experiments have been conducted by applying the algorithm to three problems of classification, regression, and function approximation on nine real-world datasets. The experimental results have demonstrated the feasibility of the proposed algorithm and the correctness of this theoretical analysis.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据