4.6 Article

A fractional-order momentum optimization approach of deep neural networks

Journal

NEURAL COMPUTING & APPLICATIONS
Volume 34, Issue 9, Pages 7091-7111

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s00521-021-06765-2

Keywords

Deep neural networks; Optimization; Gradient descent; Fractional-order; Residual network; Image classification

Ask authors/readers for more resources

The development of universal and high-efficiency optimization algorithms is crucial for neural networks. Fractional-Order Momentum (FracM), based on fractional-order calculus, is proposed as a method to address the issue of traps in SGDM algorithm and improve training process speed. FracM outperforms other optimizers in terms of classification accuracy on CIFAR10/100 and IMDB datasets.
The development of universal and high-efficiency optimization algorithms is a very important research direction of neural networks. Stochastic Gradient Decent Momentum(SGDM) is one of the most successful optimization algorithms, and easily fall into local extremes minimum. Inspired by the prominent success of Fractional-order Calculus in automatic control, we proposed a method based on Fractional-Order named Fractional-Order Momentum(FracM). As a natural extension of integral calculus, fractional order calculus inherits almost all the characteristics of integral calculus, and have some memorization and nonlocality. FracM performs fractional-order difference of momentum and gradient in SGDM algorithm. FracM can partially solve the problem of traps in the local minimum point and accelerated the train process. The proposed FracM optimization method can compare with the most advanced SGDM and Adam and other advanced optimization algorithm in terms of classification accuracy. The experiments show that FracM outperforms other optimizers on CIFAR10/100 and textual datasets IMDB with transformer-based models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available