4.7 Article

Single-objective and multi-objective optimization for variance counterbalancing in stochastic learning

Journal

APPLIED SOFT COMPUTING
Volume 142, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.asoc.2023.110331

Keywords

Variance counterbalancing; Stochastic learning; Neural networks; Single -objective optimization; Multi -objective optimization

Ask authors/readers for more resources

Artificial neural networks have become increasingly important in science and engineering, as they have proven to be useful in demanding applications. The challenge lies in training neural networks for large-scale problems using the stochastic gradient descent method, which requires diminishing step sizes. Variance counterbalancing was proposed as a remedy for this issue by minimizing the error variance along with the mean squared error. This approach promotes the use of advanced optimization algorithms instead of gradient descent.
Artificial neural networks have proved to be useful in a host of demanding applications, there-fore becoming increasingly important in science and engineering. Large-scale problems constitute a challenging task for training neural networks using the stochastic gradient descent method and variations, which are based on the random selection of mini-batches of training points at every iteration. The challenge lies on the mandatory use of diminishing search step sizes in order to retain mild error fluctuations throughout the training set preserving so the quality of the network's generalization capability. Variance counterbalancing was recently proposed as a remedy for addressing the diminishing step sizes in neural network training using stochastic gradient methods. It is based on the concurrent minimization of the average mean squared error of the network along with the error variance over random sets of mini-batches. Also, it promotes the use of advanced optimization algorithms instead of the slowly convergent gradient descent. The present work aims at enriching our understanding of the original variance counterbalancing approach, as well as reformulating it as a multi-objective problem by taking advantage of its bi-objective nature. Experimental analysis reveals the performances of the studied approaches and their competitive edge over the established Adam method.& COPY; 2023 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available