☆ 4.6 Article Proceedings Paper

Theoretical analysis of batch and on-line training for gradient descent learning in neural networks

NEUROCOMPUTING (2009)

Journal

NEUROCOMPUTING

Volume 73, Issue 1-3, Pages 151-159

Publisher

ELSEVIER

DOI: 10.1016/j.neucom.2009.05.017

Keywords

Neural networks; Gradient descent learning; Batch training; On-line training; Quadratic loss functions; Convergence

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

In this study, we theoretically analyze two essential training schemes for gradient descent learning in neural networks: batch and on-line training. The convergence properties of the two schemes applied to quadratic loss functions are analytically investigated. We quantify the convergence of each training scheme to the optimal weight using the absolute value of the expected difference (Measure I) and the expected squared difference (Measure 2) between the optimal weight and the weight computed by the scheme. Although on-line training has several advantages over batch training with respect to the first measure, it does not converge to the optimal weight with respect to the second measure if the variance of the per-instance gradient remains constant. However, if the variance decays exponentially, then on-line training converges to the optimal weight with respect to Measure 2. Our analysis reveals the exact degrees to which the training set size, the variance of the per-instance gradient, and the learning rate affect the rate of convergence for each scheme. (C) 2009 Elsevier B.V. All rights reserved.

Theoretical analysis of batch and on-line training for gradient descent learning in neural networks

Journal

NEUROCOMPUTING

Publisher

ELSEVIER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Theoretical analysis of batch and on-line training for gradient descent learning in neural networks

Journal

NEUROCOMPUTING

Publisher

ELSEVIER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper