Article

Effective neural network training with adaptive learning rate based on training loss

Journal

NEURAL NETWORKS
Volume 101, Pages 68-78

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.neunet.2018.01.016

Keywords

Multilayer perceptron; Deep learning; Neural network training; Stochastic gradient descent; Learning rate; Beam search

Funding

  1. Global Station for Big Data and CyberSecurity, a project of Global Institution for Collaborative Research and Education at Hokkaido University


A method that uses an adaptive learning rate is presented for training neural networks. Unlike most conventional update schemes, in which the learning rate gradually decreases during training, the proposed method increases or decreases the learning rate adaptively so that the training loss (the sum of cross-entropy losses over all training samples) decreases as much as possible. This provides a wider search range for solutions and thereby a lower test error rate. Experiments training a multilayer perceptron on several well-known datasets show that the proposed method achieves better test accuracy under certain conditions. (c) 2018 Elsevier Ltd. All rights reserved.
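The core idea in the abstract, raising the learning rate while the training loss keeps falling and lowering it when the loss rises, can be sketched as follows. This is not the paper's exact algorithm (which, per the keywords, also involves beam search); the growth/shrink factors `2.0` and `0.5` and the revert-on-increase rule are illustrative assumptions.

```python
# Minimal sketch of loss-driven learning-rate adaptation (assumed details:
# multiplicative factors and undoing a step whose loss increased).

def train(grad_fn, loss_fn, w, lr=0.1, up=2.0, down=0.5, steps=50):
    """Gradient descent whose learning rate grows while the loss keeps
    falling and shrinks (with the step undone) when the loss rises."""
    loss = loss_fn(w)
    for _ in range(steps):
        w_new = w - lr * grad_fn(w)
        new_loss = loss_fn(w_new)
        if new_loss < loss:      # loss fell: accept the step, raise lr
            w, loss = w_new, new_loss
            lr *= up
        else:                    # loss rose: undo the step, lower lr
            lr *= down
    return w, loss

# Usage: minimize f(w) = (w - 3)^2, whose gradient is 2(w - 3).
w_opt, final_loss = train(lambda w: 2 * (w - 3), lambda w: (w - 3) ** 2, w=0.0)
```

In a real training loop, `loss_fn` would be the cross-entropy over the training set and `w` the network parameters; the accept/reject test is what distinguishes this from a fixed decay schedule.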


