Article

LLR: Learning learning rates by LSTM for training neural networks

Journal

NEUROCOMPUTING
Volume 394, Issue -, Pages 41-50

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2020.01.106

Keywords

Neural networks; Learning rates; LSTM; Optimization; Gradients

Funding

  1. National Natural Science Foundation of China [61772124]


In the training of deep neural networks, the learning rate plays an important role in whether training converges and how quickly it does so. To ensure convergence, most existing optimization methods adopt a hand-designed, multi-stage decaying small learning rate. However, this approach converges slowly, especially in the early stages of training. A learning-rate adjustment strategy that adapts automatically and reduces the loss more quickly would therefore benefit the training of deep models. In this paper, a dynamic learning-rate adjustment strategy is developed based on the Long Short-Term Memory (LSTM) model and the gradients of the loss function. The method exploits the LSTM's ability to treat the multi-step learning-rate schedule as a whole, generating the learning rate for the current step from the memory of previous learning rates. Three datasets and four architectures are used in the experiments. We applied the learning-rate adjustment method to various optimization methods and achieved good results: our method reaches a smaller loss under the same number of iterations. (C) 2020 Elsevier B.V. All rights reserved.
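To make the idea concrete, below is a minimal sketch, not the authors' exact implementation, of an LSTM controller that keeps a hidden state across training steps and maps simple gradient/loss statistics to a scalar learning rate, which is then applied to plain SGD updates. All names (LRController, train_with_llr, the choice of input features, max_lr) are illustrative assumptions; in the paper the controller would additionally be trained on a multi-step unrolled loss rather than used as-is.

```python
# Illustrative sketch only: an LSTM that emits the learning rate for the
# current step from its memory of earlier steps. Names and features are
# assumptions, not taken from the paper.
import torch
import torch.nn as nn

class LRController(nn.Module):
    """LSTM cell whose hidden state carries memory of previous learning
    rates / gradient statistics; emits the learning rate for this step."""
    def __init__(self, hidden_size=20, max_lr=0.1):
        super().__init__()
        self.lstm = nn.LSTMCell(input_size=2, hidden_size=hidden_size)
        self.head = nn.Linear(hidden_size, 1)
        self.max_lr = max_lr
        self.state = None  # (h, c), carried across training steps

    def forward(self, grad_norm, loss_value):
        # Log-scaled features keep the input range manageable.
        feat = torch.log1p(torch.stack([grad_norm, loss_value])).unsqueeze(0)
        h, c = self.lstm(feat, self.state)
        self.state = (h, c)
        # Sigmoid bounds the proposed learning rate to (0, max_lr).
        return self.max_lr * torch.sigmoid(self.head(h)).squeeze()

def train_with_llr(model, loss_fn, data_loader, controller, epochs=1):
    """Apply the controller's per-step learning rate to plain SGD updates."""
    for _ in range(epochs):
        for x, y in data_loader:
            loss = loss_fn(model(x), y)
            params = list(model.parameters())
            grads = torch.autograd.grad(loss, params)
            grad_norm = torch.sqrt(sum(g.pow(2).sum() for g in grads))
            with torch.no_grad():
                lr = controller(grad_norm.detach(), loss.detach())
                for p, g in zip(params, grads):
                    p.sub_(lr * g)  # SGD step with the generated learning rate
```

The same controller could in principle wrap other base optimizers (as the abstract describes applying the method to various optimization methods) by scaling their step size with the LSTM's output instead of the raw gradient step shown here.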
