Proceedings Paper

On the vanishing and exploding gradient problem in Gated Recurrent Units

Journal

IFAC-PapersOnLine
Volume 53, Issue 2, Pages 1243-1248

Publisher

Elsevier
DOI: 10.1016/j.ifacol.2020.12.1342

Keywords

Nonlinear system identification; Recurrent Neural Networks; Gated Recurrent Units

Abstract

Recurrent Neural Networks are applied in areas such as speech recognition, natural language and video processing, and the identification of nonlinear state space models. Conventional Recurrent Neural Networks, e.g. the Elman network, are hard to train. A more recently developed class of recurrent neural networks, the so-called Gated Units, outperforms its conventional counterparts on virtually every task. This paper aims to provide additional insight into the differences between RNNs and Gated Units in order to explain the superior performance of gated recurrent units. It is argued that Gated Units are easier to optimize not because they solve the vanishing gradient problem, but because they circumvent the emergence of large local gradients. Copyright (C) 2020 The Authors.
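The abstract's central claim concerns how gradients propagate through time in the two architectures. A minimal sketch of such a comparison is given below (not taken from the paper: it assumes PyTorch, and the sequence length and hidden size are arbitrary illustrative choices). It measures how strongly each network's final output still depends on the first input of a long sequence.

# Minimal sketch (assumes PyTorch; sizes are illustrative, not from the paper).
# Compares the long-range gradient |d out_T / d x_0| for an Elman RNN and a GRU.
import torch
import torch.nn as nn

torch.manual_seed(0)
seq_len, hidden_size = 100, 32
x = torch.randn(seq_len, 1, 1, requires_grad=True)  # (time, batch, feature)

for name, net in [("Elman RNN", nn.RNN(1, hidden_size)),
                  ("GRU", nn.GRU(1, hidden_size))]:
    out, _ = net(x)                 # out: (time, batch, hidden)
    out[-1].sum().backward()        # backpropagate from the final output
    grad0 = x.grad[0].abs().item()  # gradient w.r.t. the first input
    print(f"{name}: |d out_T / d x_0| = {grad0:.3e}")
    x.grad = None                   # reset accumulated gradient before next model

With randomly initialized weights, the tanh Elman cell typically lets this long-range gradient shrink much faster with sequence length than the GRU does, which is the kind of gradient-propagation difference the paper analyzes.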
