4.3 Article

DPReLU: Dynamic Parametric Rectified Linear Unit and Its Proper Weight Initialization Method

Publisher

Springer Nature
DOI: 10.1007/s44196-023-00186-w

Keywords

Activation function; Deep learning; Deep neural network; ReLU; Weight initialization

Abstract

Activation functions are essential in deep learning, and the rectified linear unit (ReLU) is the most widely used activation function for mitigating the vanishing gradient problem. However, owing to the dying ReLU problem and the bias shift effect, deep learning models using ReLU cannot exploit the potential benefits of negative values. Numerous ReLU variants have been proposed to address this issue. In this study, we propose Dynamic Parametric ReLU (DPReLU), which can dynamically control the overall functional shape of ReLU with four learnable parameters. The parameters of DPReLU are determined by training rather than set by hand, making the formulation more suitable and flexible for each model and dataset. Furthermore, we propose an appropriate and robust weight initialization method for DPReLU. To evaluate DPReLU and its weight initialization method, we performed two experiments on various image datasets: one using an autoencoder for image generation and the other using ResNet50 for image classification. The results show that DPReLU and our weight initialization method provide faster convergence and better accuracy than the original ReLU and previous ReLU variants.
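
The exact DPReLU formula and its four parameters are not reproduced in this abstract, so the sketch below is only a hypothetical illustration of the general idea: a rectifier whose positive-branch slope, negative-branch slope, output scale, and vertical shift are all learnable, so training (rather than a hand-chosen constant) decides the shape of the activation. The class name FourParamRectifier, the parameter names alpha, beta, gamma, delta, and their roles are assumptions made for illustration, not the paper's definition.

    import torch
    import torch.nn as nn

    class FourParamRectifier(nn.Module):
        """Hypothetical ReLU variant with four learnable shape parameters.

        Not the exact DPReLU formulation from the paper; it only illustrates
        letting training control both branches of the activation.
        """

        def __init__(self, alpha=1.0, beta=0.25, gamma=1.0, delta=0.0):
            super().__init__()
            # Assumed roles of the four learnable parameters:
            # alpha: slope of the positive branch
            # beta:  slope of the negative branch (enables negative outputs)
            # gamma: overall output scale
            # delta: vertical shift of the whole function
            self.alpha = nn.Parameter(torch.tensor(float(alpha)))
            self.beta = nn.Parameter(torch.tensor(float(beta)))
            self.gamma = nn.Parameter(torch.tensor(float(gamma)))
            self.delta = nn.Parameter(torch.tensor(float(delta)))

        def forward(self, x):
            pos = torch.clamp(x, min=0.0) * self.alpha   # positive branch
            neg = torch.clamp(x, max=0.0) * self.beta    # negative branch
            return self.gamma * (pos + neg) + self.delta

    # Example: apply the activation to a random batch.
    act = FourParamRectifier()
    y = act(torch.randn(4, 8))

In the same spirit, a slope-aware He-style initialization (again an assumption, not the paper's actual derivation) keeps activation variance roughly constant when the negative branch is non-zero; PyTorch's built-in Kaiming initializer already accepts a negative-slope argument for this purpose:

    def init_linear_for_rectifier(layer: nn.Linear, neg_slope: float = 0.25):
        # He-style initialization adjusted for a non-zero negative slope,
        # analogous in spirit (not identical) to the paper's proposed scheme.
        nn.init.kaiming_normal_(layer.weight, a=neg_slope, nonlinearity='leaky_relu')
        nn.init.zeros_(layer.bias)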


Reviews

Overall rating: 4.3 (insufficient ratings). No sub-scores are available yet for novelty, significance, or scientific rigor, and there are no recommendations.