4.6 Article

APRS: Automatic pruning ratio search using Siamese network with layer-level rewards ?

期刊

DIGITAL SIGNAL PROCESSING
卷 133, 期 -, 页码 -

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.dsp.2022.103864

关键词

Structured pruning; Deep reinforcement learning; Pruning ratio search; Siamese network

向作者/读者索取更多资源

In this paper, an Automatic Pruning Ratio Search (APRS) algorithm is proposed to find the layer-wise optimal pruning ratio within the deep reinforcement learning framework. A novel layer-level reward function is designed to address the coarse-granularity reward problem, and a computationally efficient method is used to evaluate the effect of pruning action on each single layer. Experimental results show that the proposed method can better reveal the underlying sparse sensitivities of different layers and achieve higher network accuracy after pruning compared to traditional methods.
Structured pruning is still a mainstream model compression technique, for its merit of easy to implement and no reliance on specific hardware supporting library. In most previous works, the layer-wise channel pruning ratios were determined empirically. In this paper, we propose an Automatic Pruning Ratio Search (APRS) algorithm that can find the layer-wise optimal pruning ratio within the deep reinforcement learning framework. To solve the coarse-granularity reward problem existing in some previous works like AMC and CACP, a novel layer-level reward function is designed based on the Siamese network architecture for the fine-granularity agent-environment interaction purpose. We use a computationally efficient way to evaluate the effect of pruning action on each single layer. The incurred backwardness disadvantage problem has also been analyzed and addressed. The experiments are performed using the VGG-16, and MobileNet-v1 on the CIFAR10/100 and UC Merced Land-use datasets. The results verified that our method can better reveal the underlying sparse sensitivities of different layers in both high redundancy networks and compact networks, so that resulting a higher network accuracy after pruning compared to the traditional methods.(c) 2022 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据