4.6 Article

Compression of deep neural networks: bridging the gap between conventional-based pruning and evolutionary approach

Journal

NEURAL COMPUTING & APPLICATIONS
Volume 34, Issue 19, Pages 16493-16514

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s00521-022-07161-0

Keywords

Deep neural networks; Evolutionary algorithm; Filter pruning; Multiobjective optimization

Funding

  1. Nuclear Power Institute of China
  2. Sichuan University
  3. National Natural Science Foundation of China [62076172]
  4. Key Research and Development Project of Sichuan [2019YFG0494, 2021YFG0027]
  5. National Key Research and Development Project of China [2017YFB0202403]
  6. State Key Program of National Science Foundation of China [61836006]
  7. National Natural Science Fund for Distinguished Young Scholar [61625204]

Ask authors/readers for more resources

In this paper, a novel structured pruning method called CEA-MOP is proposed to address the problem of model compression in convolutional neural networks. By utilizing conventional pruning methods for the evolutionary process, CEA-MOP achieves a delicate balance between pruning rate and model accuracy through a multiobjective optimization evolutionary model.
Recently, many studies have been carried out on model compression to handle the high computational cost and high memory footprint brought by the implementation of deep neural networks. In this paper, model compression of convolutional neural networks is constructed as a multiobjective optimization problem with two conflicting objectives, reducing the model size and improving the performance. A novel structured pruning method called Conventional-based and Evolutionary Approaches Guided Multiobjective Pruning (CEA-MOP) is proposed to address this problem, where the power of conventional pruning methods is effectively exploited for the evolutionary process. A delicate balance in pruning rate and model accuracy has been automated achieved by a multiobjective optimization evolutionary model. First, an ensemble framework integrates pruning metrics to establish a codebook for further evolutionary operations. Then, an efficient coding method is developed to shorten the length of chromosome, thus ensuring its superior scalability. Finally, sensitivity analysis is automatically carried out to determine the upper bound of pruning rate for each layer. Notably, on CIFAR-10, CEA-MOP reduces more than 50% FLOPs on ResNet-110 and improves the relative accuracy. Moreover, on ImageNet, CEA-MOP reduces more than 50% FLOPs on ResNet-101 with negligible top-1 accuracy drop.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available