Proceedings Paper

A Discriminant Information Approach to Deep Neural Network Pruning


Network pruning is essential for accelerating deep neural networks on mobile and edge devices. The proposed channel pruning method introduces a feature-map-based Discriminant Information (DI) criterion to accurately quantify channel importance. Using a greedy pruning algorithm and a structure distillation technique, the method automatically selects a pruned structure that meets a given resource constraint. Extensive experiments demonstrate a significant FLOPs reduction with no accuracy loss on the ImageNet dataset.
Network pruning has become the de facto tool for accelerating deep neural networks for mobile and edge applications. Recently, channel pruning based on feature-map discriminants has shown promising results, as it aligns well with the CNN's objective of differentiating multiple classes and offers better interpretability of pruning decisions. However, existing discriminant-based methods suffer from computational inefficiency, since there is a lack of theoretical guidance on how to quantify the discriminant power of a feature map. In this paper, we develop a mathematical formulation that accurately and efficiently quantifies feature-map discriminativeness, which gives rise to a novel criterion, Discriminant Information (DI). We analyze a theoretical property of DI, specifically its non-decreasing property, that makes DI a valid channel selection criterion. By measuring the differential discriminant, we can identify and remove the channels with the least influence on the discriminant power. The versatility of the DI criterion also enables intra-layer mixed-precision quantization to further compress the network. Moreover, we propose a DI-based greedy pruning algorithm and a structure distillation technique to automatically decide the pruned structure that satisfies a given resource budget, which is a common requirement in practice. Extensive experiments demonstrate the effectiveness of our method: our pruned ResNet50 achieves a 44% FLOPs reduction on ImageNet without any Top-1 accuracy loss compared to the unpruned model.
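
The discriminant-driven channel selection described in the abstract can be illustrated with a small sketch. The code below is an assumption-laden approximation, not the paper's implementation: it scores each channel of a layer's pooled feature maps with a simple between-class to total scatter ratio (a standard discriminant-analysis surrogate for DI) and keeps the highest-scoring channels up to an assumed keep ratio. The names channel_discriminant_scores and greedy_prune and the keep_ratio parameter are hypothetical, and the one-shot ranking stands in for the paper's iterative differential-discriminant procedure and resource-budget search.

import numpy as np

def channel_discriminant_scores(features, labels, ridge=1e-3):
    # features: (N, C) pooled activations; labels: (N,) integer class ids.
    # Per-channel ratio of between-class scatter to regularized total scatter,
    # an illustrative surrogate for the paper's DI criterion.
    overall_mean = features.mean(axis=0)
    total_scatter = ((features - overall_mean) ** 2).sum(axis=0)
    between_scatter = np.zeros(features.shape[1])
    for cls in np.unique(labels):
        mask = labels == cls
        class_mean = features[mask].mean(axis=0)
        between_scatter += mask.sum() * (class_mean - overall_mean) ** 2
    return between_scatter / (total_scatter + ridge)

def greedy_prune(features, labels, keep_ratio=0.5):
    # Retain the channels whose removal would cost the most discriminant score.
    scores = channel_discriminant_scores(features, labels)
    n_keep = max(1, int(round(keep_ratio * scores.size)))
    keep = np.argsort(scores)[::-1][:n_keep]
    return np.sort(keep)  # indices of channels to retain

# Toy usage: 200 samples, 16 channels, 4 classes; only the first 4 channels
# carry class information, so they should be the ones retained.
rng = np.random.default_rng(0)
labels = rng.integers(0, 4, size=200)
feats = rng.normal(size=(200, 16))
feats[:, :4] += labels[:, None]
print(greedy_prune(feats, labels, keep_ratio=0.25))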
