☆ 4.3 Article

Deep Model Compression and Architecture Optimization for Embedded Systems: A Survey

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY (2021)

期刊

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY

卷 93, 期 8, 页码 863-878

出版社

SPRINGER

DOI: 10.1007/s11265-020-01596-1

关键词

Deep learning; Compression; Neural networks; Architecture

类别

Computer Science, Information Systems Engineering, Electrical & Electronic

资金

Auvergne Regional Council
European funds of regional development (FEDER)

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper surveys methods suitable for porting deep neural networks on resource-limited devices, especially for smart cameras, which can be roughly divided into compression techniques and architecture optimization. Compression techniques include knowledge distillation, pruning, quantization, hashing, reduction of numerical precision and binarization, while architecture optimization focuses on enhancing network structures and neural architecture search techniques.

Over the past, deep neural networks have proved to be an essential element for developing intelligent solutions. They have achieved remarkable performances at a cost of deeper layers and millions of parameters. Therefore utilising these networks on limited resource platforms for smart cameras is a challenging task. In this context, models need to be (i) accelerated and (ii) memory efficient without significantly compromising on performance. Numerous works have been done to obtain smaller, faster and accurate models. This paper presents a survey of methods suitable for porting deep neural networks on resource-limited devices, especially for smart cameras. These methods can be roughly divided in two main sections. In the first part, we present compression techniques. These techniques are categorized into: knowledge distillation, pruning, quantization, hashing, reduction of numerical precision and binarization. In the second part, we focus on architecture optimization. We introduce the methods to enhance networks structures as well as neural architecture search techniques. In each of their parts, we describe different methods, and analyse them. Finally, we conclude this paper with a discussion on these methods.

Deep Model Compression and Architecture Optimization for Embedded Systems: A Survey

期刊

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Deep Model Compression and Architecture Optimization for Embedded Systems: A Survey

期刊

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文