Multiobjective Evolutionary Design of Deep Convolutional Neural Networks for Image Classification

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION (2021)

Journal

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION

Volume 25, Issue 2, Pages 277-291

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TEVC.2020.3024708

Keywords

Computer architecture; Optimization; Search problems; Task analysis; Neural networks; Computational modeling; Graphics processing units; Convolutional neural networks (CNNs); evolutionary deep learning; genetic algorithms (GAs); neural architecture search (NAS)

Categories

Computer Science, Artificial Intelligence; Computer Science, Theory & Methods

Funding

National Science Foundation [DBI-0939454]

Summary

This study proposes an evolutionary algorithm for neural architecture search under multiple objectives. It populates a set of architectures through genetic operations to approximate the entire Pareto frontier, and it improves computational efficiency by down-scaling architectures during the search and by reinforcing patterns shared among past successful architectures through Bayesian model learning. The method achieves competitive performance on image classification tasks.

Abstract

Convolutional neural networks (CNNs) are the backbones of deep learning paradigms for numerous vision tasks. Early advancements in CNN architectures were primarily driven by human expertise and by elaborate design processes. Recently, neural architecture search was proposed with the aim of automating the network design process and generating task-dependent architectures. While existing approaches have achieved competitive performance in image classification, they are not well suited to problems where the computational budget is limited, for two reasons: 1) the obtained architectures are optimized either solely for classification performance or for only one deployment scenario; and 2) the search process requires vast computational resources in most approaches. To overcome these limitations, we propose an evolutionary algorithm for searching neural architectures under multiple objectives, such as classification performance and floating point operations (FLOPs). The proposed method addresses the first shortcoming by populating a set of architectures to approximate the entire Pareto frontier through genetic operations that recombine and modify architectural components progressively. Our approach improves computational efficiency by carefully down-scaling the architectures during the search as well as reinforcing the patterns commonly shared among past successful architectures through Bayesian model learning. The integration of these two main contributions allows an efficient design of architectures that are competitive with, and in most cases outperform, both manually and automatically designed architectures on benchmark image classification datasets: CIFAR, ImageNet, and human chest X-ray. The flexibility provided by simultaneously obtaining multiple architecture choices for different compute requirements further differentiates our approach from other methods in the literature.
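To illustrate what approximating a Pareto frontier over classification performance and FLOPs means, here is a minimal Python sketch of non-dominated filtering. It is not the authors' implementation: the candidate names, error rates, and FLOPs values are made up for illustration, and the helpers `dominates` and `pareto_front` are hypothetical. Both objectives are treated as quantities to minimize.

```python
from typing import List, Tuple

# (name, classification error, FLOPs in millions); all values are illustrative only.
Candidate = Tuple[str, float, float]


def dominates(a: Candidate, b: Candidate) -> bool:
    """Return True if a is no worse than b on both objectives and strictly better on one."""
    no_worse = a[1] <= b[1] and a[2] <= b[2]
    strictly_better = a[1] < b[1] or a[2] < b[2]
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep only the candidates that no other candidate dominates."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]


candidates: List[Candidate] = [
    ("net_a", 0.040, 600.0),
    ("net_b", 0.050, 300.0),
    ("net_c", 0.055, 450.0),  # worse than net_b on both objectives
    ("net_d", 0.030, 900.0),
    ("net_e", 0.045, 650.0),  # worse than net_a on both objectives
]

front = pareto_front(candidates)
for name, error, mflops in front:
    print(f"{name}: error={error:.3f}, FLOPs={mflops:.0f}M")
```

Each surviving candidate represents a different trade-off between accuracy and compute, which is the kind of set of architecture choices the abstract describes obtaining in a single search.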
