
AutoRank: Automated Rank Selection for Effective Neural Network Customization

IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS (2021)

Journal

IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS

Volume 11, Issue 4, Pages 611-619

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/JETCAS.2021.3127433

Keywords

Hardware; Tensors; Costs; Runtime; Matrix decomposition; Training; Field programmable gate arrays; Computational and artificial intelligence; neural networks; artificial neural networks

Category

Engineering, Electrical & Electronic

Funding

National Science Foundation (NSF) [2016737]
Semiconductor Research Corporation (SRC) [2899.001]
Intelligence Advanced Research Projects Activity (IARPA) [2018-18022100004]
Intel Private AI Institute
Directorate for Computer and Information Science and Engineering, Division of Computer and Network Systems [2016737] (funding source: National Science Foundation)

Smart Summary

Tensor decomposition is a promising method for implementing low-power, real-time neural network applications on resource-constrained embedded devices. The proposed AutoRank framework customizes neural network decomposition through cross-layer rank selection, incorporating both inference accuracy and platform specifications while minimizing engineering cost. The framework is hardware-aware and delivers high-accuracy decomposed deep neural networks with low execution cost, and it provides an automated API compatible with popular deep learning libraries.

Abstract

Tensor decomposition is a promising approach for low-power and real-time application of neural networks on resource-constrained embedded devices. This paper proposes AutoRank, an end-to-end framework for customizing neural network decomposition using cross-layer rank selection. For many-layer networks, determining the optimal decomposition ranks is a cumbersome task. To overcome this challenge, we establish a state-action-reward system that effectively absorbs inference accuracy and platform specifications into the rank-selection policy. Our proposed framework brings platform characteristics and performance into the customization loop to enable direct incorporation of hardware cost, e.g., runtime and memory footprint. By means of this hardware-awareness, the AutoRank customization engine delivers high-accuracy decomposed deep neural networks with low execution cost. Our framework minimizes the engineering cost associated with rank selection by providing an automated API for AutoRank that is compatible with popular deep learning libraries and can be readily used by developers.
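To make the idea of cross-layer rank selection under a hardware-aware reward more concrete, the following is a minimal sketch and not the AutoRank implementation or its API. It assumes fully connected layers factorized as a product of two rank-r matrices, uses parameter count as a stand-in for hardware cost, and replaces the learned rank-selection policy with an exhaustive search. The names `lowrank_params`, `cost_ratio`, `reward`, `toy_accuracy`, and `select_ranks`, the `penalty` weight, and the accuracy model are all hypothetical.

```python
from itertools import product


def lowrank_params(m, n, r):
    """Parameter count of a rank-r factorization W (m x n) ~= U (m x r) @ V (r x n)."""
    return r * (m + n)


def cost_ratio(layers, ranks):
    """Decomposed parameter count divided by the dense parameter count."""
    dense = sum(m * n for m, n in layers)
    decomposed = sum(lowrank_params(m, n, r) for (m, n), r in zip(layers, ranks))
    return decomposed / dense


def reward(accuracy, cost, penalty=0.5):
    """Hypothetical reward: accuracy minus a weighted cost term."""
    return accuracy - penalty * cost


def toy_accuracy(layers, ranks):
    """Placeholder accuracy model (an assumption, not measured data):
    accuracy rises with the mean fraction of rank retained per layer."""
    fracs = [r / min(m, n) for (m, n), r in zip(layers, ranks)]
    return 0.5 + 0.5 * (sum(fracs) / len(fracs)) ** 0.25


def select_ranks(layers, candidates, penalty=0.5):
    """Jointly pick one rank per layer by exhaustive search over all combinations.
    This brute-force search stands in for a learned state-action-reward policy."""
    return max(
        product(candidates, repeat=len(layers)),
        key=lambda ranks: reward(
            toy_accuracy(layers, ranks), cost_ratio(layers, ranks), penalty
        ),
    )


if __name__ == "__main__":
    layers = [(512, 256), (256, 128)]  # (rows, cols) of each dense weight matrix
    candidates = [8, 16, 32]           # candidate ranks, each <= min(rows, cols)
    best = select_ranks(layers, candidates)
    print("selected ranks:", best)
    print("cost ratio:", cost_ratio(layers, best))
```

In a real setting, `toy_accuracy` would be replaced by evaluating the decomposed network on validation data, and `cost_ratio` by runtime or memory footprint measured on the target platform. Exhaustive search grows exponentially with the number of layers, which is the reason a learned policy is attractive for many-layer networks.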
