☆ 4.5 Article

Hardware Architecture Exploration for Deep Neural Networks

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING (2021)

Journal

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING

Volume 46, Issue 10, Pages 9703-9712

Publisher

SPRINGER HEIDELBERG

DOI: 10.1007/s13369-021-05455-4

Keywords

AI architecture; Neural network architecture; CNN; Design space exploration

Funding

Samsung Electronics Co., Ltd.

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This research developed design space exploration techniques and environments for optimal system design, including computing modules and memories. By modifying SRAM and computing module sizes, designers can efficiently explore the design space and choose the optimal architecture that minimizes cost while meeting performance specifications.

Owing to good performance, deep Convolution Neural Networks (CNNs) are rapidly rising in popularity across a broad range of applications. Since high accuracy CNNs are both computation intensive and memory intensive, many researchers have shown significant interest in the accelerator design. Furthermore, the AI chip market size grows and the competition on the performance, cost, and power consumption of the artificial intelligence SoC designs is increasing. Therefore, it is important to develop design techniques and platforms that are useful for the efficient design of optimized AI architectures to satisfy the given specifications in a short design time. In this research, we have developed design space exploration techniques and environments for the optimal design of the overall system including computing modules and memories. Our current design platform is built using NVIDIA Deep Learning Accelerator as a computing model, SRAM as a buffer, and DRAM with GDDR6 as an off-chip memory. We also developed a program to estimate the processing time of a given neural network. By modifying both the on-chip SRAM size and the computing module size, a designer can explore the design space efficiently, and then choose the optimal architecture which shows the minimal cost while satisfying the performance specification. To illustrate the operation of the design platform, two well-known deep CNNs are used, which are YOLOv3 and faster RCNN. This technology can be used to explore and to optimize the hardware architectures of the CNNs so that the cost can be minimized.

Hardware Architecture Exploration for Deep Neural Networks

Journal

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING

Publisher

SPRINGER HEIDELBERG

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Hardware Architecture Exploration for Deep Neural Networks

Journal

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING

Publisher

SPRINGER HEIDELBERG

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper