4.4 Article

Towards Efficient In-Memory Computing Hardware for Quantized Neural Networks: State-of-the-Art, Open Challenges and Perspectives

Journal

IEEE TRANSACTIONS ON NANOTECHNOLOGY
Volume 22, Issue -, Pages 377-386

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNANO.2023.3293026

Keywords

In-memory computing; quantized neural network; hardware; quantization

Ask authors/readers for more resources

The increasing amount of data processed in the cloud, the development of Internet-of-Things (IoT) applications, and the growing concerns about data privacy are driving the transition from cloud-based to edge-based processing. Limited energy and computational resources on the edge are pushing the move from traditional von Neumann architectures to In-memory Computing (IMC), especially for machine learning and neural network applications. Network compression techniques, such as quantization, are being applied to implement neural networks on limited hardware resources, reducing memory footprint, latency, and energy consumption. This article provides a comprehensive review of IMC-based Quantized Neural Networks (QNN) and connects software-based quantization approaches to IMC hardware implementation, while addressing open challenges, QNN design requirements, recommendations, and perspectives, along with an IMC-based QNN hardware roadmap.
The amount of data processed in the cloud, the development of Internet-of-Things (IoT) applications, and growing data privacy concerns force the transition from cloud-based to edge-based processing. Limited energy and computational resources on edge push the transition from traditional von Neumann architectures to In-memory Computing (IMC), especially for machine learning and neural network applications. Network compression techniques are applied to implement a neural network on limited hardware resources. Quantization is one of the most efficient network compression techniques allowing to reduce the memory footprint, latency, and energy consumption. This article provides a comprehensive review of IMC-based Quantized Neural Networks (QNN) and links software-based quantization approaches to IMC hardware implementation. Moreover, open challenges, QNN design requirements, recommendations, and perspectives along with an IMC-based QNN hardware roadmap are provided.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available