Related references
A Survey on Aspect-Based Sentiment Classification
Gianni Brauwers et al.
ACM COMPUTING SURVEYS (2023)
Deploying deep learning networks based advanced techniques for image processing on FPGA platform
Refka Ghodhbani et al.
NEURAL COMPUTING & APPLICATIONS (2023)
Model Compression for Deep Neural Networks: A Survey
Zhuo Li et al.
COMPUTERS (2023)
A Survey on Efficient Convolutional Neural Networks and Hardware Acceleration
Deepak Ghimire et al.
ELECTRONICS (2022)
Optimization-Based Post-Training Quantization With Bit-Split and Stitching
Peisong Wang et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)
Quantized Sparse Training: A Unified Trainable Framework for Joint Pruning and Quantization in DNNs
Jun-Hyung Park et al.
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS (2022)
Explicit Model Size Control and Relaxation via Smooth Regularization for Mixed-Precision Quantization
Vladimir Chikin et al.
COMPUTER VISION, ECCV 2022, PT XII (2022)
BASQ: Branch-wise Activation-clipping Search Quantization for Sub-4-bit Neural Networks
Han-Byul Kim et al.
COMPUTER VISION, ECCV 2022, PT XII (2022)
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation
Zechun Liu et al.
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) (2022)
Pruning and quantization for deep neural network acceleration: A survey
Tailin Liang et al.
NEUROCOMPUTING (2021)
Layer Importance Estimation with Imprinting for Neural Network Quantization
Hongyang Liu et al.
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 (2021)
Adaptive Binary-Ternary Quantization
Ryan Razani et al.
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 (2021)
Bi-Real Net: Binarizing Deep Network Towards Real-Network Performance
Zechun Liu et al.
INTERNATIONAL JOURNAL OF COMPUTER VISION (2020)
Grow and Prune Compact, Fast, and Accurate LSTMs
Xiaoliang Dai et al.
IEEE TRANSACTIONS ON COMPUTERS (2020)
Binary neural networks: A survey
Haotong Qin et al.
PATTERN RECOGNITION (2020)
Model Compression and Hardware Acceleration for Neural Networks: A Comprehensive Survey
Lei Deng et al.
PROCEEDINGS OF THE IEEE (2020)
TiM-DNN: Ternary In-Memory Accelerator for Deep Neural Networks
Shubham Jain et al.
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS (2020)
Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks on Mobile Devices
Yu-Hsin Chen et al.
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS (2019)
Efficient Weights Quantization of Convolutional Neural Networks Using Kernel Density Estimation based Non-uniform Quantizer
Sanghyun Seo et al.
APPLIED SCIENCES-BASEL (2019)
BSHIFT: A Low Cost Deep Neural Networks Accelerator
Yong Yu et al.
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING (2019)
Blended coarse gradient descent for full quantization of deep neural networks
Penghang Yin et al.
RESEARCH IN THE MATHEMATICAL SCIENCES (2019)
Model Compression and Acceleration for Deep Neural Networks: The principles, progress, and challenges
Yu Cheng et al.
IEEE SIGNAL PROCESSING MAGAZINE (2018)
Weighted Quantization-Regularization in DNNs for Weight Memory Minimization Toward HW Implementation
Matthias Wess et al.
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS (2018)
Recent advances in efficient computation of deep convolutional neural networks
Jian Cheng et al.
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING (2018)
SpWA: An Efficient Sparse Winograd Convolutional Neural Networks Accelerator on FPGAs
Liqiang Lu et al.
2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC) (2018)
Explicit Loss-Error-Aware Quantization for Low-Bit Deep Neural Networks
Aojun Zhou et al.
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)
BinaryRelax: A Relaxation Approach for Training Deep Neural Networks with Quantized Weights
Penghang Yin et al.
SIAM JOURNAL ON IMAGING SCIENCES (2018)
Balanced Quantization: An Effective and Efficient Approach to Quantized Neural Networks
Shu-Chang Zhou et al.
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY (2017)
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
Vivienne Sze et al.
PROCEEDINGS OF THE IEEE (2017)
Structured Pruning of Deep Convolutional Neural Networks
Sajid Anwar et al.
ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS (2017)
Densely Connected Convolutional Networks
Gao Huang et al.
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He et al.
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)
Deep Learning with Low Precision by Half-wave Gaussian Quantization
Zhaowei Cai et al.
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)
Accelerating Very Deep Convolutional Networks for Classification and Detection
Xiangyu Zhang et al.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2016)
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky et al.
INTERNATIONAL JOURNAL OF COMPUTER VISION (2015)
Channel-Level Acceleration of Deep Face Representations
Adam Polyak et al.
IEEE ACCESS (2015)
ShiDianNao: Shifting Vision Processing Closer to the Sensor
Zidong Du et al.
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA) (2015)
DaDianNao: A Machine-Learning Supercomputer
Yunji Chen et al.
2014 47TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO) (2014)
Deep Neural Networks for Acoustic Modeling in Speech Recognition
Geoffrey Hinton et al.
IEEE SIGNAL PROCESSING MAGAZINE (2012)