TY - GEN
T1 - Design of 2D Systolic Array Accelerator for Quantized Convolutional Neural Networks
AU - Liu, Chia Ning
AU - Lai, Yu An
AU - Kuo, Chih Hung
AU - Zhan, Shi An
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/4/19
Y1 - 2021/4/19
N2 - Quantization techniques have been studied to reduce the computing and memory requirement of deep neural networks. The full precision floating-point numbers are quantized into integer representation with lower bit-width. In this work, we quantize both activations and weights in CNN to 8-bit integers and apply our quantization method to the hardware accelerator. The accelerator is designed with a systolic-based structure, which can support both the convolutional layers and the fully-connected layers for various network models. By the proposed quantization scheme, there is only 1.68% mAP loss on YOLOv3-tiny model compared to the floating-point model. Benchmarked with AlexNet and VGG-16, the external memory access of convolutional layers is reduced by 1.63x and 1.79x compared with Eyeriss, and the internal memory access is also reduced by 7.31x and 17.48x.
AB - Quantization techniques have been studied to reduce the computing and memory requirement of deep neural networks. The full precision floating-point numbers are quantized into integer representation with lower bit-width. In this work, we quantize both activations and weights in CNN to 8-bit integers and apply our quantization method to the hardware accelerator. The accelerator is designed with a systolic-based structure, which can support both the convolutional layers and the fully-connected layers for various network models. By the proposed quantization scheme, there is only 1.68% mAP loss on YOLOv3-tiny model compared to the floating-point model. Benchmarked with AlexNet and VGG-16, the external memory access of convolutional layers is reduced by 1.63x and 1.79x compared with Eyeriss, and the internal memory access is also reduced by 7.31x and 17.48x.
UR - http://www.scopus.com/inward/record.url?scp=85106618302&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85106618302&partnerID=8YFLogxK
U2 - 10.1109/VLSI-DAT52063.2021.9427336
DO - 10.1109/VLSI-DAT52063.2021.9427336
M3 - Conference contribution
AN - SCOPUS:85106618302
T3 - 2021 International Symposium on VLSI Design, Automation and Test, VLSI-DAT 2021 - Proceedings
BT - 2021 International Symposium on VLSI Design, Automation and Test, VLSI-DAT 2021 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 International Symposium on VLSI Design, Automation and Test, VLSI-DAT 2021
Y2 - 19 April 2021 through 22 April 2021
ER -