TY - GEN
T1 - Sparse Basis Approach for Lightweight AI System Design
AU - Lee, Wei Chieh
AU - Lee, Gwo Giun Chris
AU - Yang, Chu Chun
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - In recent years, the demand for image processing tasks has increasingly been delegated to AI, with Convolutional Neural Network (CNN) being a commonly used model for image processing. The convolution operation within CNN involves extensive computation, leading to amount of time requirements. This paper introduces an optimization algorithm specifically designed for the convolution operation in CNN models. The comparison is made between the conventional convolution method and the proposed sparse basis approach method, evaluating the required number of operations and data storage for each. The experiment utilizes Google's CFU playground platform to establish a VexRiscV CPU operating at a frequency of 200MHz for profiling the sparse basis approach algorithm. This profiling aids in determining whether to do software/hardware partitioning. The algorithm proposed in this paper is applicable to various CNN models, including LeNet, AlexNet, VGG16, VGG19, and others. Furthermore, this paper introduces dataflow analysis for the optimized convolution operation to provide effective reconfigurable support across different CNN models. The importance of dataflow in hardware modeling is discussed, along with a comparison of the impact of different dataflow code implementations on CPU execution. In contrast to traditional behavioral code profiling, profiling dataflow code allows for a more accurate measurement of the intrinsic complexity of the algorithm.
AB - In recent years, the demand for image processing tasks has increasingly been delegated to AI, with Convolutional Neural Network (CNN) being a commonly used model for image processing. The convolution operation within CNN involves extensive computation, leading to amount of time requirements. This paper introduces an optimization algorithm specifically designed for the convolution operation in CNN models. The comparison is made between the conventional convolution method and the proposed sparse basis approach method, evaluating the required number of operations and data storage for each. The experiment utilizes Google's CFU playground platform to establish a VexRiscV CPU operating at a frequency of 200MHz for profiling the sparse basis approach algorithm. This profiling aids in determining whether to do software/hardware partitioning. The algorithm proposed in this paper is applicable to various CNN models, including LeNet, AlexNet, VGG16, VGG19, and others. Furthermore, this paper introduces dataflow analysis for the optimized convolution operation to provide effective reconfigurable support across different CNN models. The importance of dataflow in hardware modeling is discussed, along with a comparison of the impact of different dataflow code implementations on CPU execution. In contrast to traditional behavioral code profiling, profiling dataflow code allows for a more accurate measurement of the intrinsic complexity of the algorithm.
UR - http://www.scopus.com/inward/record.url?scp=85189246211&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85189246211&partnerID=8YFLogxK
U2 - 10.1109/ICEIC61013.2024.10457138
DO - 10.1109/ICEIC61013.2024.10457138
M3 - Conference contribution
AN - SCOPUS:85189246211
T3 - 2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
BT - 2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
Y2 - 28 January 2024 through 31 January 2024
ER -