Sparse Basis Approach for Lightweight AI System Design

Wei Chieh Lee, Gwo Giun Chris Lee, Chu Chun Yang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In recent years, the demand for image processing tasks has increasingly been delegated to AI, with Convolutional Neural Network (CNN) being a commonly used model for image processing. The convolution operation within CNN involves extensive computation, leading to amount of time requirements. This paper introduces an optimization algorithm specifically designed for the convolution operation in CNN models. The comparison is made between the conventional convolution method and the proposed sparse basis approach method, evaluating the required number of operations and data storage for each. The experiment utilizes Google's CFU playground platform to establish a VexRiscV CPU operating at a frequency of 200MHz for profiling the sparse basis approach algorithm. This profiling aids in determining whether to do software/hardware partitioning. The algorithm proposed in this paper is applicable to various CNN models, including LeNet, AlexNet, VGG16, VGG19, and others. Furthermore, this paper introduces dataflow analysis for the optimized convolution operation to provide effective reconfigurable support across different CNN models. The importance of dataflow in hardware modeling is discussed, along with a comparison of the impact of different dataflow code implementations on CPU execution. In contrast to traditional behavioral code profiling, profiling dataflow code allows for a more accurate measurement of the intrinsic complexity of the algorithm.

Original languageEnglish
Title of host publication2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350371888
DOIs
Publication statusPublished - 2024
Event2024 International Conference on Electronics, Information, and Communication, ICEIC 2024 - Taipei, Taiwan
Duration: 2024 Jan 282024 Jan 31

Publication series

Name2024 International Conference on Electronics, Information, and Communication, ICEIC 2024

Conference

Conference2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
Country/TerritoryTaiwan
CityTaipei
Period24-01-2824-01-31

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications
  • Hardware and Architecture
  • Information Systems
  • Energy Engineering and Power Technology
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Sparse Basis Approach for Lightweight AI System Design'. Together they form a unique fingerprint.

Cite this