Deep Learning Acceleration Design Based on Low Rank Approximation

Yi Hsiang Chang, Gwo Giun Chris Lee, Shiu Yu Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Recently, artificial intelligence applications require large resources for training and inferencing. Therefore, intensive computation or large memory requirements often become the bottleneck of AI. This paper proposes the singular value decomposition (SVD) low-rank approximation (LRA) method applied to the CNN model. By exploiting the fact that redundancy exists between different channels and filters, the SVD matrix decomposition is used to estimate the most informative parameters in deep CNNs, and by reducing the convolutional layer parameters in this way, a special structure of the convolutional layers is designed to accelerate the trained neural network, to control the accuracy degradation within 2% but to greatly reduce the data storage and the number of operations. The design process is based on algorithm/architecture co-design, and the analysis of the number of operations and data storage.

Original languageEnglish
Title of host publicationProceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1304-1307
Number of pages4
ISBN (Electronic)9786165904773
DOIs
Publication statusPublished - 2022
Event2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022 - Chiang Mai, Thailand
Duration: 2022 Nov 72022 Nov 10

Publication series

NameProceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022

Conference

Conference2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022
Country/TerritoryThailand
CityChiang Mai
Period22-11-0722-11-10

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Information Systems
  • Signal Processing

Fingerprint

Dive into the research topics of 'Deep Learning Acceleration Design Based on Low Rank Approximation'. Together they form a unique fingerprint.

Cite this