Deep Learning Acceleration Design Based on Low Rank Approximation

Yi Hsiang Chang, Gwo Giun Chris Lee, Shiu Yu Chen

研究成果: Conference contribution

1 引文 斯高帕斯(Scopus)

摘要

Recently, artificial intelligence applications require large resources for training and inferencing. Therefore, intensive computation or large memory requirements often become the bottleneck of AI. This paper proposes the singular value decomposition (SVD) low-rank approximation (LRA) method applied to the CNN model. By exploiting the fact that redundancy exists between different channels and filters, the SVD matrix decomposition is used to estimate the most informative parameters in deep CNNs, and by reducing the convolutional layer parameters in this way, a special structure of the convolutional layers is designed to accelerate the trained neural network, to control the accuracy degradation within 2% but to greatly reduce the data storage and the number of operations. The design process is based on algorithm/architecture co-design, and the analysis of the number of operations and data storage.

原文English
主出版物標題Proceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022
發行者Institute of Electrical and Electronics Engineers Inc.
頁面1304-1307
頁數4
ISBN(電子)9786165904773
DOIs
出版狀態Published - 2022
事件2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022 - Chiang Mai, Thailand
持續時間: 2022 11月 72022 11月 10

出版系列

名字Proceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022

Conference

Conference2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022
國家/地區Thailand
城市Chiang Mai
期間22-11-0722-11-10

All Science Journal Classification (ASJC) codes

  • 電腦網路與通信
  • 資訊系統
  • 訊號處理

指紋

深入研究「Deep Learning Acceleration Design Based on Low Rank Approximation」主題。共同形成了獨特的指紋。

引用此