ONNC: A Compilation Framework Connecting ONNX to Proprietary Deep Learning Accelerators

Wei Fen Lin, Der Yu Tsai, Luba Tang, Cheng Tao Hsieh, Cheng Yi Chou, Ping Hao Chang, Luis Hsu

Research output: Conference contribution

20 Citations (Scopus)

Abstract

This paper presents ONNC (Open Neural Network Compiler), a retargetable compilation framework designed to connect ONNX (Open Neural Network Exchange) models to proprietary deep learning accelerators (DLAs). The intermediate representations (IRs) of ONNC have a one-to-one mapping to ONNX IRs, which makes porting ONNC to proprietary DLAs much simpler than porting other compilation frameworks such as TVM and Glow, especially for hardware with coarse-grained operators that are not part of the generic IRs in the LLVM backend. ONNC also has a flexible pass manager designed to support compiler optimizations at all levels. A Docker image of ONNC bundled with a Vanilla backend is released with this paper to enable fast porting to new hardware targets. To illustrate how an ONNC-based toolkit guides our research and development in DLA design, we present a case study on compiler optimizations for activation memory consumption. The study shows that the Best-Fit algorithm with a proposed heuristic and a reordering scheme may act as a near-optimal strategy, bringing the memory consumption close to the ideal lower bound in 11 of 12 models from the ONNX model zoo. To the best of our knowledge, ONNC is the first open-source compilation framework that is specially designed to support ONNX-based models for both commercial and research projects for deep learning applications.
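
To make the memory case study concrete, the following is a minimal sketch of generic best-fit offset assignment for activation tensors with known live ranges. It is not the paper's implementation: the heuristic and the reordering scheme mentioned in the abstract are not described here, so this sketch simply processes tensors in the order given. The `Tensor` type, the `best_fit_allocate` function, and the example sizes are all hypothetical.

```python
from typing import NamedTuple


class Tensor(NamedTuple):
    name: str
    size: int   # bytes needed by the activation
    start: int  # first step at which the tensor is live
    end: int    # last step at which the tensor is live (inclusive)


def best_fit_allocate(tensors):
    """Assign a buffer offset to each tensor; return (offsets, peak_bytes)."""
    placed = []   # (offset, tensor) pairs already assigned
    offsets = {}
    for t in tensors:
        # Address ranges of tensors that are live at the same time as t.
        busy = sorted(
            (off, off + p.size)
            for off, p in placed
            if p.start <= t.end and t.start <= p.end
        )
        best_off, best_gap = None, None
        cursor = 0
        for lo, hi in busy:
            gap = lo - cursor
            # Best fit: keep the smallest free gap that still holds t.
            if gap >= t.size and (best_gap is None or gap < best_gap):
                best_off, best_gap = cursor, gap
            cursor = max(cursor, hi)
        if best_off is None:
            # No gap fits: place t just above the highest conflicting block.
            best_off = cursor
        offsets[t.name] = best_off
        placed.append((best_off, t))
    peak = max((offsets[t.name] + t.size for t in tensors), default=0)
    return offsets, peak


# Hypothetical four-layer chain: each activation is live until its consumer runs.
chain = [
    Tensor("a", 4, 0, 1),
    Tensor("b", 2, 1, 2),
    Tensor("c", 3, 2, 3),
    Tensor("d", 1, 3, 4),
]
offsets, peak = best_fit_allocate(chain)
```

In this example the peak equals the largest sum of simultaneously live tensors (`a` and `b`), which is the kind of ideal lower bound the abstract refers to; without reuse the four buffers would need the sum of all sizes.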

Original language: English
Title of host publication: Proceedings 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 214-218
Number of pages: 5
ISBN (Electronic): 9781538678848
Publication status: Published - Mar 2019
Event: 1st IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019 - Hsinchu, Taiwan
Duration: 18 Mar 2019 to 20 Mar 2019

Publication series

Name: Proceedings 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019

Conference

Conference: 1st IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2019
Country/Territory: Taiwan
City: Hsinchu
Period: 2019-03-18 to 2019-03-20

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Hardware and Architecture
  • Electrical and Electronic Engineering
