基於深度學習之視覺定位-以故宮南院為例

Chia Hao Tu, Eric Hsueh Chan Lu

研究成果: Article同行評審

摘要

Visual localization uses images to regress camera position and orientation. It has many applications in computer vision such as autonomous driving, augmented reality (AR) and virtual reality (VR), and so on. The convolutional neural network simulates biological vision and has a good image feature extraction ability, so using it in visual localization can improve regression accuracy. Although our team has built an image indoor localization model for Southern Branch of the National Palace Museum, this paper tries to use new network and loss function to achieve better positioning accuracy. In this paper, we use ResNet-50 as backbone network, and change the output layer to 3-dimensional position and 4-dimensional orientation quaternion, and use learnable weights loss function. We compare different pretrained models and normalization methods, and the best result improves the positioning accuracy by about 60%.

貢獻的翻譯標題Visual Localization Based on Deep Learning ⎯ Take Southern Branch of the National Palace Museum for Example
原文???core.languages.zh_TW???
頁(從 - 到)215-220
頁數6
期刊Journal of the Chinese Institute of Civil and Hydraulic Engineering
34
發行號3
DOIs
出版狀態Published - 2022 5月

All Science Journal Classification (ASJC) codes

  • 土木與結構工程

指紋

深入研究「基於深度學習之視覺定位-以故宮南院為例」主題。共同形成了獨特的指紋。

引用此